The bootstrap scripts should also set up redash and scan/load the Hive datasets stored in the parquet bucket.
See https://github.com/vitillo/emr-bootstrap-presto. I have tried airpal as well, but it doesn't seem to support arrays, maps and structs, which are used by our longitudinal dataset.
Status: NEW → RESOLVED
Last Resolved: 2 years ago
Resolution: --- → FIXED
Note that some changes to PyHive, the Python interface to Presto used by redash, were required to display structs correctly: https://github.com/vitillo/PyHive/commit/26a565c88e6efffbc1997dcee6630c43d166355e