Closed
Bug 1347246
Opened 8 years ago
Closed 8 years ago
[emr-bootstrap-presto] Set hive configuration hive.parquet.use-column-names=true
Categories
(Data Platform and Tools Graveyard :: Presto, enhancement, P1)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: amiyaguchi, Assigned: robotblake)
References
Details
(Whiteboard: [SvcOps])
The parquet data in our data lake is stored using schema evolution. The vanilla Presto configuration on EMR will default to using offsets unless ` hive.parquet.use-column-names=true` is specified. This configuration is documented in [1] and should be reflected in the emr-bootstrap-presto[2] repository.
[1] https://github.com/fbertsch/schema_evolution_exploration
[2] https://github.com/mozilla/emr-bootstrap-presto
Updated•8 years ago
|
Assignee: nobody → jthomas
Priority: -- → P1
Whiteboard: [SvcOps]
| Assignee | ||
Comment 2•8 years ago
|
||
This has been added to the configs on the new cluster.
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → FIXED
Updated•5 years ago
|
Product: Data Platform and Tools → Data Platform and Tools Graveyard
You need to log in
before you can comment on or make changes to this bug.
Description
•