Closed Bug 1347246 Opened 8 years ago Closed 8 years ago

[emr-bootstrap-presto] Set hive configuration hive.parquet.use-column-names=true

Categories

(Data Platform and Tools Graveyard :: Presto, enhancement, P1)

x86_64
Linux
enhancement

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: amiyaguchi, Assigned: robotblake)

References

Details

(Whiteboard: [SvcOps])

The Parquet data in our data lake is stored using schema evolution. The vanilla Presto configuration on EMR defaults to resolving Parquet columns by offset (ordinal position) unless `hive.parquet.use-column-names=true` is specified. This configuration is documented in [1] and should be reflected in the emr-bootstrap-presto [2] repository.
[1] https://github.com/fbertsch/schema_evolution_exploration
[2] https://github.com/mozilla/emr-bootstrap-presto
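To illustrate why offset-based resolution is a problem for evolved schemas, here is a minimal toy sketch in Python. It does not use Presto or Parquet; the column names, the row, and both helper functions are hypothetical and only model the difference between looking a column up by position and by name.

```python
# Toy model only: a file schema is a list of column names, a row is a list of values.
# All names below are hypothetical and chosen for illustration.

def read_by_position(table_columns, row):
    """Resolve each table column by its ordinal position in the file row."""
    return {
        name: (row[i] if i < len(row) else None)
        for i, name in enumerate(table_columns)
    }

def read_by_name(table_columns, file_columns, row):
    """Resolve each table column by looking up its name in the file schema."""
    by_name = dict(zip(file_columns, row))
    return {name: by_name.get(name) for name in table_columns}

# Table schema after evolution: "country" was added between two existing columns.
table_columns = ["client_id", "country", "os"]

# An older file written before "country" existed.
old_file_columns = ["client_id", "os"]
old_row = ["abc", "Linux"]

positional = read_by_position(table_columns, old_row)
named = read_by_name(table_columns, old_file_columns, old_row)

print(positional)  # "os" value lands in the "country" column
print(named)       # "country" is missing (None), "os" is read correctly
```

In this model the positional reader silently puts the `os` value into `country`, while the name-based reader returns a null for the column the old file does not contain.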
Assignee: nobody → jthomas
Priority: -- → P1
Whiteboard: [SvcOps]
Depends on: 1358232
This will be addressed as part of bug 1358232.
Assignee: jthomas → bimsland
This has been added to the configs on the new cluster.
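The bug does not record how the setting was applied. As a hedged sketch, one way to express it on EMR is through a configuration classification that targets the Presto Hive connector properties. The snippet below assumes the `presto-connector-hive` classification and uses a hypothetical helper name; it only builds and prints the JSON and does not talk to AWS.

```python
import json

def presto_hive_configuration(use_column_names=True):
    """Build an EMR-style configuration list (hypothetical helper).

    Assumes the "presto-connector-hive" classification maps to the
    Presto Hive connector properties on the cluster.
    """
    return [
        {
            "Classification": "presto-connector-hive",
            "Properties": {
                # EMR property values are strings, so emit "true"/"false".
                "hive.parquet.use-column-names": str(use_column_names).lower(),
            },
        }
    ]

print(json.dumps(presto_hive_configuration(), indent=2))
```

The resulting JSON could be supplied as cluster configuration at creation time; whether the new cluster was configured this way is not stated in this bug.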
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → FIXED
Product: Data Platform and Tools → Data Platform and Tools Graveyard