Closed Bug 1294735 Opened 8 years ago Closed 8 years ago

Parquet2hive should read schema from the most recently generated file

Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect, P2)

defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: rvitillo, Assigned: frank)

References

Details

User Story

In order to support schema evolution (e.g. adding a new nullable column), parquet2hive should load the schema from the most recent file in a dataset.
      No description provided.
Assignee: nobody → fbertsch
Points: --- → 2
Priority: -- → P2
Status: NEW → ASSIGNED
Created pull request: https://github.com/mozilla/parquet2hive/pull/12

Note that https://github.com/mozilla/parquet2hive/pull/10 will need to be merged first before the new pull request will be able to be merged.
Status: ASSIGNED → RESOLVED
Closed: 8 years ago
Resolution: --- → FIXED
Product: Cloud Services → Cloud Services Graveyard
You need to log in before you can comment on or make changes to this bug.