Parquet2hive should read schema from the most recently generated file

RESOLVED FIXED

Status

P2
normal
RESOLVED FIXED
2 years ago
4 days ago

People

(Reporter: rvitillo, Assigned: frank)

Tracking

Firefox Tracking Flags

(Not tracked)

Details

User Story

In order to support schema evolution (e.g. adding a new nullable column), parquet2hive should load the schema from the most recent file in a dataset.
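
A minimal sketch of the selection step described above, not parquet2hive's actual implementation. It assumes the dataset listing is already available as (key, last-modified) pairs and that "most recent" means the latest modification time. The `DatasetFile` type and `most_recent_parquet_file` function are hypothetical names used only for illustration.

```python
from datetime import datetime, timezone
from typing import Iterable, NamedTuple, Optional


class DatasetFile(NamedTuple):
    """Hypothetical record for one object in a dataset listing."""
    key: str
    last_modified: datetime


def most_recent_parquet_file(files: Iterable[DatasetFile]) -> Optional[DatasetFile]:
    """Return the newest .parquet file by modification time, or None if there is none."""
    parquet_files = [f for f in files if f.key.endswith(".parquet")]
    if not parquet_files:
        return None
    return max(parquet_files, key=lambda f: f.last_modified)


# Made-up listing: the second file is newer, so its schema (which may
# include a newly added nullable column) is the one that would be read.
listing = [
    DatasetFile("v1/submission_date=20160101/part-0.parquet",
                datetime(2016, 1, 1, tzinfo=timezone.utc)),
    DatasetFile("v1/submission_date=20160102/part-0.parquet",
                datetime(2016, 1, 2, tzinfo=timezone.utc)),
    DatasetFile("v1/_SUCCESS", datetime(2016, 1, 3, tzinfo=timezone.utc)),
]
newest = most_recent_parquet_file(listing)
```

Reading the schema from the newest file means a column added later is picked up, while older files that lack it can still be read with that column as null.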
(Reporter)

Updated

2 years ago
Assignee: nobody → fbertsch
Points: --- → 2
Priority: -- → P2
(Assignee)

Updated

2 years ago
Status: NEW → ASSIGNED
(Assignee)

Comment 1

2 years ago
Created pull request: https://github.com/mozilla/parquet2hive/pull/12

Note that https://github.com/mozilla/parquet2hive/pull/10 must be merged before the new pull request can be merged.
(Assignee)

Updated

2 years ago
Status: ASSIGNED → RESOLVED
Last Resolved: 2 years ago
Resolution: --- → FIXED

Updated

4 days ago
Product: Cloud Services → Cloud Services Graveyard