Closed Bug 1294735 Opened 8 years ago Closed 8 years ago

Parquet2hive should read schema from the most recently generated file

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: rvitillo, Assigned: frank)

User Story

In order to support schema evolution (e.g. adding a new nullable column), parquet2hive should load the schema from the most recent file in a dataset.
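
The selection rule described in the user story can be sketched roughly as follows. This is only an illustration under stated assumptions, not the parquet2hive implementation: `DatasetFile`, `most_recent_file`, `schema_for_dataset`, and the `read_schema` callback are hypothetical names, and "most recent" is assumed to mean the latest modification timestamp among files whose key ends in `.parquet`.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class DatasetFile:
    """Hypothetical record describing one file in a dataset listing."""
    key: str
    last_modified: datetime


def most_recent_file(files: Iterable[DatasetFile]) -> DatasetFile:
    """Return the Parquet file with the latest modification time."""
    # Assumption: marker files such as _SUCCESS carry no schema, so skip them.
    candidates = [f for f in files if f.key.endswith(".parquet")]
    if not candidates:
        raise ValueError("dataset contains no Parquet files")
    return max(candidates, key=lambda f: f.last_modified)


def schema_for_dataset(
    files: Iterable[DatasetFile],
    read_schema: Callable[[str], List[str]],
) -> List[str]:
    """Read the schema from the newest file only.

    read_schema is a caller-supplied function (hypothetical here) that
    returns the column names stored in the Parquet file at the given key.
    """
    return read_schema(most_recent_file(files).key)
```

With this rule, a nullable column added in newer files appears in the resulting schema, while older files that lack the column would simply yield nulls for it.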

Roberto Agostino Vitillo (:rvitillo)

Reporter

Description

•

8 years ago

      No description provided.

Roberto Agostino Vitillo (:rvitillo)

Reporter

Updated

•

8 years ago

Assignee: nobody → fbertsch

Points: --- → 2

Priority: -- → P2

Frank Bertsch [:frank]

Assignee

Updated

•

8 years ago

Status: NEW → ASSIGNED

Frank Bertsch [:frank]

Assignee

Comment 1

•

8 years ago

Created pull request: https://github.com/mozilla/parquet2hive/pull/12

Note that https://github.com/mozilla/parquet2hive/pull/10 must be merged before this pull request can be merged.

Frank Bertsch [:frank]

Assignee

Updated

•

8 years ago

Status: ASSIGNED → RESOLVED

Closed: 8 years ago

Resolution: --- → FIXED

BMO Automation

Updated

•

6 years ago

Product: Cloud Services → Cloud Services Graveyard

Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect, P2)

Security

(public)
