Closed Bug 1301386 Opened 8 years ago Closed 8 years ago

parquet2hive should load all datasets within a prefix

Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect, P2)

defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: rvitillo, Assigned: frank)

References

Details

User Story

It would be convenient if parquet2hive could load all datasets withing a prefix, like s3://telemetry-parquet.

We have talked in the past about having ETL jobs communicating directly with the Hive metastore. This solution would clearly have some lag but at the same time we wouldn't need to change our jobs to add some logic that keeps the metastore up-to-date.
No description provided.
Blocks: 1255752
Points: --- → 1
Priority: -- → P3
User Story: (updated)
Priority: P3 → P2
Assignee: nobody → fbertsch
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → FIXED
Product: Cloud Services → Cloud Services Graveyard
You need to log in before you can comment on or make changes to this bug.