Create a Parquet2Hive airflow operator

NEW
Unassigned

Status

Data Platform and Tools
Scheduling
P2
normal
6 months ago
6 months ago

People

(Reporter: amiyaguchi, Unassigned)

Tracking

Details

(Reporter)

Description

6 months ago
New parquet partitions are added to the hive metastore through stored parquet2hive commands in a cron table.

Adding a parquet2hive operator would remove the extra step of modifying the crontab on the p2h server, and make it easier to reason about scheduled data processing from execution to availability on redash.
(Reporter)

Updated

6 months ago
Points: --- → 2
Priority: -- → P2

Updated

6 months ago
Component: Metrics: Pipeline → Scheduling
Product: Cloud Services → Data Platform and Tools
You need to log in before you can comment on or make changes to this bug.