Closed Bug 1321414 Opened 9 years ago Closed 9 years ago

Create v2 desktop churn/retention dataset in telemetry-parquet

Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect, P1)

defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: amiyaguchi, Assigned: amiyaguchi)

References

Details

Attachments

(2 files)

The churn notebook currently dumps gzipped data to s3 in two separate buckets [1][2]. In order to make the data eventually available to re:dash, it should be stored in the telemetry-parquet bucket as a parquet dataset. There is already a churn dataset located in `telemetry-parquet/churn/v1`, corresponding to a view generated via telemetry-batch-view. These two datasets are not necessarily the same, but I don't see any harm in storing this as `telemetry-parquet/churn/v2`. [1] s3://net-mozaws-prod-us-west-2-pipeline-analysis/mreid/churn/ [2] s3://net-mozaws-prod-metrics-data/telemetry-churn/
Blocks: 1311816
Assignee: nobody → amiyaguchi
Points: --- → 2
Priority: -- → P1
Attached file Github pull request
The attached pull request will contain the relevant changes for this bug.
The churn notebook now lives in `mozilla/mozilla-reports/etl/churn.kp`. It has been modified to read the current date from the environment, and will upload parquet to `s3://telemetry-parquet/churn/v2`.
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
Product: Cloud Services → Cloud Services Graveyard
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: