Closed Bug 1485152 Opened 6 years ago Closed 6 years ago

Add `active_addons` to clients_daily to remove Longitudinal dependency for TAAR

Categories

(Data Platform and Tools :: General, enhancement, P2)

enhancement
Points:
2

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: bugzilla, Assigned: bugzilla)

References

Details

(Whiteboard: [DataPlatform])

Attachments

(1 file)

TAAR currently has three jobs that depend on longitudinal: taar_locale and taar_similarity in python_mozetl, and addon_recommender in telemetry-batch-view. In all of these cases, the job only takes the most recent ping from a client. They all also benefit from fresher data -- that is, while longitudinal is clients from 1 sample_id over six months, if we can get an equivalent number of clients by using more sample_ids over a shorter period the recommendations would probably fare better. Given this and the high likelihood that longitudinal will be deprecated in the near future, switching the upstream dataset seems wise. clients_daily would work pretty well for this purpose except it's missing a few fields that TAAR needs (most obviously, the full active_addons column). I'm going to NI Victor so he can let us know what other fields the code relies on.
Flags: needinfo?(vng)
Points: --- → 2
Priority: -- → P2
taar_locale could easily run off of clients_daily with the active_addons column added. taar_similarity also needs just the active_addons column added. AddonRecommender (for collaborative recommendations) could also be ported to use clients_daily if we had active_addons added as a column. TADAR (formerly TAAR-lite) could also be ported if we had active_addons added. I don't think we need any other columns added for TAAR or TAARlite other than active_addons.
Flags: needinfo?(vng)
Summary: Add fields to clients_daily to remove Longitudinal dependency for TAAR → Add `active_addons` to clients_daily to remove Longitudinal dependency for TAAR
Whiteboard: [DataPlatform]
Assignee: nobody → ssuh
I just did a quick spot check on the last day's clients_daily run and the column is in place and the data I checked looks good
Status: NEW → RESOLVED
Closed: 6 years ago
Resolution: --- → FIXED
Component: Datasets: General → General
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: