Closed
Bug 1485152
Opened 6 years ago
Closed 6 years ago
Add `active_addons` to clients_daily to remove Longitudinal dependency for TAAR
Categories
(Data Platform and Tools :: General, enhancement, P2)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: bugzilla, Assigned: bugzilla)
References
Details
(Whiteboard: [DataPlatform])
Attachments
(1 file)
TAAR currently has three jobs that depend on longitudinal: taar_locale and taar_similarity in python_mozetl, and addon_recommender in telemetry-batch-view. In all of these cases, the job only takes the most recent ping from each client. All three would also benefit from fresher data: longitudinal covers clients from a single sample_id over six months, so if we can get an equivalent number of clients by using more sample_ids over a shorter period, the recommendations would probably fare better.
Given this and the high likelihood that longitudinal will be deprecated in the near future, switching the upstream dataset seems wise. clients_daily would work pretty well for this purpose except it's missing a few fields that TAAR needs (most obviously, the full active_addons column).
I'm going to NI Victor so he can let us know what other fields the code relies on.
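To make the idea concrete, here is a minimal sketch of "most recent record per client, from several sample_ids over a short window". It uses plain Python over in-memory dicts rather than the real Spark jobs, and everything except the `sample_id` and `active_addons` names is an assumption for illustration: the `client_id` and `submission_date` field names, the `latest_row_per_client` helper, and the sample rows are all hypothetical.

```python
from datetime import date


def latest_row_per_client(rows, sample_ids, start, end):
    """Keep each client's most recent row within [start, end].

    Only rows whose sample_id is in `sample_ids` are considered.
    Field names are assumptions, not the confirmed clients_daily schema.
    """
    latest = {}
    for row in rows:
        if row["sample_id"] not in sample_ids:
            continue
        if not (start <= row["submission_date"] <= end):
            continue
        current = latest.get(row["client_id"])
        if current is None or row["submission_date"] > current["submission_date"]:
            latest[row["client_id"]] = row
    return latest


# Hypothetical rows standing in for clients_daily records.
rows = [
    {"client_id": "a", "sample_id": 1, "submission_date": date(2018, 8, 1), "active_addons": ["x"]},
    {"client_id": "a", "sample_id": 1, "submission_date": date(2018, 8, 3), "active_addons": ["x", "y"]},
    {"client_id": "b", "sample_id": 7, "submission_date": date(2018, 8, 2), "active_addons": []},
    {"client_id": "c", "sample_id": 42, "submission_date": date(2018, 8, 2), "active_addons": ["z"]},
]

result = latest_row_per_client(
    rows, sample_ids={1, 7}, start=date(2018, 8, 1), end=date(2018, 8, 31)
)
```

Widening `sample_ids` while narrowing the date window is the trade-off described above: roughly the same number of clients, but with more recent add-on lists.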
Updated•6 years ago
Points: --- → 2
Priority: -- → P2
Comment 1•6 years ago
taar_locale could easily run off of clients_daily with the active_addons column added.
taar_similarity also needs just the active_addons column added.
AddonRecommender (for collaborative recommendations) could also be ported to use clients_daily if we had active_addons added as a column.
TADAR (formerly TAAR-lite) could also be ported if we had active_addons added.
I don't think we need any columns other than active_addons added for TAAR or TAAR-lite.
Flags: needinfo?(vng)
Summary: Add fields to clients_daily to remove Longitudinal dependency for TAAR → Add `active_addons` to clients_daily to remove Longitudinal dependency for TAAR
Whiteboard: [DataPlatform]
Comment 3•6 years ago
I just did a quick spot check on the last day's clients_daily run: the column is in place and the data I checked looks good.
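A spot check of this kind could look roughly like the sketch below. It is not the check that was actually run (which is not recorded here); it is plain Python over a hypothetical list of row dicts, and the `spot_check_column` helper and sample rows are assumptions for illustration.

```python
def spot_check_column(rows, column="active_addons"):
    """Count how many rows carry the column and how many have a non-empty value."""
    return {
        "total": len(rows),
        "has_column": sum(1 for r in rows if column in r),
        "non_empty": sum(1 for r in rows if r.get(column)),
    }


# Hypothetical sample of one day's rows.
sample = [
    {"client_id": "a", "active_addons": ["x", "y"]},
    {"client_id": "b", "active_addons": []},
    {"client_id": "c", "active_addons": ["z"]},
]

summary = spot_check_column(sample)
```

If `has_column` equals `total`, the column is in place for every sampled row; `non_empty` gives a rough sense of whether the values are actually populated.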
Status: NEW → RESOLVED
Closed: 6 years ago
Resolution: --- → FIXED
Updated•2 years ago
Component: Datasets: General → General