Check for repeated client counts in new_records in Fennec dashboard data

RESOLVED FIXED

Status

Cloud Services
Metrics: Pipeline
P1
normal
a year ago
a year ago

People

(Reporter: gfritzsche, Assigned: gfritzsche)

Tracking

Firefox Tracking Flags

(Not tracked)

Details

(Whiteboard: [measurement:client])

Description

a year ago
One hypothesis for d0/.../d30 being off in bug 1284932 is that we count clients in these more than once.

We need to confirm this with an analysis.

Comment 1

a year ago
This re:dash query keeps getting killed:
https://sql.telemetry.mozilla.org/queries/767
Blake says this query pulls gigabytes of data and gets killed by the OOM watcher.

I'll have to move this to some kind of Spark job.

Updated

a year ago
Assignee: nobody → gfritzsche

Comment 2

a year ago
OK, I ran an analysis on this and that's definitely not the problem:
https://gist.github.com/georgf/0ffe4f915861be180909037a7204d7b9

There are only 4 clients for which "profiledate == submissiondate" was true for more than one day.
Furthermore, there are only 6509 clients who ever submitted more than one profiledate value.
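
For anyone who can't open the notebook, the two checks above can be sketched in plain Python. This is not the code from the gist, just a rough illustration of the counting logic. It assumes each record is a (client_id, profiledate, submissiondate) tuple with dates as comparable strings. The client_id field name, the function name and the sample data are all made up for the example.

```python
from collections import defaultdict


def repeated_client_counts(records):
    """Return (counted_twice, multi_profiledate) for an iterable of
    (client_id, profiledate, submissiondate) tuples.

    The tuple layout and the name client_id are assumptions for this sketch.
    """
    new_record_days = defaultdict(set)  # client -> days where profiledate == submissiondate
    profile_dates = defaultdict(set)    # client -> distinct profiledate values seen

    for client_id, profiledate, submissiondate in records:
        profile_dates[client_id].add(profiledate)
        if profiledate == submissiondate:
            new_record_days[client_id].add(submissiondate)

    # Clients that would show up as "new" on more than one day.
    counted_twice = sorted(
        c for c, days in new_record_days.items() if len(days) > 1
    )
    # Clients that ever submitted more than one profiledate value.
    multi_profiledate = sorted(
        c for c, dates in profile_dates.items() if len(dates) > 1
    )
    return counted_twice, multi_profiledate


# Made-up sample data, only to show the shape of the input.
sample = [
    ("a", "20160701", "20160701"),
    ("a", "20160701", "20160702"),
    ("b", "20160701", "20160701"),
    ("b", "20160702", "20160702"),
    ("c", "20160703", "20160704"),
    ("c", "20160705", "20160706"),
]
counted_twice, multi_profiledate = repeated_client_counts(sample)
print(counted_twice, multi_profiledate)
```

In the sample, only client "b" would be counted as new on two different days, while both "b" and "c" report more than one profiledate. On the real data set the same grouping would have to run as a Spark job, since the data is too large for a single process.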

This was my first time using sqlContext though - Mauro, do you think you could give this notebook a quick review?
Flags: needinfo?(mdoglio)

Comment 3

a year ago
Per IRC, Mauro found the analysis ok.
Status: NEW → RESOLVED
Last Resolved: a year ago
Flags: needinfo?(mdoglio)
Resolution: --- → FIXED