Closed Bug 1142543 Opened 9 years ago Closed 7 years ago

Many submissions from a few clientIds

Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect, P5)

defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: mreid, Unassigned)

References

Details

(Whiteboard: [fixed by bug 1143714?] [unifiedTelemetry][measurement:client])

I don't have a lot of information yet, but in the course of preparing a data export for bug 1140094, I noticed that there were two clients who each submitted more than 500MB of data (thousands of submissions).

One client submitted a bit more than 500 times per day from Feb. 27 to Mar. 3rd

Another submitted up to 2877 times per day between Mar 6 and Mar 8.
Just a quick update that I checked the "payload.info.reason" field for these two clients and for both clients, all but three submissions were "environment-change", and the others were "idle-daily".
Could we sample these reports on what the environment changes are?
Changes in addons or pref changes?
Maybe this is some automation setup?
In a lot of cases, there are consecutive pings with exactly the same environment.

One possibly interesting thing is that many of the pings have "subsessionLength":0

I will get you some sample data to look at.
Depends on: 1143714
Whiteboard: [fixed by bug 1143714?]
Mark, the environment changes landed (after some bouncing) on 2015-04-03.
Can you check whether the situation improved since then?
Flags: needinfo?(mreid)
Waiting on bug 1151839 for this. Please ni? after that is fixed.
Flags: needinfo?(mreid)
Depends on: 1151839
(In reply to Mark Reid [:mreid] from comment #5)
> Waiting on bug 1151839 for this. Please ni? after that is fixed.
Flags: needinfo?(mreid)
There have not been any clients submitting thousands of times per day in the past several days.  It still appears that a few clients are maxing out the 5-minute threshold (~280 submissions per day), but the throttling seems to have worked.
Flags: needinfo?(mreid)
We should still follow up with the clients that are hitting the threshold all the time. Can you send me a couple samples?
Flags: needinfo?(mreid)
I've uploaded daily samples for one such client to gdrive (shared "telemetry_data_validation" dir). The file is called client24.tar.gz, and the data format is the same as bug 1149666.

It contains that user's submissions for April 5th, 10th, and 15th.  The throttling fix appears to have kicked in either on Apr. 6 or 7.

Let me know if you'd like more / different data.
Flags: needinfo?(mreid)
Benjamin, did you have a chance to look at the samples?
Flags: needinfo?(benjamin)
Whiteboard: [fixed by bug 1143714?] → [fixed by bug 1143714?] [rC] [unifiedTelemetry]
No, I have not looked at this.
Flags: needinfo?(benjamin)
Blocks: 1122482
No longer blocks: 1120356
Priority: -- → P3
Whiteboard: [fixed by bug 1143714?] [rC] [unifiedTelemetry] → [fixed by bug 1143714?] [rC] [unifiedTelemetry][data-validation]
Points: --- → 2
Whiteboard: [fixed by bug 1143714?] [rC] [unifiedTelemetry][data-validation] → [fixed by bug 1143714?] [rC] [unifiedTelemetry][data-validation][measurement:client]
Just want to put a note in this bug for posterity: the issue of "many submissions from a few clients" may actually be masking many clients sharing a clientId (from copied machine images, etc).
We should be able to differentiate here by session chaining and activity date overlaps?
(In reply to Georg Fritzsche [:gfritzsche] from comment #13)
> We should be able to differentiate here by session chaining and activity
> date overlaps?

yep, that is correct.
Summary: Many submissions from a few clients → Many submissions from a few clientIds
Component: Telemetry → Metrics: Pipeline
Product: Toolkit → Cloud Services
Whiteboard: [fixed by bug 1143714?] [rC] [unifiedTelemetry][data-validation][measurement:client] → [fixed by bug 1143714?] [rC] [unifiedTelemetry][measurement:client]
Version: 39 Branch → other
Whiteboard: [fixed by bug 1143714?] [rC] [unifiedTelemetry][measurement:client] → [fixed by bug 1143714?] [unifiedTelemetry][measurement:client]
Priority: P3 → P4
Priority: P4 → P5
We have code to de-dupe in multiple places now.
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
Product: Cloud Services → Cloud Services Graveyard
You need to log in before you can comment on or make changes to this bug.