Closed Bug 1157734 Opened 9 years ago Closed 9 years ago

Wrong "type" submitted for aborted-session-ping payloads

Categories

(Toolkit :: Telemetry, defect, P3)

defect

Tracking

()

RESOLVED WORKSFORME
Tracking Status
firefox40 --- affected

People

(Reporter: mreid, Unassigned)

References

Details

(Whiteboard: [unifiedTelemetry] [measurement:client])

Attachments

(2 files)

The "type" in the ping appears to include the full path to the ping file.
Example from Mark:

{"type":"C:\\Users\\**snip**.default\\datareporting\\aborted-session-ping","id":"abe7eddd-cb3e-43ca-b74a-488fc11e941c","creationDate":"2015-04-21T09:12:45.764Z","version":4,"application":{"architecture":"x86-64","buildId":"20150420030204","name":"Firefox","version":"40.0a1","vendor":"Mozilla","platformVersion":"40.0a1","xpcomAbi":"x86_64-msvc","channel":"nightly"},"payload":true}
EnvVersion: 1
Severity: 7
Fields: [name:"submissionDate" value_string:"20150422"  name:"appVersion" value_string:"40.0a1"  name:"Host" value_string:"incoming.telemetry.mozilla.org"  name:"appUpdateChannel" value_string:"nightly"  name:"sourceVersion" value_string:"4"  name:"creationTimestamp" value_type:DOUBLE value_double:1.429607565764e+18  name:"docType" value_string:"C:\\Users\\**snip**.default\\datareporting\\aborted-session-ping"  name:"appBuildId" value_string:"20150420030204"  name:"geoCountry" value_string:"SE"  name:"appVendor" value_string:"Mozilla"  name:"documentId" value_string:"abe7eddd-cb3e-43ca-b74a-488fc11e941c"  name:"sourceName" value_string:"telemetry"  name:"appName" value_string:"Firefox" ]
Does this only happen for Windows users?
Flags: needinfo?(mreid)
Does it matter? Sounds like we can fix it regardless.
Indeed.
Flags: needinfo?(mreid)
Mark, does this still happen on latest nightlies?
Flags: needinfo?(mreid)
:Dexter, here is a monitor that's counting doc types for nightly builds on/after May 1st:

https://pipeline-prototype-cep.prod.mozaws.net/#plugins/filters/PrototypeSandbox-mreid_CountRecentByDocType
Flags: needinfo?(mreid)
Looks like this is still happening (one bad type so far since I added the monitoring yesterday).
I suspect the one remaining bad type is from a custom build, which (despite a recent buildid) probably does not include your recent telemetry changes.

Should we be seeing actual submissions with type = "aborted-session-ping"?
Flags: needinfo?(alessio.placitelli)
No, we shouldn't be seeing submissions with an "aborted-session-ping" type. Aborted session pings must have a "main" ping type with an "aborted-session" reason.
Flags: needinfo?(alessio.placitelli)
Mark, would you kindly extend the monitor to consider a 7 days history? Thanks!
Flags: needinfo?(mreid)
Done!
Flags: needinfo?(mreid)
Mark, would you kindly provide us with some sample pings affected by this issue?
Flags: needinfo?(mreid)
Mark, could you give us the source to this monitor? Or filter out the v4 payloads that don't have the |environment.settings.defaultSearchEngine| property?
(In reply to Alessio Placitelli [:Dexter] from comment #15)
> Mark, could you give us the source to this monitor? Or filter out the v4
> payloads that don't have the |environment.settings.defaultSearchEngine|
> property?

The idea being that we want to confirm whether this is really old source builds that are submitting that data.
(In reply to Mark Reid [:mreid] from comment #8)
> I suspect the one remaining bad type is from a custom build, which (despite
> a recent buildid) probably does not include your recent telemetry changes.

Does this monitor filter on the "nightly" channel too?
Per the discussion here: https://mail.mozilla.org/pipermail/fhr-dev/2015-June/000483.html
... that should probably be sufficient.
(In reply to Georg Fritzsche [:gfritzsche] from comment #17)
> (In reply to Mark Reid [:mreid] from comment #8)
> > I suspect the one remaining bad type is from a custom build, which (despite
> > a recent buildid) probably does not include your recent telemetry changes.
> 
> Does this monitor filter on the "nightly" channel too?
> Per the discussion here:
> https://mail.mozilla.org/pipermail/fhr-dev/2015-June/000483.html
> ... that should probably be sufficient.
That monitor is filtering for only "nightly".
Configuration for the heka monitor
Code for the monitor
Some recent examples sent via email.
Flags: needinfo?(mreid)
Checking the monitor we don't see any recent submissions with this issue.

I also checked on Kibana with this filter to see if there was recent submissions:
-clientId:* AND appName:Firefox AND appUpdateChannel:nightly AND appBuildId:201505*
... but we don't seem to have much data (yet?) for this month, so that doesn't seem useful.
We haven't seen any recent submissions coming up, so closing this for now.
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → WORKSFORME
Whiteboard: [unifiedTelemetry]
We're still seeing wacky data in Telemetry fields, like integer clientIDs when they should be GUID strings. This could be a sign of a JS engine bug or GC bug.

I think the first step is to try to reproduce this in a controlled environment. 

Georg, what do you think about "stress-testing" telemetry by changing the thresholds to create pings every 5 mins, leaving it running for a day, then validating the resulting pings? Maybe Sam could help with this?
Flags: needinfo?(gfritzsche)
Flags: needinfo?(gfritzsche)
Priority: -- → P3
Whiteboard: [unifiedTelemetry] → [unifiedTelemetry] [measurement:client]
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: