Closed Bug 1045076 Opened 10 years ago Closed 10 years ago

Launch events occasionally incorrectly output timestamp as delta

Categories

(Firefox OS Graveyard :: Gaia::PerformanceTest, defect, P1)

ARM
Gonk (Firefox OS)
defect

Tracking

(b2g-v2.0 fixed, b2g-v2.1 fixed)

RESOLVED FIXED
2.1 S1 (1aug)
Tracking Status
b2g-v2.0 --- fixed
b2g-v2.1 --- fixed

People

(Reporter: Eli, Assigned: Eli)

References

Details

(Keywords: perf, Whiteboard: [c=automation p=2 s= u=])

Attachments

(1 file)

The test durations for some of the launch events are occasionally outputting values that are equivalent to a timestamp instead of using that as a delta between dates. This is visualized in Datazilla [1] as extreme chart spiking, but for a seemingly unrelated issue as was resolved in bug 1037520.

[1] https://datazilla.mozilla.org/b2g/?branch=master&device=flame&range=7&test=startup_%3E_moz-app-visually-complete&app_list=calendar,camera,communications/contacts,communications/dialer,costcontrol,email%20FTU,fm,gallery,settings,sms,video&app=costcontrol&gaia_rev=a5f93c8ba175ae83&gecko_rev=4dc57781185d&plot=avg
Status: NEW → ASSIGNED
Jonathan, do you know why the values for costcontrol at this spike point [1] are not visible in the build revision at [2]? All the values in the runs chart appear normal as do the values output from make test-perf, but for some reason there are generated values for average which appear in the large chart, and I do not see where they come from.

[1] http://mzl.la/1AuqA6K
[2] http://jenkins1.qa.scl3.mozilla.com/job/flame.b2g-inbound.perf.gaia/848/artifact/perf.json
Flags: needinfo?(jeads)
The spike values are in the replicates for costcontrol in the testrun.date = 1406511590 in the two json structures returned by the web service call below:

https://datazilla.mozilla.org/b2g/refdata/objectstore/json_blob/revisions?branch=master&gaia_revision=a5f93c8ba175ae83fac6b7740a8414459fbd7a5b&gecko_revision=4dc57781185d&test_id=22&test_type=startup_%3E_moz-app-visually-complete

There's an additional test run available with an identical revision set that does not contain the spikes. The revisions in the two structures are identical.

gaia_revision a5f93c8ba175ae83fac6b7740a8414459fbd7a5b
gecko_revision 4dc57781185d
build_revision 3aa6abd313f965a84aa86c6b213dc154e4875139

The second structure submitted has a testrun.date = 1406511841, the replicates displayed are for the last testrun but the average displayed on the main chart is across both of the testruns because all of the revisions and test/app reference data are identical. So, the "GROUP BY" in SQL is matching the revisions when the mean is calculated https://github.com/mozilla/datazilla/blob/master/datazilla/model/sql/perftest.json#L643.

Not sure why the replicate chart is not showing both sets of replicates, that would be much less misleading.
Flags: needinfo?(jeads)
Added sanity check for instances where the timestamp may not have registered correctly or we fail to retrieve values.
Attachment #8464131 - Flags: review?(hub)
Comment on attachment 8464131 [details] [review]
Link to Github pull-request: https://github.com/mozilla-b2g/gaia/pull/22272

With the patches from bug 1037520 now in place, we need this safeguard to go along with it to prevent incorrect timestamp values from being injected into the data that is visualized in Datazilla.

[Bug caused by] (feature/regressing bug #): bug 1037520
[User impact] if declined: none
[Testing completed]: yes
[Risk to taking this patch] (and alternatives if risky): no user-facing risk
[String changes made]: n/a
Attachment #8464131 - Flags: approval-gaia-v2.0?
Attachment #8464131 - Flags: review?(hub) → review+
will merge in a bit. Just want to make sure TBPL turns green with the intermitents.
Shall we leave this open so that the actual problem gets fixed?
Flags: needinfo?(eperelman)
Hey Hub, I've had the worst luck trying to replicate this issue locally. I've used a Flame with 512MB, 273MB, and 319MB, doing runs of 1, 5, 10, and 30 and could not get this issue to replicate. The safeguard I have in this patch reflects that I don't know what the root cause was, but can at least prevent against it. If we want to dive into this issue deeper, I can do it when I get back from PTO.
Flags: needinfo?(eperelman)
Attachment #8464131 - Flags: approval-gaia-v2.0? → approval-gaia-v2.0+
See Also: → 1049129
Filed 1049129 to dive into root cause of this issue. Safeguard patch in master gives current resolution.
Status: ASSIGNED → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Creator:
Created:
Updated:
Size: