Closed
Bug 1272124
Opened 9 years ago
Closed 9 years ago
2.25 - 7.73% cart / tp5o % Processor Time / tp5o_scroll e10s (windows7-32, windows8-64) regression on push b1c6143527c0 (Tue May 10 2016)
Categories
(Firefox :: Untriaged, defect, P2)
Tracking
()
RESOLVED
WONTFIX
Tracking | Status | |
---|---|---|
e10s | + | --- |
People
(Reporter: jmaher, Unassigned)
References
Details
(Keywords: perf, regression, Whiteboard: [talos_regression])
Talos has detected a Firefox performance regression from push b1c6143527c0:
https://hg.mozilla.org/integration/mozilla-inbound/pushloghtml?fromchange=c31a5ad159946c74a32c9ebedcaa8585593ff178&tochange=b1c6143527c0edd015cf9d890a2cb0d61c9aa053
As author of one of the patches included in that push, we need your help to address this regression.
This is a list of all known regressions and improvements related to the push:
https://treeherder.mozilla.org/perf.html#/alerts?id=1142
On the page above you can see an alert for each affected platform as well as a link to a graph showing the history of scores for this test. There is also a link to a treeherder page showing the Talos jobs in a pushlog format.
To learn more about the regressing test(s), please see:
https://wiki.mozilla.org/Buildbot/Talos/Tests#tp5
https://wiki.mozilla.org/Buildbot/Talos/Tests#tp5o_scroll
https://wiki.mozilla.org/Buildbot/Talos/Tests#TART.2FCART
Reproducing and debugging the regression:
If you would like to re-run this Talos test on a potential fix, use try with the following syntax:
try: -b o -p win32,win64 -u none -t tp5o[Windows 7,Windows 8],g1-e10s[Windows 7,Windows 8],svgr-e10s[Windows 7,Windows 8] --rebuild 5 # add "mozharness: --spsProfile" to generate profile data
(we suggest --rebuild 5 to be more confident in the results)
To run the test locally and do a more in-depth investigation, first set up a local Talos environment:
https://wiki.mozilla.lorg/Buildbot/Talos/Running#Running_locally_-_Source_Code
Then run the following command from the directory where you set up Talos:
talos --develop -e [path]/firefox -a tp5o:tp5o_scroll:cart --e10s
Making a decision:
As the patch author we need your feedback to help us handle this regression.
*** Please let us know your plans by Monday, or the offending patch(es) will be backed out! ***
Our wiki page outlines the common responses and expectations:
https://wiki.mozilla.org/Buildbot/Talos/RegressionBugsHandling
Reporter | ||
Comment 1•9 years ago
|
||
I did a lot of retriggers to find the root cause here- and a compare view to show the differences:
https://treeherder.mozilla.org/perf.html#/compare?originalProject=mozilla-inbound&originalRevision=c31a5ad15994&newProject=mozilla-inbound&newRevision=b1c6143527c0&framework=1&filter=tp5
:mattwoodrow, can you look at this and help determine if this is expected or what we might be able to do in order to reduce this regresssion?
Flags: needinfo?(matt.woodrow)
Comment 2•9 years ago
|
||
It's almost certainly this changeset: https://hg.mozilla.org/integration/mozilla-inbound/rev/666c10f1dd52
This makes us block the compositor until the previous frames work has been completed by the GPU.
This reduces the number of frames queued in the GPU pipeline, so should reduce latency to the screen and more accurately reflect what we actually want to measure.
I think given that these regressions are acceptable.
Flags: needinfo?(matt.woodrow)
Reporter | ||
Comment 3•9 years ago
|
||
Thanks Matt! I am fine marking this as resolved/wontfix. Sometimes :avih has a few opinions- lets see if he has anything to add.
Flags: needinfo?(avihpit)
Comment 4•9 years ago
|
||
Trying to figure out the exact regressions we're supposedly/reportedly having, because there's only that much info I can take from the title.
Looking at the alerts view at comment 0, and ignoring the striked-through lines (should I ignore them? if yes, it would be nice if there was an option to hide them), I can see the following regressions (and no improvements):
3% tp5o process time win7-32 opt
20% tp5o responsiveness win7-32 opt
10% tp5o summary win7-32 opt (and e10s too)
7% tp5o_scroll win8-64 (opt and pgo)
And I'm not seeing CART regressions there.
Looking at the compare view from comment 1 (with reasonable amount of retriggers), I see only two lines marked as meaningful:
2% tp5o processor time win7-32 opt e10s
7% tp5o_scroll win8-64 opt e10s
So what regressions are we reporting here exactly?
Flags: needinfo?(avihpit)
Comment 5•9 years ago
|
||
Also, the title currently is:
> 2.25 - 7.73% cart / tp5o % Processor Time / tp5o_scroll e10s (windows7-32, windows8-64)
But CART does not appear anywhere, and processor time appears the least of the regressions however you look at it and it appears right after CART, while responsiveness, which is supposedly the highest regression according to the alert doesn't even appear at the title...
tracking-e10s:
--- → ?
Updated•9 years ago
|
Priority: -- → P2
Reporter | ||
Comment 6•9 years ago
|
||
the regressions are listed nicely in the compare view, cart showed up because retriggers showed this having a difference:
https://treeherder.mozilla.org/perf.html#/graphs?series=%5Bmozilla-inbound,bf7cba00e2c43267bbfdca11eb8f1e7e50fb9f45,1,1%5D&zoom=1462902959640.0862,1462934012459.0518,36.86431226765799,39.07620817843866
It looks like tp5o % processor time (win7) and tp5o_scroll (win8) are the main two which are related to this change.
Comment 7•9 years ago
|
||
(In reply to Joel Maher (:jmaher) from comment #6)
> It looks like tp5o % processor time (win7) and tp5o_scroll (win8) are the
> main two which are related to this change.
Assuming the numbers I posted at comment 4 are correct, the processor time regression is 2% which is relatively small.
Which leaves us with 7% tp5o_scroll on windows 8-64 e10s opt (and possibly pgo - no numbers there) only, as far as I can tell. Do correct me if I'm wrong.
This, together with the fact that the change fixes racy behavior with some videos, makes me think we should accept it, unless Matt think it can still be improved.
Matt, please WONTFIX if you don't have plans to improve it. Thanks.
Flags: needinfo?(matt.woodrow)
Comment 8•9 years ago
|
||
> This, together with the fact that the change fixes racy behavior with some videos,
And also, after discussion with Matt, it seems the change mostly limits the composition rate, which while can affect the scroll test and is not meaningless, is less of a bottleneck in most real world scenarios.
Updated•9 years ago
|
Status: NEW → RESOLVED
Closed: 9 years ago
Flags: needinfo?(matt.woodrow)
Resolution: --- → WONTFIX
Reporter | ||
Comment 9•8 years ago
|
||
these changes are uplifted to beta:
https://treeherder.mozilla.org/perf.html#/alerts?id=1371
and aurora:
https://treeherder.mozilla.org/perf.html#/alerts?id=1357
These are processor time and tart regressions, I suspect these fall under the same wontfix classification as we did for trunk.
Reporter | ||
Updated•8 years ago
|
You need to log in
before you can comment on or make changes to this bug.
Description
•