1411243 - 4.32 - 5.56% damp (linux64, windows10-64, windows7-32) regression on push 60ff8f8364959d64e9dfc838693f8ca3c9fc9913 (Mon Oct 23 2017)

Reporter

Description

•

7 years ago

Talos has detected a Firefox performance regression from push:

https://hg.mozilla.org/integration/autoland/pushloghtml?changeset=60ff8f8364959d64e9dfc838693f8ca3c9fc9913

As author of one of the patches included in that push, we need your help to address this regression.

Regressions:

  6%  damp summary windows10-64 pgo e10s     216.44 -> 228.47
  5%  damp summary windows7-32 pgo e10s      211.30 -> 222.83
  5%  damp summary windows7-32 opt e10s      267.79 -> 279.90
  4%  damp summary windows10-64 opt e10s     254.83 -> 266.24
  4%  damp summary linux64 pgo e10s          231.35 -> 241.48
  4%  damp summary linux64 opt e10s          257.31 -> 268.43


You can find links to graphs and comparison views for each of the above tests at: https://treeherder.mozilla.org/perf.html#/alerts?id=10142

On the page above you can see an alert for each affected platform as well as a link to a graph showing the history of scores for this test. There is also a link to a treeherder page showing the Talos jobs in a pushlog format.

To learn more about the regressing test(s), please see: https://wiki.mozilla.org/Buildbot/Talos/Tests

For information on reproducing and debugging the regression, either on try or locally, see: https://wiki.mozilla.org/Buildbot/Talos/Running

*** Please let us know your plans within 3 business days, or the offending patch(es) will be backed out! ***

Our wiki page outlines the common responses and expectations: https://wiki.mozilla.org/Buildbot/Talos/RegressionBugsHandling

Ionuț Goldan [:igoldan]

Reporter

Updated

•

7 years ago

Component: Untriaged → Developer Tools: Inspector

Ionuț Goldan [:igoldan]

Reporter

Comment 1

•

7 years ago

:jdescottes We noticed these regressions? Can we fix them or is this something we should accept?

Flags: needinfo?(jdescottes)

Julian Descottes [:jdescottes]

Assignee

Comment 2

•

7 years ago

Investigating. 

None of the talos pushes I did for Bug 1171482 highlighted a global regression, but I did modify the last patch in afterwards.
I can't really understand how my modifications would affect tests on the performance tool.

If needed, only the last patch needs to be backed out, the other ones should have no impact.

Ionut: Just to make sure, these patches add a new test: inspector.mutations. I assume the results of this test are not counted in the summary, since,the previous revision does not have the test.

Flags: needinfo?(jdescottes) → needinfo?(igoldan)

Julian Descottes [:jdescottes]

Assignee

Comment 3

•

7 years ago

For now I can't reproduce locally. Three talos pushes:

- throttle: https://treeherder.mozilla.org/perf.html#/comparechooser?newProject=try&newRevision=4ef9754858f6463ab2b29ed3d912d74bc64f04d0
- new test: https://treeherder.mozilla.org/perf.html#/comparechooser?newProject=try&newRevision=639956abc5e52cfd3cdc4f35d816588a4975289a
- baseline: https://treeherder.mozilla.org/perf.html#/comparechooser?newProject=try&newRevision=4b1a25fda6395ee936f54b11fbd11150d69831d0

One thing I didn't do in my tests was to compare with/without my new test. Maybe the new test impacts subsequent tests?

Julian Descottes [:jdescottes]

Assignee

Updated

•

7 years ago

Assignee: nobody → jdescottes

Status: NEW → ASSIGNED

Ionuț Goldan [:igoldan]

Reporter

Comment 4

•

7 years ago

(In reply to Julian Descottes [:jdescottes] from comment #2)
> Ionut: Just to make sure, these patches add a new test: inspector.mutations.
> I assume the results of this test are not counted in the summary, since,the
> previous revision does not have the test.
I see 6 subtest regressions ranging between 3 and 11%.
I think you made a good point though, regarding the new test. I'm ni?ing :jmaher for this, as I am not yet sure.

Flags: needinfo?(igoldan)

Julian Descottes [:jdescottes]

Assignee

Comment 5

•

7 years ago

(In reply to Ionuț Goldan [:igoldan], Performance Sheriffing from comment #4)
> (In reply to Julian Descottes [:jdescottes] from comment #2)
> > Ionut: Just to make sure, these patches add a new test: inspector.mutations.
> > I assume the results of this test are not counted in the summary, since,the
> > previous revision does not have the test.
> I see 6 subtest regressions ranging between 3 and 11%.

Yep there's definitely something off, I'm investigating. I just wanted to clarify the potential impact of the new test

> I think you made a good point though, regarding the new test. I'm ni?ing
> :jmaher for this, as I am not yet sure.

adding ni? flag

Flags: needinfo?(jmaher)

Alexandre Poirot [:ochameau]

Comment 6

•

7 years ago

Unfortunately, I think that's a know issue of the median algorithm used on Talos.

This changeset added this new DAMP subtest:
  damp inspector.mutations - 2,229.87 ± 1.74% 

Which is significant new value in the test set.
That would explain the summary regression.


Is this run a significant one? That's the only one I could get from the alert page with subtests being run:
https://treeherder.mozilla.org/perf.html#/comparesubtest?originalProject=autoland&originalRevision=f9b62ceada2a74d3005d983ff4ccffcc048dd3fd&newProject=autoland&newRevision=60ff8f8364959d64e9dfc838693f8ca3c9fc9913&originalSignature=f687ee347bcbdc55828148af029908049de69b7f&newSignature=f687ee347bcbdc55828148af029908049de69b7f&framework=1

As Julian said, the subtests regressions are surprising.
This patch should not affect performance and console much.
It may highlight some mix up into DAMP subtests...
(I hope to reduce this kind of weird mixup in bug 1407826)
... or unexpected consequences of this patch?

Julian Descottes [:jdescottes]

Assignee

Comment 7

•

7 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #6)
> Unfortunately, I think that's a know issue of the median algorithm used on
> Talos.
> 
> This changeset added this new DAMP subtest:
>   damp inspector.mutations - 2,229.87 ± 1.74% 
> 
> Which is significant new value in the test set.
> That would explain the summary regression.
> 
> 
> Is this run a significant one? That's the only one I could get from the
> alert page with subtests being run:
> https://treeherder.mozilla.org/perf.html#/
> comparesubtest?originalProject=autoland&originalRevision=f9b62ceada2a74d3005d
> 983ff4ccffcc048dd3fd&newProject=autoland&newRevision=60ff8f8364959d64e9dfc838
> 693f8ca3c9fc9913&originalSignature=f687ee347bcbdc55828148af029908049de69b7f&n
> ewSignature=f687ee347bcbdc55828148af029908049de69b7f&framework=1
> 

You can get subtests from other platforms if you go to the linked alert: https://treeherder.mozilla.org/perf.html#/alerts?id=10143

(no reruns, but it looks like linux shows the same kind of regressions, for instance on the simple performance open test)

Julian Descottes [:jdescottes]

Assignee

Comment 8

•

7 years ago

Some results are in:

baseline vs new test: https://treeherder.mozilla.org/perf.html#/compare?originalProject=try&originalRevision=4b1a25fda6395ee936f54b11fbd11150d69831d0&newProject=try&newRevision=639956abc5e52cfd3cdc4f35d816588a4975289a&framework=1

The summary shows the alert, which means that the new test is most likely responsible for the initial alert here.

When it comes to subtests, the new test by itself seems to slow down:
- complicated.webconsole.close
- simple.styleeditor.reload

baseline vs throttle: https://treeherder.mozilla.org/perf.html#/compare?originalProject=try&originalRevision=4b1a25fda6395ee936f54b11fbd11150d69831d0&newProject=try&newRevision=d4d5e9b9e9dd814dfeda50fadbd5fd7ca4798615
(I had to push again, my initial throttle push contained additional perf fixes...)

The summary is still red, but the subtests don't really have the same profile as the ones from the alert (in particular simple.performance.open doesn't show up).

> (no reruns, but it looks like linux shows the same kind of regressions, for instance on the simple performance open test)
Oops, I misread the results. I looked at complicated rather than simple here.

Maybe this is a windows-only behavior. Will spawn new jobs.

Joel Maher ( :jmaher ) (UTC -8)

Comment 9

•

7 years ago

any new tests will change the summary, we use a geometric mean and adding tests will change the value posted.  

typically we ignore subtests as they might be different on each OS and harder to debug, if there is work we can do to fix the subtest regressions, that is great.

Flags: needinfo?(jmaher)

Julian Descottes [:jdescottes]

Assignee

Comment 10

•

7 years ago

Windows results are in: simply adding the new test seems to slow down all the "simple" tests, especially the performance.open test.

See baseline vs new test: https://treeherder.mozilla.org/perf.html#/compare?originalProject=try&originalRevision=4b1a25fda6395ee936f54b11fbd11150d69831d0&newProject=try&newRevision=639956abc5e52cfd3cdc4f35d816588a4975289a&framework=1

Joel Maher ( :jmaher ) (UTC -8)

Comment 11

•

7 years ago

that is really odd, but maybe it is related to runtime or other browser cleanup/interactions.

Alexandre Poirot [:ochameau]

Comment 12

•

7 years ago

You may want to try with such change applied in your base (in your base, as it shuffles all DAMP results!):
https://hg.mozilla.org/try/rev/6155265071fcf9931a444c606e506d09c5e93a26
I should fix various issues where tests are polluting the next ones.
(for example by introducing significant GC during next tests, or by having long lasting async call still running up)

Julian Descottes [:jdescottes]

Assignee

Comment 13

•

7 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #12)
> You may want to try with such change applied in your base (in your base, as
> it shuffles all DAMP results!):
> https://hg.mozilla.org/try/rev/6155265071fcf9931a444c606e506d09c5e93a26
> I should fix various issues where tests are polluting the next ones.
> (for example by introducing significant GC during next tests, or by having
> long lasting async call still running up)

Sure!
- baseline: https://treeherder.mozilla.org/perf.html#/comparechooser?newProject=try&newRevision=4a28cd7e3409612885297a59ae30647818a0019c
- new test: https://treeherder.mozilla.org/perf.html#/comparechooser?newProject=try&newRevision=e2adee1cdc8a621330797cba1fec192120b618c1
- baseline vs new test: https://treeherder.mozilla.org/perf.html#/compare?originalProject=try&originalRevision=4a28cd7e3409612885297a59ae30647818a0019c&newProject=try&newRevision=e2adee1cdc8a621330797cba1fec192120b618c1&framework=1

(just pushed so need to wait a bit to get the results)

Julian Descottes [:jdescottes]

Assignee

Comment 14

•

7 years ago

The results look much better. My new test seems to have very little impact on the other tests when rebasing on top of Alex patches. (note that g2 running time nearly doubles with those patches though :/)

I think so far we identified that:
- the summary alert was simply coming from the new test
- the subtests variations should be fixed by the changes Alex is working on

Alex: should we deactivate the new test until your changes land? I think it can be nice to avoid polluting DAMP if possible.

Flags: needinfo?(poirot.alex)

Alexandre Poirot [:ochameau]

Comment 15

•

7 years ago

(In reply to Julian Descottes [:jdescottes] from comment #14)
> The results look much better. My new test seems to have very little impact
> on the other tests when rebasing on top of Alex patches. (note that g2
> running time nearly doubles with those patches though :/)
> 
> I think so far we identified that:
> - the summary alert was simply coming from the new test
> - the subtests variations should be fixed by the changes Alex is working on
> 
> Alex: should we deactivate the new test until your changes land? I think it
> can be nice to avoid polluting DAMP if possible.

Hum. Isn't it already polluted?

The new test will bump the summary value no matter what we do.
For the others, we do not track performance panel much, so we can ignore it.
If that affects console, you may just put a comment in this doc:
  https://docs.google.com/document/d/1hwq0mnxS4Ycpq8pd8l-v3fRu2IQxAVh9ATu5-wYV2i4/edit
  (you can also comment about the summary bump)

Flags: needinfo?(poirot.alex)

Julian Descottes [:jdescottes]

Assignee

Comment 16

•

7 years ago

Ionut: the regressions spotted only related to the new test added to DAMP, so we will accept this change.

Flags: needinfo?(igoldan)

Ionuț Goldan [:igoldan]

Reporter

Comment 17

•

7 years ago

Ok, thanks for sharing this!

Flags: needinfo?(igoldan)

Ionuț Goldan [:igoldan]

Reporter

Updated

•

7 years ago

Status: ASSIGNED → RESOLVED

Closed: 7 years ago

Resolution: --- → WONTFIX

BMO Automation

Updated

•

6 years ago

Product: Firefox → DevTools

Bugzilla

Quick Search

4.32 - 5.56% damp (linux64, windows10-64, windows7-32) regression on push 60ff8f8364959d64e9dfc838693f8ca3c9fc9913 (Mon Oct 23 2017)

Categories

(DevTools :: Inspector, defect)

Tracking

(Not tracked)

People

(Reporter: igoldan, Assigned: jdescottes)

References

Details

(Keywords: perf, regression, talos-regression)

Crash Data

Security

(public)

User Story

Description

Updated

Comment 1

Comment 2

Comment 3

Updated

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Comment 12

Comment 13

Comment 14

Comment 15

Comment 16

Comment 17

Updated

Updated