1088154 - 35% OSX 10.8|10.6 TP5 Scroll regression on Inbound (v.36) on Oct 21/22 from revision fa9c6845338e

Joel Maher ( :jmaher ) (UTC -8) (PTO back normal Nov 17)

Reporter

Description

•

11 years ago

we have a regression in tp5 scroll on osx 10.8: http://graphs.mozilla.org/graph.html#tests=%5B%5B323,63,24%5D%5D&sel=1413478929189,1414083729189&displayrange=7&datatype=running here is a view of the pushlog and tests: https://tbpl.mozilla.org/?tree=Mozilla-Inbound&fromchange=0660519a81b7&tochange=19d906680583&jobname=Rev5%20MacOSX%20Mountain%20Lion%2010.8%20mozilla-inbound%20talos%20g1 we have all the builds/tests for the revisions around this specific revision: http://hg.mozilla.org/integration/mozilla-inbound/pushloghtml?fromchange=223e2f4b0d47&tochange=fa9c6845338e

Matt Woodrow (:mattwoodrow)

Comment 1

•

11 years ago

We also got a decent tscroll win from this changeset, right? Is there any easy way to get a breakdown of which individual test pages regressed?

Joel Maher ( :jmaher ) (UTC -8) (PTO back normal Nov 17)

Reporter

Comment 2

•

11 years ago

I see a 5.5% tscroll win here: http://alertmanager.allizom.org:8080/alerts.html?rev=fa9c6845338e&showAll=1&table=1 it could be that other improvements have not shown up yet, or are miscategorized to another revision (based on coalescing, etc.)

Matt Woodrow (:mattwoodrow)

Comment 3

•

11 years ago

There's a m.d.tree-management email showing a 15.9% tscrollx win for 10.8 for this changeset.

Joel Maher ( :jmaher ) (UTC -8) (PTO back normal Nov 17)

Reporter

Comment 4

•

11 years ago

got it, that showed up for m-c (I was looking at inbound), here is a list of all the alerts (almost all are generated when we merge between branches and pgo/non-pgo): http://alertmanager.allizom.org:8080/alerts.html?rev=fa9c6845338e&showAll=1 thanks for being persistent there, now let me figure out what tp5 scroll pages are causing problems.

Joel Maher ( :jmaher ) (UTC -8) (PTO back normal Nov 17)

Reporter

Comment 5

•

11 years ago

it appears that about half of the pages had a noticeable regression: https://datazilla.mozilla.org/?start=1413439230&stop=1414096959&product=Firefox&repository=Mozilla-Inbound&os=mac&os_version=OS%20X%2010.8&test=tp5o_scroll&graph_search=fa9c6845338e&tr_id=7456526&graph=naver.com&x86=true&x86_64=true&project=talos (be patient this did load for me) the ones that regressed it was a large enough break in the graph and it has sustained.

Matt Woodrow (:mattwoodrow)

Comment 6

•

11 years ago

Sweet, thanks Joel! A few of those are real clear shifts, should be easy enough to test with.

Joel Maher ( :jmaher ) (UTC -8) (PTO back normal Nov 17)

Reporter

Comment 7

•

11 years ago

do ask if you need any help running locally

Matt Woodrow (:mattwoodrow)

Comment 8

•

11 years ago

I can't reproduce any change locally. I'm getting a 'save file' dialog for every page load though, not sure if that's affecting things. I'm running via ./mach talos-test tp5o.

Joel Maher ( :jmaher ) (UTC -8) (PTO back normal Nov 17)

Reporter

Comment 9

•

11 years ago

tp5 scroll will need to be tested via ./mach. also mach doesn't set up the pagesets. Can you follow the directions here: https://wiki.mozilla.org/Buildbot/Talos/Running#Running_locally_-_Source_Code you can find tp5n.zip here: http://people.mozilla.org/~jmaher/taloszips/zips/. Download that and put it in your talos/talos/page_load_test directory, then unzip it. Finally you can run talos! if you are in your virtualenv, do: ./talos -e <path/to/firefox> -a tp5o_scroll --develop --results_url tp5.out --datazilla-url tp5.json

Matt Woodrow (:mattwoodrow)

Comment 10

•

11 years ago

Yeah, got it running fine, but I'm not seeing any regression locally.

Joel Maher ( :jmaher ) (UTC -8) (PTO back normal Nov 17)

Reporter

Comment 11

•

11 years ago

I suspect this boils down to machine specifics. Here is a summary of the machines we use in automation: https://wiki.mozilla.org/Buildbot/Talos/Misc#Hardware_Profile_of_machines_used_in_automation I have double checked and the changeset is definitely a culprit, although if this doesn't reproduce locally, that reduces the severity of it.

Joel Maher ( :jmaher ) (UTC -8) (PTO back normal Nov 17)

Reporter

Comment 12

•

10 years ago

this has landed on Aurora with a 45.4% regression, the original 35% and another ~10% around november 17th. OSX 10.6 shows the same two regressions, just much milder (17% overall)

Summary: 35% OSX 10.8 TP5 Scroll regression on Inbound (v.36) on Oct 21/22 from revision fa9c6845338e → 35% OSX 10.8|10.6 TP5 Scroll regression on Inbound (v.36) on Oct 21/22 from revision fa9c6845338e

Benoit Girard (:BenWa)

Comment 13

•

10 years ago

I wonder if the culling can break a layer from one draw call to multiple draw calls: XXX XXX XXX -> X.X XXX XXX where . is a culled rect. We'd now need to do 4 draw calls instead of one. This could explain a regression.

Joel Maher ( :jmaher ) (UTC -8) (PTO back normal Nov 17)

Reporter

Comment 14

•

10 years ago

:BenWa, could we fix this in Beta or Aurora? Not sure if there is a good reason for doing 3 extra draw calls.

Flags: needinfo?(bgirard)

Benoit Girard (:BenWa)

Comment 15

•

10 years ago

So I just disabled matt' culling on inbound and it's reporting regressions on bug TP5+tscroll on 10.8. So the culling probably makes things better overall and I'll probably re-enable it shortly. What I point out in Comment 13 is just a theoretical problem. I need to find time to look at it closer but I don't have any bandwidth left. This may have to wait until I can run some tests (so we don't land a patch blind without understanding it).

Flags: needinfo?(bgirard)

Joel Maher ( :jmaher ) (UTC -8) (PTO back normal Nov 17)

Reporter

Comment 16

•

10 years ago

ok, we can schedule this in due time. Thanks for looking into this so far!

Benoit Girard (:BenWa)

Comment 17

•

10 years ago

bug 1136766 will bring additional performance improvement to culling. It should make it such that it performs no more draw calls. We're hoping this will address this regression.

No longer blocks: 1085223

Depends on: 1136766, 1085223

Milan Sreckovic [:milan] (needinfo for best results)

Updated

•

9 years ago

Version: unspecified → 36 Branch

Nicolas Silva [:nical]

Updated

•

9 years ago

Whiteboard: [talos_regression] → [talos_regression][gfx-noted]

Milan Sreckovic [:milan] (needinfo for best results)

Updated

•

7 years ago

Priority: -- → P3

BMO Automation

Updated

•

3 years ago

Severity: normal → S3

Gregory Pappas [:gregp]

Updated

•

2 years ago

Status: NEW → RESOLVED

Closed: 2 years ago

Resolution: --- → INCOMPLETE

Bugzilla

35% OSX 10.8|10.6 TP5 Scroll regression on Inbound (v.36) on Oct 21/22 from revision fa9c6845338e

Categories

(Core :: Graphics: Layers, defect, P3)

Tracking

()

People

(Reporter: jmaher, Unassigned)

References

Details

(Keywords: perf, regression, Whiteboard: [talos_regression][gfx-noted])

Crash Data

Security

(public)

User Story

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Comment 12

Comment 13

Comment 14

Comment 15

Comment 16

Comment 17

Updated

Updated

Updated

Updated

Updated