Closed Bug 920508 Opened 12 years ago Closed 7 years ago

Peacekeeper grid benchmark much slower than in Chrome

Tracking

()

Status:

RESOLVED DUPLICATE of bug 1493420

People

(Reporter: jandem, Unassigned)

References

(Depends on 1 open bug, Blocks 1 open bug)

Details

Attachments

(3 files)

Testcase 12 years ago Jan de Mooij [:jandem] 3.17 KB, text/html		Details
Testcase that only measures the bits included in the "time per frame" time 12 years ago Boris Zbarsky [:bzbarsky] 3.27 KB, text/html		Details
Testcase that more closely matches the benchmark 12 years ago Boris Zbarsky [:bzbarsky] 3.29 KB, text/html		Details

Jan de Mooij [:jandem]

Reporter

Description

•

12 years ago

Attached file Testcase — Details

I'm attaching a standalone version of http://www.peacekeeper.therichins.net/test2.html I had to rewrite the test a bit to not require jQuery but that shouldn't influence the results. Chrome 31 / Safari: 6 ms per frame Opera 12.16: 15 ms per frame Nightly: 30 ms per frame Webkit is a lot faster but Presto is/was also 2x faster than us. The test creates a grid of divs, then modifies the background color of each of them: this.pixels[y][x].style.backgroundColor = this.hex(r, g, 0); When I change this line to: this.pixels[y][x].foo = this.hex(r, g, 0); Every frame takes ~2 ms, same as with Chrome. According to Instruments, most of the time is in Layout/Graphics.

Jan de Mooij [:jandem]

Reporter

Comment 1

•

12 years ago

FWIW, Peacekeeper uses the same grid code for multiple tests. The only difference is the grid size. I hope that if we can make this fast, the tests with smaller grids will also improve: http://www.peacekeeper.therichins.net/test0.html http://www.peacekeeper.therichins.net/test1.html

Boris Zbarsky [:bzbarsky]

Comment 2

•

12 years ago

Just to make sure, is the peacekeeper harness measuring the time being shown here? Because afaict the time being shown is just the time to twiddle the styles, not the time it takes to then process the restyles or paint, right?

Flags: needinfo?(jdemooij)

Boris Zbarsky [:bzbarsky]

Updated

•

12 years ago

Depends on: 903372

Boris Zbarsky [:bzbarsky]

Comment 3

•

12 years ago

Attached file Testcase that only measures the bits included in the "time per frame" time — Details

Assuming that the "time per frame" time is what's being benchmarked here... What I see is: * 20% or so in jitcode and various JS engine bits _not_ counting proxies. * 4% under HTMLElementBinding::get_style (called off the fast ion path, yay). * 11% getting from js::Proxy::set to CSS2PropertiesBinding::set_backgroundColor. If we could fix TI to deal and generate a direct call to the latter, that would sure be nice. See bug 822442, basically, though maybe efaust has other bugs on this too. * 10% parsing the new property value. * 10% calling AttributeWillChange; a good chunk of this (6.5%) is HasAttributeDependentStyle. * 4% getting the inline style rule to modify. * 3% getting the base URI... we need to kill off xml:base. :( * 23% the SetAttrAndNotify call. Some of this is the SetAndTakeAttr (largely the free() call on the old style rule; about 3-4% here!) and about 13% is AttributeChanged. This time the HasAttributeDependentStyle is only about 5%; there is also 2% in GetStyleDisplay(), 2% adding things to the restyle tracker, and a bunch of self time in RestyleManager::AttributeChanged. We should really be able to do better for the AttributeWillChange/AttributeChanged bits for style attributes in particular... :(

Boris Zbarsky [:bzbarsky]

Updated

•

12 years ago

Depends on: 822442

Jan de Mooij [:jandem]

Reporter

Comment 4

•

12 years ago

(In reply to Boris Zbarsky [:bz] from comment #2) > Just to make sure, is the peacekeeper harness measuring the time being shown > here? Because afaict the time being shown is just the time to twiddle the > styles, not the time it takes to then process the restyles or paint, right? The harness does the following for every frame, iterations == 15000 for this test: --- var testTime = benchmark.elapsedTime(); if (testTime > benchmark.test.iterations) { benchmark.test.result = benchmark.renderedFrames / (testTime / 1000); // submit return; } benchmark.renderedFrames++; benchmark.test.run(testTime); // setTimeout, 5 ms --- So it's (number of frames) / (number of seconds)... Sorry, I should have added that same calculation to the standalone test...

Flags: needinfo?(jdemooij)

Boris Zbarsky [:bzbarsky]

Comment 5

•

12 years ago

Ah, so it's measuring wall-clock time, not style-twiddling time, I see.

Boris Zbarsky [:bzbarsky]

Comment 6

•

12 years ago

Attached file Testcase that more closely matches the benchmark — Details

This is showing, for me, about 130ms/frame times for us and about 80ms/frame times for Chrome. Note that all the stuff I profiled before is about 12% of our time now...

Boris Zbarsky [:bzbarsky]

Comment 7

•

12 years ago

OK, profiling the new thing we have: * 40% restyle processing. Some of this (~6%?) is TreeMatchContext stuff, 10% is resolving the new style context, 10% is CalcStyleDifference to determine what changed. 5% gcing the ruletree. We should really try do do better with inline style changes in particular; was sure we had bugs on this... * 5% display list construction * 17% building layers * 17% painting background colors. and most of the rest is the stuff from comment 3 but scaled down by a factor of 7.

Jan de Mooij [:jandem]

Reporter

Comment 8

•

12 years ago

(In reply to Boris Zbarsky [:bz] from comment #7) > * 40% restyle processing. Some of this (~6%?) is TreeMatchContext stuff, > 10% is resolving > the new style context, 10% is CalcStyleDifference to determine what > changed. 5% gcing the > ruletree. We should really try do do better with inline style changes in > particular; was > sure we had bugs on this... David, do you know if we have bugs on this maybe? This is one of the Peacekeeper tests where we are much slower than Chrome and I'm trying to set dependencies for all these bugs so that we know what has to happen :)

Flags: needinfo?(dbaron)

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 9

•

11 years ago

I don't know; I'm not sure what bugs Boris was talking about. Are we rerunning selector matching as a result of style attribute changes? I thought we weren't anymore, but maybe not. I'm having trouble telling if that's the 6% plus 10% that bz is talking about, though I suppose I could re-profile.

Flags: needinfo?(dbaron)

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Updated

•

11 years ago

Depends on: 977991

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 10

•

11 years ago

(In reply to David Baron [:dbaron] (needinfo? me) (UTC-8) from comment #9) > Are we rerunning selector matching as a result of style attribute changes? > I thought we weren't anymore, but maybe not. I'm having trouble telling if > that's the 6% plus 10% that bz is talking about, though I suppose I could > re-profile. Er, we still do that, and I should have known that. Anyway, filed bug 977991.

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 11

•

11 years ago

(Still, that covers at most 16% if I'm understanding bz's data correctly, and it wouldn't reduce it to 0, either, though maybe close enough.)

Boris Zbarsky [:bzbarsky]

Comment 12

•

11 years ago

I was mostly talking about not rerunning selector matching when changing inline style, yes.

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 13

•

11 years ago

On 64bit linux we're doing really badly with attachment 809984 [details] Nightly 110ms, Chromium 33ms. 65% of refresh driver tick is flushing styles.

Jet Villegas (inactive)

Comment 14

•

11 years ago

Cam: can you have a quick look at Olli's test case and see if our current restyle perf bugs already cover it? Thanks!

Flags: needinfo?(cam)

Cameron McCormack (:heycam)

Comment 15

•

11 years ago

To my knowledge, bug 977991 (already landed) was the only one that would help with this test. On my machine I'm getting: 18% RefreshDriverTimer::Tick -> nsRefreshDriver::Tick -> ... -> DoProcessRestyles 12% nsRefreshDriverTimer::FinishedWaitingForTransaction -> nsRefreshDriver::Tick -> ... -> DoProcessRestyles and of the call stack (is there a way I can merge these two in Cleopatra?): * 8.4% is under RestyleSelf, of which about half is in CalcStyleDifference and its rule tree walking and style computation, and most of the rest under ResolveStyleWithReplacement * 2.8% GCing the rule tree and destroying style contexts * ~2.5% doing TreeMatchContext stuff now -- which actually isn't necessary since we're not going to do any selector matching as part of the restyle

Flags: needinfo?(cam)

Cameron McCormack (:heycam)

Comment 16

•

11 years ago

(In reply to Cameron McCormack (:heycam) from comment #15) > and of the call stack (is there a way I can merge these two in Cleopatra?): of the first call stack

Cameron McCormack (:heycam)

Updated

•

11 years ago

Depends on: 1109939

Mayank Bansal

Comment 17

•

7 years ago

on a fresh nightly: profile: http://bit.ly/2CMSUdr profile with sequential styling: http://bit.ly/2CMSYtH almost all the time is spent in styling

Emilio Cobos Álvarez (:emilio)

Comment 18

•

7 years ago

Any chance of getting a profile with sequential styling? I suspect this is bug 1493420 basically.

Mayank Bansal

Comment 19

•

7 years ago

(In reply to Emilio Cobos Álvarez (:emilio) from comment #18) > Any chance of getting a profile with sequential styling? I suspect this is > bug 1493420 basically. written in comment 17 profile with sequential styling: http://bit.ly/2CMSYtH

Flags: needinfo?(emilio)

Emilio Cobos Álvarez (:emilio)

Comment 20

•

7 years ago

(In reply to Mayank Bansal from comment #19) > (In reply to Emilio Cobos Álvarez (:emilio) from comment #18) > > Any chance of getting a profile with sequential styling? I suspect this is > > bug 1493420 basically. > > written in comment 17 > profile with sequential styling: http://bit.ly/2CMSYtH Bleh, I'm blind. Yeah, 74% of the time in ensure_child, that's bug 1493420.

Status: NEW → RESOLVED

Closed: 7 years ago

Flags: needinfo?(emilio)

Resolution: --- → DUPLICATE

Emilio Cobos Álvarez (:emilio)

Comment 21

•

6 years ago

We look roughly as fast or faster than chromium on the latest nightly, can you confirm it matches what you see too?

I see other stuff we could optimize as well...

Flags: needinfo?(jdemooij)

Emilio Cobos Álvarez (:emilio)

Comment 22

•

6 years ago

If you still see measurable differences, then please reopen and ni? me and I can take a look at those other optimizations.

Jan de Mooij [:jandem]

Reporter

Comment 23

•

6 years ago

(In reply to Emilio Cobos Álvarez (:emilio) from comment #22)

If you still see measurable differences, then please reopen and ni? me and I can take a look at those other optimizations.

What I'm seeing here on OS X:

Latest Nightly: 8-10 ms
Chrome Canary: 5-7 ms
Safari 12.1: 2-3 ms

We've come a long way and I don't know if this is worth spending more time on. However the Safari numbers are curious...

Flags: needinfo?(jdemooij) → needinfo?(emilio)

Jan de Mooij [:jandem]

Reporter

Comment 24

•

6 years ago

Oh my bad. I was using the original test case. On https://bug920508.bmoattachments.org/attachment.cgi?id=809984 we indeed match Chrome now if I also enable WebRender, about 22-24 ms.

Flags: needinfo?(emilio)

You need to log in before you can comment on or make changes to this bug.