Closed Bug 1373723 Opened 7 years ago Closed 17 days ago

Speedometer async benchmarks performance is very flaky

Categories

(Core :: DOM: Core & HTML, defect, P2)

Tracking

RESOLVED WONTFIX
Performance Impact medium

People

(Reporter: bbouvier, Unassigned)

References

(Blocks 1 open bug)

Details

(Keywords: perf:responsiveness)

Attachments

(1 file)

When looking at any of the Speedometer2 tests that contain "async" in their name:
https://arewefastyet.com/#machine=35&view=single&suite=speedometer-misc
(win64)
https://arewefastyet.com/#machine=17&view=single&suite=speedometer-misc
(win32)
https://arewefastyet.com/#machine=36&view=single&suite=speedometer-misc

It appears that performance on Firefox is very flaky, with a lot of variance, while performance on Chromium is very stable. This suggests a bug on our side. I haven't profiled these tests or investigated further; the first suspects that come to mind are GC and event loop handling.
This is basically bug 1366803. A number of bugs were fixed to run stuff during idle time, but there's more to be done here.
FWIW, we don't have enough idle time to run GC/CC during idle periods while executing Speedometer.
I'm planning to try increasing NS_INTERSLICE_GC_DELAY and kICCIntersliceDelay to see whether that gives us a better chance of using idle time, but I doubt it will help much.
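The idea above can be illustrated with a toy model (this is not Gecko code; NS_INTERSLICE_GC_DELAY and kICCIntersliceDelay are real Gecko constants, but the scheduling logic and all numbers below are hypothetical simplifications). After a GC slice finishes, the next slice is deferred for the interslice delay; if an idle period opens up before that deadline, the slice can run there, otherwise it is forced onto the main thread. A longer delay gives idle periods more chances to "catch" the slice:

```javascript
// Toy model of interslice GC slice scheduling (hypothetical, not Gecko
// code). Times are in milliseconds.
function nextSliceStart(lastSliceEnd, intersliceDelay, idlePeriods) {
  const deadline = lastSliceEnd + intersliceDelay;
  for (const idle of idlePeriods) {
    // An idle period beginning after the previous slice and before the
    // deadline can host the next slice.
    if (idle.start >= lastSliceEnd && idle.start <= deadline) {
      return { time: idle.start, idle: true };
    }
  }
  // No idle period arrived in time: force a non-idle slice at the deadline.
  return { time: deadline, idle: false };
}

const idlePeriods = [{ start: 250, end: 260 }];
// Short 100ms delay: the slice is forced before any idle time arrives.
console.log(nextSliceStart(0, 100, idlePeriods)); // { time: 100, idle: false }
// Longer 400ms delay: the idle period at t=250 hosts the slice.
console.log(nextSliceStart(0, 400, idlePeriods)); // { time: 250, idle: true }
```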
I hope nursery-allocated strings and the GC scheduler will help. Do you know what's triggering the GCs on Speedometer?
Mostly GC allocation, and also some CC_WAITING.
Should we dupe it against bug 1366803?
I have been using this patch to mark the async test periods as User Timing entries in the profiler UI. It is helpful for profiling this period exclusively.
Priority: -- → P2
Markus noticed that the backbone tests seem to run into an issue where a timer that the test schedules may end up running within the async period: http://bit.ly/2tOFUwU

The AWFY graphs seem to suggest that this test is a source of noise in the benchmark for us. This might explain why.
(In reply to :Ehsan Akhgari (needinfo please, extremely long backlog) from comment #7)
> Markus noticed that the backbone tests seem to run into an issue where a
> timer that the test schedules may end up running within the async period:
> http://bit.ly/2tOFUwU
> 
> The AWFY graphs seem to suggest that this test is a source of noise in the
> benchmark for us. This might explain why.

Note, Chrome currently clamps all setTimeout() calls to a minimum of 1ms, so a setTimeout(f, 0) in Chrome will fire later than it does for us. I'm not sure we want to do that, but perhaps it helps them here.
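The clamp described above can be sketched as a pure helper (a hypothetical illustration, not Chrome's or Gecko's actual timer code): every requested delay is raised to a minimum floor before the timer is armed, so a zero-delay timer fires no earlier than 1ms after scheduling:

```javascript
// Hypothetical sketch of a setTimeout delay clamp, as described in the
// comment above: requested delays below the floor are raised to it.
function clampTimeout(requestedMs, minMs = 1) {
  return Math.max(requestedMs, minMs);
}

console.log(clampTimeout(0)); // 1: a 0ms request becomes a 1ms timer
console.log(clampTimeout(5)); // 5: longer delays are untouched
```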
Tagging this with [qf], as the async tests on Speedometer still show a lot of variance:
https://arewefastyet.com/#machine=35&view=breakdown&suite=speedometer-misc
Whiteboard: [qf]
Whiteboard: [qf] → [qf:p1:f67]
I ran these tests locally through the Raptor framework, and yes, the async tests show a relative standard deviation many times that of the sync tests.

As a quick experiment, I implemented a 1ms clamp on the timeout intervals (https://phabricator.services.mozilla.com/D10946).
I believe this is correctly implemented.

However, when running with this change I did not see a significant reduction in variance on the async tests.
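The comparison above uses relative standard deviation (standard deviation divided by the mean), which makes noise comparable across tests with different absolute runtimes. A small sketch with made-up sample timings:

```javascript
// Relative standard deviation (coefficient of variation): population
// stddev divided by the mean. Sample numbers below are made up.
function relStdDev(samples) {
  const mean = samples.reduce((a, b) => a + b, 0) / samples.length;
  const variance =
    samples.reduce((acc, x) => acc + (x - mean) ** 2, 0) / samples.length;
  return Math.sqrt(variance) / mean;
}

const syncRuns = [100, 101, 99, 100];  // low-variance sync test
const asyncRuns = [100, 130, 80, 110]; // flaky async test
console.log(relStdDev(syncRuns) < relStdDev(asyncRuns)); // true
```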
Whiteboard: [qf:p1:f67] → [qf:p2:responsiveness]
Component: DOM → DOM: Core & HTML
Performance Impact: --- → P2
Whiteboard: [qf:p2:responsiveness]
Severity: normal → S3

The Speedometer 2 runner is inherently buggy: a measured run may or may not include rAF/painting. Speedometer 3 has a better runner.

Status: NEW → RESOLVED
Closed: 17 days ago
Resolution: --- → WONTFIX
