1823730 - Implement new methodology for profiling benchmark tests in CI

Reporter

Description

•

2 years ago

The current profiling methodology in CI expects a page load and is not well suited to benchmark tests. As a result, profiles of benchmark tests such as Speedometer are often useless. Instead, I think we need a new methodology in which the test itself starts and stops the profiler during the benchmark measurement.

The general idea would be something like this:

1. Implement a start/stop profiler command in browsertime.
2. Pass a boolean to the benchmark test script to indicate whether profiling is enabled. It can be passed on the command line with --browsertime.profiling true (see https://www.sitespeed.io/documentation/sitespeed.io/scripting/#pass-your-own-options-to-your-script).
3. Modify the test script to start and stop the profiler manually. In Speedometer, for example, we should probably switch to the interactive runner and profile each subtest. This would create a lot of profiles, but it would ensure we don't run out of profiler buffer space.
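
The steps above could look roughly like the following in a test script. This is only a sketch under assumptions: commands.profiler.start()/stop() stands in for the start/stop command that step 1 proposes, the option path context.options.browsertime.profiling is a guess at where the --browsertime.profiling value would land, and runBenchmark and the subtests array are hypothetical names used for illustration.

```javascript
// Sketch of a browsertime-style benchmark script that profiles each subtest.
// Assumptions: `context.options.browsertime.profiling` carries the value of
// --browsertime.profiling, and `commands.profiler` is the proposed
// start/stop command. `runBenchmark` and `subtests` are hypothetical.
async function runBenchmark(context, commands, subtests) {
  // Command-line values may arrive as strings, so accept both forms.
  const opts = (context.options && context.options.browsertime) || {};
  const profiling = opts.profiling === true || opts.profiling === "true";

  const results = [];
  for (const subtest of subtests) {
    if (profiling) {
      await commands.profiler.start();
    }
    try {
      results.push({ name: subtest.name, value: await subtest.run() });
    } finally {
      // Stop even if the subtest throws, so a started profile is always closed.
      if (profiling) {
        await commands.profiler.stop();
      }
    }
  }
  return results;
}

// Browsertime scripts export a single async function; the guard keeps the
// sketch loadable outside CommonJS as well.
if (typeof module !== "undefined") {
  module.exports = runBenchmark;
}
```

Starting and stopping around each subtest keeps every individual profile short, which is the point of step 3: many small profiles instead of one that overflows the buffer.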

Kash Shampur [:kshampur] ⌚EST

Assignee

Updated

•

2 years ago

Severity: -- → S3

Priority: -- → P2

Whiteboard: [fxp]

Jira Integration Bot

Updated

•

2 years ago

See Also: → https://mozilla-hub.atlassian.net/browse/FXP-2668

Kash Shampur [:kshampur] ⌚EST

Assignee

Updated

•

2 years ago

Component: Performance → Raptor

Denis Palmeiro [:denispal]

Reporter

Updated

•

2 years ago

Whiteboard: [fxp] → [fxp][sp3]

Kash Shampur [:kshampur] ⌚EST

Assignee

Comment 1

•

2 years ago

Something like this sort of seems to be working locally https://github.com/92kns/browsertime/commit/0c50a6c5a5bb79b9581a78e7e92638ac94f934e8
e.g. I was able to profile an entire benchmark run, depending on where I placed the start/stop calls in our benchmark scripts.

example try https://treeherder.mozilla.org/jobs?repo=try&revision=58ddfb1cdcaa2687ba7d07a728f7cbcce113d7eb&selectedTaskRun=RD6A5cMVTOCBGJ_i7_AH9w.0

Here is a permalink to the sp3 Linux profile from the browsertime-results.tgz artifact: https://share.firefox.dev/3nDGJc1

The profiler-*.zip isn't in the artifacts itself, so I need to double-check why it isn't uploading.

Kash Shampur [:kshampur] ⌚EST

Assignee

Comment 2

•

2 years ago

OK, here is a new Try run; the profiler links now show up in the artifacts:
https://treeherder.mozilla.org/jobs?repo=try&revision=ebe9ebce4b10da3408f66fca8fc13e49460ff8d7

Kash Shampur [:kshampur] ⌚EST

Assignee

Comment 3

•

2 years ago

hi Denis, is this close to what you envisioned?

for something like this in browsertime you could put browserProfiler.start() here and stop() after here (or wherever, and similarly for S3)

the flag that we could pass would be something like this geckoProfilerCustom into our browsertime options variable here

Flags: needinfo?(dpalmeiro)

Denis Palmeiro [:denispal]

Reporter

Comment 4

•

2 years ago

(In reply to Kash Shampur [:kshampur] ⌚EST from comment #3)

> hi Denis, is this close to what you envisioned?
>
> for something like this in browsertime you could put browserProfiler.start() here and stop() after here (or wherever, and similarly for S3)
>
> the flag that we could pass would be something like this geckoProfilerCustom into our browsertime options variable here

This looks great, thanks Kash! A couple of minor notes:

- I don't think you need firefoxConfig.geckoProfilerCustom. For the profiling flag, instead of adding a new option, you can just use --browsertime.profiling or something like you do for --browsertime.url and consume it here. I would assume it's undefined if not passed in.
- I can also see from your posted profile above that we run out of buffer space in the profile. The default buffer size seems to be 100MB. You may try increasing the buffer size to 500MB or higher to see if we can capture the entire benchmark run.
- Maybe throw an error if the user specifies --firefox.geckoProfiler and then tries to turn it on manually in the script while it's running.
- Maybe consider renaming browserProfiler in the browsertime patch to geckoProfiler unless you're thinking of also adding in chrome support which would be cool. In that case, maybe something like commands.profiler.start()/commands.profiler.stop() makes more sense?
- You'll also probably have to take care of the index, url parameters under the hood as that doesn't make much sense to me as a user calling profiler start/stop.
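
The last three notes could be combined in one command object, roughly as sketched below. This is not browsertime's actual code: ProfilerCommand, backend, and getPageState are hypothetical names, and the options shape (options.firefox.geckoProfiler) is assumed from the --firefox.geckoProfiler flag.

```javascript
// Hypothetical wrapper illustrating the notes above; not browsertime's
// real implementation.
class ProfilerCommand {
  constructor(backend, options, getPageState) {
    this.backend = backend; // object exposing start() and stop(index, url)
    this.options = options; // parsed command-line options (assumed shape)
    this.getPageState = getPageState; // returns { index, url } for the current iteration
    this.running = false;
  }

  async start() {
    // Refuse manual control when automatic profiling was requested.
    if (this.options.firefox && this.options.firefox.geckoProfiler) {
      throw new Error(
        "--firefox.geckoProfiler is set; do not also start the profiler from the script"
      );
    }
    if (this.running) {
      throw new Error("Profiler is already running");
    }
    await this.backend.start();
    this.running = true;
  }

  async stop() {
    if (!this.running) {
      throw new Error("Profiler is not running");
    }
    // index and url are looked up internally, so callers just write stop().
    const { index, url } = this.getPageState();
    this.running = false;
    return this.backend.stop(index, url);
  }
}
```

With a wrapper like this, a test script only ever calls start() and stop(), and the conflict with --firefox.geckoProfiler surfaces as an error at the first start() call rather than as a confusing profile.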

Flags: needinfo?(dpalmeiro)

Kash Shampur [:kshampur] ⌚EST

Assignee

Comment 5

•

2 years ago

Thanks for the feedback!

Actually, I made a mistake in my previous comment. I do pass a boolean similar to --browsertime.profiling, just as you described (I currently call it exposed_gp as a placeholder, but that's not the important point), and when the test is a benchmark, geckoProfilerCustom is set here instead of geckoProfiler.

If I understood your point about the index and url parameters: I did have an initial prototype that automatically gets the index/url within the browsertime geckoprofiler stop function, so that the user calls stop() rather than stop(index, url). I will go back to that implementation.

> Maybe consider renaming browserProfiler in the browsertime patch to geckoProfiler unless you're thinking of also adding in chrome support which would be cool. In that case, maybe something like commands.profiler.start()/commands.profiler.stop() makes more sense

Yeah, the intention was to keep it generic, as Peter sort of alluded to having this for Chrome trace. I imagine it will be useful when we get around to running Trace in our CI, e.g. https://mozilla-hub.atlassian.net/browse/FXP-2438 (but this would be a future browsertime patch).
Anyway, browserProfiler -> profiler is a reasonable change; I can add that.

> Maybe throw an error if the user specifies --firefox.geckoProfiler and then tries to turn it on manually in the script while it's running.

Yeah, I'm not yet sure whether this should be handled on our script side or on the browsertime side.

Anyway, I will apply some or all of these changes and report back (buffer, parameters, naming convention, etc.).

Kash Shampur [:kshampur] ⌚EST

Assignee

Comment 6

•

2 years ago

Added some of the changes on the github side here

and here is a corresponding Try
This uses commands.profiler.start()/stop() (without having to pass index/url anymore), and we can set the buffer size here ourselves. In this Try I set it to 5 times the default from your link, so it should be about 500 MB.
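
For reference, the arithmetic behind "5 times the default" can be sketched as follows. Two assumptions are baked in and are not confirmed anywhere in this bug: that the buffer size is expressed as a number of entries of 8 bytes each, and that the default is 13107200 entries (which works out to the 100 MB mentioned in comment 4). The function names are illustrative only.

```javascript
// Assumed values, not taken from this bug: one buffer entry is 8 bytes and
// the default buffer holds 13107200 entries.
const BYTES_PER_ENTRY = 8;
const DEFAULT_ENTRIES = 13107200;

// Convert a buffer size in entries to mebibytes.
function entriesToMiB(entries) {
  return (entries * BYTES_PER_ENTRY) / (1024 * 1024);
}

// Scale the default buffer by an integer factor, returning entries.
function scaledBufferEntries(factor) {
  return DEFAULT_ENTRIES * factor;
}

console.log(entriesToMiB(DEFAULT_ENTRIES)); // 100
console.log(scaledBufferEntries(5)); // 65536000
console.log(entriesToMiB(scaledBufferEntries(5))); // 500
```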

> Maybe throw an error if the user specifies --firefox.geckoProfiler and then tries to turn it on manually in the script while it's running.

As mentioned in my previous comment, I'm unsure whether we should handle this on our side or in browsertime. I'm going to reach out to Peter for his thoughts on both this and the patch so far.

Main thing I wanted to ask: Denis, does the profile look as you expect with the now increased buffer size?

Flags: needinfo?(dpalmeiro)

Denis Palmeiro [:denispal]

Reporter

Comment 7

•

2 years ago

Yes, that profile does look a lot better!

Flags: needinfo?(dpalmeiro)

Kash Shampur [:kshampur] ⌚EST

Assignee

Comment 8

•

2 years ago

•

Edited

https://github.com/sitespeedio/browsertime/pull/1934 has been merged, so we can begin working on the m-c side of this (though technically it has already been worked on for the Try runs above).

Assignee: nobody → kshampur

Status: NEW → ASSIGNED

Kash Shampur [:kshampur] ⌚EST

Assignee

Updated

•

2 years ago

Priority: P2 → P1

Kash Shampur [:kshampur] ⌚EST

Assignee

Comment 9

•

2 years ago

Attached file Bug 1823730 - Improve profling for raptor-browsertime benchmark tests. r?#perftest — Details

Previously, the logic for profiling raptor tests was intended for browsertime pageload tests. The profiles for benchmark tests were not that useful.
This patch uses a new command in browsertime which makes use of the exposed geckoprofiler start/stop commands to manually choose when to start and stop profiling through our own custom scripts.

Phabricator Automation

Updated

•

2 years ago

Attachment #9329109 - Attachment description: WIP: Bug 1823730 - Improve profling for raptor-browsertime benchmark tests. r?#perftest → Bug 1823730 - Improve profling for raptor-browsertime benchmark tests. r?#perftest

Kash Shampur [:kshampur] ⌚EST

Assignee

Updated

•

2 years ago

Blocks: 1829809

Kash Shampur [:kshampur] ⌚EST

Assignee

Updated

•

2 years ago

Blocks: 1829810

Phabricator Automation

Updated

•

1 year ago

Attachment #9329109 - Attachment description: Bug 1823730 - Improve profling for raptor-browsertime benchmark tests. r?#perftest → WIP: Bug 1823730 - Improve profling for raptor-browsertime benchmark tests. r?#perftest

Phabricator Automation

Updated

•

1 year ago

Attachment #9329109 - Attachment description: WIP: Bug 1823730 - Improve profling for raptor-browsertime benchmark tests. r?#perftest → Bug 1823730 - Improve profling for raptor-browsertime benchmark tests. r?#perftest

Kash Shampur [:kshampur] ⌚EST

Assignee

Updated

•

1 year ago

Blocks: 1831361

Comment 10

•

1 year ago

Pushed by kshampur@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/7bdbd5815ea7 Improve profling for raptor-browsertime benchmark tests. r=perftest-reviewers,sparky

Narcis Beleuzu [:NarcisB]

Comment 11

•

1 year ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/7bdbd5815ea7

Status: ASSIGNED → RESOLVED

Closed: 1 year ago

status-firefox114: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → 114 Branch

Alex Finder

Updated

•

1 year ago

Regressions: 1832246
