Closed Bug 1803500 Opened 2 years ago Closed 2 years ago

30% regression on AWFY-Speedometer-Inferno around Nov-30

Categories

(Core :: Performance, defect)

defect

Tracking

()

RESOLVED FIXED
109 Branch
Performance Impact low
Tracking Status
firefox-esr102 --- unaffected
firefox107 --- unaffected
firefox108 --- unaffected
firefox109 + fixed

People

(Reporter: mayankleoboy1, Assigned: nika)

References

(Regression)

Details

(4 keywords)

Keywords: regression

[Tracking Requested - why for this release]:

Component: General → Performance
Product: Firefox → Core

Interestingly, I see an earlier spike on autoland to around the same level, and in both pushlog ranges I see bug 1790872 in there. Given that there's already another known perf regression from that bug (bug 1803798), it seems possible?

Flags: needinfo?(nika)

(In reply to Ryan VanderMeulen [:RyanVM] from comment #2)

Interestingly, I see an earlier spike on autoland to around the same level, and in both pushlog ranges I see bug 1790872 in there. Given that there's already another known perf regression from that bug (bug 1803798), it seems possible?

Given that I previously noticed a regression in Inferno-TodoMVC when there was another minor performance regression in IPC message sending performance due to changes to locking scheduling, unfortunately I can definitely believe that adding glean telemetry accumulation to IPC message sending could've caused this regression. I'll try to think of a way that we can reduce this overhead and keep the telemetry around, but there's definitely a chance we'll need to back it out.

On the bright side, we might not need to worry about 109, as the telemetry should be disabled outside of Nightly, meaning that beta's performance should be unimpacted.

Flags: needinfo?(nika)
Regressed by: 1790872
QA Whiteboard: [qa-regression-triage]

The bug is marked as tracked for firefox109 (nightly). We have limited time to fix this, the soft freeze is in 3 days. However, the bug still isn't assigned.

:plawless, could you please find an assignee for this tracked bug? Given that it is a regression and we know the cause, we could also simply backout the regressor. If you disagree with the tracking decision, please talk with the release managers.

For more information, please visit auto_nag documentation.

Flags: needinfo?(plawless)

(In reply to Release mgmt bot [:suhaib / :marco/ :calixte] from comment #5)

The bug is marked as tracked for firefox109 (nightly). We have limited time to fix this, the soft freeze is in 3 days. However, the bug still isn't assigned.

:plawless, could you please find an assignee for this tracked bug? Given that it is a regression and we know the cause, we could also simply backout the regressor. If you disagree with the tracking decision, please talk with the release managers.

As I mention in comment 3, this shouldn't impact beta/release as the new telemetry is nightly-only, however if we still need to back it out, we can. It'd be nice to get a bit of telemetry first from the nightly population before we back it out though.

I tried looking more at the telemetry profiles, and couldn't find anything obvious causing the slowdown, though likely the only possible cause is increased IPC message sending overhead due to the new telemetry.

I'll mark this as assigned to me for now, but if we want to do the backout, then feel free to take it.

Assignee: nobody → nika
Flags: needinfo?(plawless)
See Also: → 1803798

(In reply to Nika Layzell [:nika] (ni? for response) from comment #6)

I tried looking more at the telemetry profiles, and couldn't find anything obvious causing the slowdown, though likely the only possible cause is increased IPC message sending overhead due to the new telemetry.

Out of curiosity, how were the profiles generated? Were they from try, or from running things locally? Can you share the links?

Because this is Nightly only, the configuration is rare, so impact low.
But need to fix this anyhow, since this makes other performance work hard.

Performance Impact: --- → low

I'm getting bug 1790872 backed out, as some of the issues in other regressions will impact beta, and can't be easily fixed.

(In reply to Florian Quèze [:florian] from comment #7)

Out of curiosity, how were the profiles generated? Were they from try, or from running things locally? Can you share the links?

I clicked on the regression, found a run (e.g. before: https://treeherder.mozilla.org/jobs?repo=autoland&revision=ee38c5741bfe66889c3801d9daa5eb5e999de418&group_state=expanded&selectedTaskRun=PQLi3wG7ReCl2_8kheqTyA.0 and after: https://treeherder.mozilla.org/jobs?repo=autoland&revision=e4ac0dccafdddfa13e91dec0e852d001ca7c528e&group_state=expanded&selectedTaskRun=Id_FL64KQEuKcPXr5coqdQ.0), then clicked on the "Open in Firefox Profiler" buttons which were in the bottom section (before: https://share.firefox.dev/3HiYKnR, after: https://share.firefox.dev/3YbfB1S), and narrowed it down to the Inferno test. These aren't the specific ones which I used (not sure which ones I used), but are just two random ones I clicked on.

Fixed by backout.

Status: NEW → RESOLVED
Closed: 2 years ago
Resolution: --- → FIXED
Target Milestone: --- → 109 Branch
You need to log in before you can comment on or make changes to this bug.