<a class="header-button" href="https://bugzilla.mozilla.org/home" title="Go to home page"> Bugzilla

Assignee

Comment 2

•

5 months ago

I'm investigating to see if this is indeed the offending patch set and if so what parts of it are so offensive.

I have a couple of compare views going:

Revert patch set off of central:
https://perf.compare/compare-results?baseRev=8bff8a5c5f999dd9b728bf21b8eaca5cced84295&newRev=4ed762a2845da7abb629064502b0309302f70815&baseRepo=try&newRepo=try&framework=13
Clamp perf comparison to base rev before patchset was introduced to tip of set:
https://perf.compare/compare-results?baseRev=3b480041751b78102a1a644167e7b0ab26e2b026&baseRepo=autoland&newRev=d684010261d6886b7cae29c5f7299f4b7d48afca&newRepo=autoland&framework=13&search=speedometer
Remove pieces of patch that might be hot paths.
https://perf.compare/compare-results?baseRev=8bff8a5c5f999dd9b728bf21b8eaca5cced84295&newRev=2331ae5b7921c408b600ba41e2959853fb9692dc&baseRepo=try&newRepo=try&framework=13

So far none of these comparisons are showing similar regressions to what alert 2113 shows.

Flags: needinfo?(eitan)

BugBot [:suhaib / :marco/ :calixte]

Assignee

Updated

•

5 months ago

Severity: -- → S3

Priority: -- → P1

Comment 3

•

4 months ago

Set release status flags based on info from the regressing bug 1769586

status-firefox133: --- → affected

Comment 4

•

4 months ago

Are you still looking into this, Eitan?

Flags: needinfo?(eitan)

Assignee

Comment 5

•

4 months ago

I am. The fact that this is only a regression in a subtest makes it hard to profile. I'm working on this right now.

Flags: needinfo?(eitan)

Denis Palmeiro [:denispal]

Updated

•

4 months ago

Assignee: nobody → eitan

Comment 6

•

4 months ago

(In reply to Eitan Isaacson [:eeejay] from comment #5)

I am. The fact that this is only a regression in a subtest makes it hard to profile. I'm working on this right now.

Hi Eitan, you can also run just one subtest which can help when profiling by using this url: https://browserbench.org/Speedometer3.0?suites=Charts-observable-plot&iterationCount=100

Mike Conley (:mconley) (:⚙️)

Comment 7

•

4 months ago

FYI, bug 1769586 backs out cleanly from Beta if we don't want to ship this regression in 132. Note that we go to RC next week and have 2 betas left this cycle.

Updated

•

4 months ago

Performance Impact: --- → ?

Mike Conley (:mconley) (:⚙️)

Comment 8

•

4 months ago

Hey Bas, the regressor is part of our Interop 2024 initiative, however this also appears to impact SP3... any thoughts on what gets priority?

Flags: needinfo?(bas)

Assignee

Comment 9

•

4 months ago

Attached file Bug 1920082 - Check if mAttrElementsMap before CC traversing it. r?#dom-core — Details

I'm not sure why this helps, but it looks like checking if the
nsTHashMap is empty before iterating speeds things up.

https://perf.compare/compare-results?baseRev=26bd35ecde28cc0babc2093a65f4317303a50118&newRev=f887dd15962a57a81842a7190f4fef1287cf4d88&baseRepo=try&newRepo=try&framework=13

Assignee

Comment 10

•

4 months ago

Denis Palmeiro [:denispal]

Comment 11

•

4 months ago

I collected better profiles on Android where this is also reproducible.

Before: https://share.firefox.dev/48qvKGB
After: https://share.firefox.dev/4f6O21v

I didn't look at the regressing patch very closely, but did the patch add any additional GC objects or change the frequency of a CC or GC at all that may have caused a GC to occur? The major difference between the two profiles seems to come from a GC major slice caused in the bad profile which is not present before the patch: https://share.firefox.dev/4f3FxEq

Pulsebot

Comment 12

•

4 months ago

Pushed by eisaacson@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/a93b274a0bcd Check if mAttrElementsMap before CC traversing it. r=dom-core,jjaschke

https://hg.mozilla.org/mozilla-central/rev/a93b274a0bcd

Assignee

Comment 13

•

4 months ago

That is the same conclusion I came to. I think the Traverse phase in FragmentOrElement::nsExtendedDOMSlots might have taken a hit. I queued a patch that I hope speeds things up. The more extreme solution would be to have mAttrElementsMap on the heap or wrap it in a Maybe so we can skip it entirely when it is not used (99.9999% of the time).

Iulian Moraru

Comment 14

•

4 months ago

bugherder

Status: NEW → RESOLVED

Closed: 4 months ago

status-firefox133: affected → fixed

Resolution: --- → FIXED

Target Milestone: --- → 133 Branch

Denis Palmeiro [:denispal]

Comment 15

•

4 months ago

Unfortunately it doesn't seem like the attached patch fixed the regression, you can see here that the Dotted-sync test is still around 8-9ms on Windows when it was 7.5ms before the regression.

Comment 16

•

4 months ago

•

Edited

In the future, I think it would be better to land patches like this in separate bugs so that we can leave the bug open until we are sure the regression is fixed.

Comment 17

•

4 months ago

Let's reopen this bug for now. Usually I'm not a fan of having open bugs with landed patches, because it makes regression and uplift tracking harder, but in this case it's unlikely that the landed patch requires this kind of tracking.

Status: RESOLVED → REOPENED

status-firefox133: fixed → affected

Resolution: FIXED → ---

Target Milestone: 133 Branch → ---

Bas Schouten (:bas.schouten)

Updated

•

4 months ago

Blocks: speedometer3

Updated

•

4 months ago

Flags: needinfo?(bas)

Jeff Muizelaar [:jrmuizel]

Comment 18

•

4 months ago

I think we should actually back this out. SP3 is our flagship benchmark, it seems like we don't completely understand the regression and keeping it in Nightly will impact our ability to measure other changes in these numbers.

Assignee

Comment 19

•

4 months ago

I'm trying to come up with other solutions to this. Don't fully understand what is happening. I hope to post something today or tomorrow. This patch was is a significant step in our a11y interop-2024, so its worth fighting for.

Comment 20

•

4 months ago

Bug 1769586 has been backed out from both affected branches and the patch from this bug was backed out also. The remaining investigation can happen in bug 1769586.
https://hg.mozilla.org/integration/autoland/rev/5351bf4ac8f676c01bdd7f26254650cdc4cde2cd

Status: REOPENED → RESOLVED

Closed: 4 months ago → 4 months ago

status-firefox132: affected → fixed

status-firefox133: affected → fixed

Resolution: --- → FIXED

Comment 21

•

4 months ago

I can confirm Denis' suspicion about GC - it does seem that the problem is that bug 1769586 causes us to allocate more GC memory during the test. We then exceed the GC thresholds earlier and run GC slices earlier and more frequently.

Steps to reproduce:

Go to https://www.browserbench.org/Speedometer3.0/?suite=Charts-observable-plot&iterationCount=100
Start the Firefox profiler with the "Nightly" preset.
Click "Start Test"
When the test is done, capture the profile.
Go to the marker chart and search for "gcslice,usertiming"
Using shift+mousewheel, zoom to the first GCSlice marker.
In the UserTiming rows, check which iteration the first GCSlice ran in.

On my machine, the first GCSlice runs during iteration-18 in builds before bug 1769586, and in iteration-16 in builds after bug 1769586. I can reproduce this very consistently.
Before: https://share.firefox.dev/3BNOZ0h
After: https://share.firefox.dev/4frwXjd

Comment 22

•

4 months ago

JS allocation profiling says that, during the first 14 iterations, Document.createElementNS allocates 28744 bytes on the JS heap before bug 1769586 and 59416 bytes after, which is pretty close to 2x. I wonder if the JS wrapper for DOM elements has doubled in size due to the new properties.
Before: https://share.firefox.dev/3Af4Rsd
After: https://share.firefox.dev/4h75SDr

Assignee

Comment 23

•

4 months ago

Thanks Markus,
This makes sense. I was looking for hot paths in the patch. But it looks like the this is possibly due to the new FrozenArray attributes in the ARIAMixin.webidl.

Peter, is there a chance that the added attributes are making the js DOM wrappers that much bigger?

Flags: needinfo?(peterv)

Peter Van der Beken [:peterv]

Assignee

Comment 24

•

4 months ago

For reference, the FrozenArray support was added in bug 1891784.

Comment 25

•

4 months ago

Each attribute marked with [FrozenArray] requires a reserved slot on the JS object. We went from 1 to 8 slots, so I think that means the size of the JSObject went from a size of 4 64-bit values to 10 (JSObject_slots2 to JSObject_slots8). I'll try to make this more lazily allocated, but it's annoying because it means more special cases in the generated code.

Flags: needinfo?(peterv)