Closed Bug 741417 Opened 13 years ago Closed 11 months ago

[meta] Reduce CC pauses when there are cycles to collect

Tracking

()

Status:

RESOLVED WORKSFORME

Project Flags:

blocking-kilimanjaro

People

(Reporter: mccr8, Unassigned)

References

(Depends on 1 open bug)

Details

(Keywords: meta)

Attachments

(2 files, 1 obsolete file)

run Shark during CCs 13 years ago Andrew McCreight [:mccr8] 1.56 KB, patch		Details \| Diff \| Splinter Review
run Shark during CCs 13 years ago Andrew McCreight [:mccr8] 1.92 KB, patch		Details \| Diff \| Splinter Review
perf report 13 years ago Justin Lebar (not reading bugmail) 73.22 KB, text/plain		Details

Andrew McCreight [:mccr8]

Reporter

Description

•

13 years ago

In the wake of the ongoing work in bug 716598, CC times are getting closer to being proportional to the amount of garbage being collected (modulo some fairly constant overhead being looked at in bug 722715). Therefore, it would be good to reduce CC pauses when there is actual garbage to collect.

Andrew McCreight [:mccr8]

Reporter

Comment 1

•

13 years ago

As an example, I opened Gmail, a MDN wiki page, and 5 or so TechCrunch pages, then closed them all using "close other tabs". This caused a 522ms CC pause. 416ms of this (79%) was MarkRoots(), 28ms (5%) for Unlink, 27ms for ScanRoots, 13ms for ClearGraph, 10ms for Root, 10ms for Unroot. So it seems like we need to focus on MarkRoots. The CC visited 45826 ref counted and 256681 GCed objects, and freed 45340 ref counted and 256174 GCed objects. Kyle's patch to merge together DOM nodes in the CC graph will help with reducing the number of ref counted nodes, but as you can see JS can take up a large chunk of the graph sometimes, so I think we need a strategy for speeding up JS, too. I'll try sharking CCs to see where we are spending time.

Andrew McCreight [:mccr8]

Reporter

Comment 2

•

13 years ago

Attached patch run Shark during CCs (obsolete) — Details — Splinter Review

Andrew McCreight [:mccr8]

Reporter

Comment 3

•

13 years ago

Attached patch run Shark during CCs — Details — Splinter Review

Attachment #611508 - Attachment is obsolete: true

Justin Lebar (not reading bugmail)

Comment 4

•

13 years ago

Attached file perf report — Details

This is what perf tells me is hot during cc (using the patch in bug 741652). I browsed to cnn, slashdot, nytimes, then cc'ed from about:memory.

Andrew McCreight [:mccr8]

Reporter

Comment 5

•

13 years ago

Thanks. For the purposes of this bug, I'm interested in what happens when the CC does a lot of work, for instance, after you close a tab.

Justin Lebar (not reading bugmail)

Comment 6

•

13 years ago

This may not be particularly meaningful, but perf annotate says that the hottest part of the PL_DHashTableOperate(PL_DHASH_ADD) is this if statement in SearchTable: /* Miss: return space for a new entry. */ if (PL_DHASH_ENTRY_IS_FREE(entry)) { Presumably this is a difficult-to-predict branch.

Justin Lebar (not reading bugmail)

Comment 7

•

13 years ago

(In reply to Andrew McCreight [:mccr8] from comment #5) > Thanks. For the purposes of this bug, I'm interested in what happens when > the CC does a lot of work, for instance, after you close a tab. I closed these tabs as soon as they loaded, so I think I was exercising the CC. Hard to say for sure, of course... > Presumably this is a difficult-to-predict branch. One thing you might be able to do is re-order your calls into the hashtable so that the result of that branch is more predictable. That is, if you have a bunch of nodes and you know some likely aren't in the hashtable, you could add them first (or last). cachegrind might be able to give you more insight into the branch mispredicts, and there's code to make it work with JS_EnableProfiling (I dunno if that code actually works... :).

Andrew McCreight [:mccr8]

Reporter

Comment 8

•

13 years ago

(In reply to Justin Lebar [:jlebar] from comment #7) > I closed these tabs as soon as they loaded, so I think I was exercising the > CC. Hard to say for sure, of course... Oh, okay, I didn't realize that. > > Presumably this is a difficult-to-predict branch. > > One thing you might be able to do is re-order your calls into the hashtable > so that the result of that branch is more predictable. That is, if you have > a bunch of nodes and you know some likely aren't in the hashtable, you could > add them first (or last). That's not really possible. We are traversing the graph in a breadth-first fashion, and looking up any children we find. > cachegrind might be able to give you more insight into the branch > mispredicts, and there's code to make it work with JS_EnableProfiling (I > dunno if that code actually works... :). Okay, thanks.

Justin Lebar (not reading bugmail)

Comment 9

•

13 years ago

> That's not really possible. We are traversing the graph in a breadth-first fashion, and looking > up any children we find. Could you go depth-first? I wouldn't bet that this would help, because I expect you'd have worse memory locality. Hm... Could you store metadata (e.g. the visited bit) in the objects themselves, rather than in a hashtable?

Justin Lebar (not reading bugmail)

Comment 10

•

13 years ago

> Hm... Could you store metadata (e.g. the visited bit) in the objects themselves, rather than in a > hashtable? So...what if we moved struct PtrInfo into CC participants? That would bloat every object by three words? Sorry, I'm probably not helping anymore. :)

Andrew McCreight [:mccr8]

Reporter

Comment 11

•

13 years ago

You only need PRUint32 mColor : 2; PRUint32 mInternalRefs : 30; EdgePool::Iterator mFirstChild. The rest is already present (ignoring the DEBUG_CC gunk). Then you'd have to add virtual accessors for those fields. We could also skip the iterator if we just use Traverse multiple times. The whole copy of the graph was initially done to avoid traversal overhead. It has succeeded in its goal, in that building the graph is 80% of the CC, but maybe it would be worth looking into the tradeoff again.

Justin Lebar (not reading bugmail)

Comment 12

•

13 years ago

(In reply to Andrew McCreight [:mccr8] from comment #11) > Then you'd have to add virtual accessors for those fields. That would be lame. We can't do this with inheritance?

Andrew McCreight [:mccr8]

Reporter

Comment 13

•

13 years ago

Cycle collected classes would have to inherit from some new nsICCSupports class, which I guess wouldn't be impossible.

Justin Lebar (not reading bugmail)

Comment 14

•

13 years ago

(In reply to Andrew McCreight [:mccr8] from comment #13) > Cycle collected classes would have to inherit from some new nsICCSupports > class, which I guess wouldn't be impossible. Yes. And two extra words of storage per object doesn't seem bad at all, in my completely uninformed opinion. :)

Lawrence Mandel [:lmandel] (use needinfo)

Comment 15

•

13 years ago

noming for Kilimanjaro as per conversation with Olli.

Blocks: k9o-perf

blocking-kilimanjaro: --- → ?

Andrew McCreight [:mccr8]

Reporter

Updated

•

13 years ago

Depends on: 749784

Asa Dotzler [:asa]

Comment 16

•

13 years ago

leaving this meta bug a nom for k9o in case other high-value dependencies get attached in the next week or so. marked the current depends as k9o blocker.

Andrew McCreight [:mccr8]

Reporter

Updated

•

13 years ago

Depends on: 754495

Andrew McCreight [:mccr8]

Reporter

Updated

•

13 years ago

Depends on: 756675

David Zbarsky (:dzbarsky)

Updated

•

13 years ago

No longer depends on: 756675

(PTO June 19-July 7) Ryan VanderMeulen

Updated

•

13 years ago

Depends on: 756675

David Zbarsky (:dzbarsky)

Updated

•

13 years ago

No longer depends on: 754495

Andrew McCreight [:mccr8]

Reporter

Updated

•

13 years ago

Depends on: 754495

David Zbarsky (:dzbarsky)

Updated

•

13 years ago

No longer blocks: 698919

David Zbarsky (:dzbarsky)

Updated

•

13 years ago

Blocks: 698919

Lawrence Mandel [:lmandel] (use needinfo)

Comment 17

•

13 years ago

All depends marked as k9o+. Please nom any new dependent bugs that are applicable for k9o.

blocking-kilimanjaro: ? → -

Andrew McCreight [:mccr8]

Reporter

Updated

•

12 years ago

Depends on: IncrementalCC

Andrew McCreight [:mccr8]

Reporter

Updated

•

12 years ago

Depends on: 653191

Andrew McCreight [:mccr8]

Reporter

Updated

•

11 years ago

Depends on: 1005500

BMO Automation

Updated

•

3 years ago

Severity: normal → S3

BugBot (nomail) [:suhaib / :marco/ :calixte]

Updated

•

3 years ago

Keywords: meta

Andrew McCreight [:mccr8]

Reporter

Updated

•

2 years ago

Component: XPCOM → Cycle Collector

Andrew McCreight [:mccr8]

Reporter

Comment 18

•

11 months ago

I think this meta bug has served its purpose, as no new dependent bugs have been added in a decade.

Status: NEW → RESOLVED

Closed: 11 months ago

Resolution: --- → WORKSFORME

You need to log in before you can comment on or make changes to this bug.