Closed Bug 897655 Opened 12 years ago Closed 12 years ago

Use off thread JS parsing when loading scripts from XUL documents

Status:

RESOLVED FIXED

Milestone:

mozilla26

People

(Reporter: bhackett1024, Assigned: bhackett1024)

Attachments

(8 files, 1 obsolete file)

patch (868ce514bba7), 12 years ago, Brian Hackett [Laid off!], 69.33 KB, patch
updated (868ce514bba7), 12 years ago, Brian Hackett [Laid off!], 72.54 KB, patch
boring parts, 12 years ago, Brian Hackett [Laid off!], 14.58 KB, patch, billm: review+
boring rooting changes, 12 years ago, Brian Hackett [Laid off!], 5.99 KB, patch, billm: review+
remove vestigial code, 12 years ago, Brian Hackett [Laid off!], 1.77 KB, patch, luke: review+
XUL changes, 12 years ago, Brian Hackett [Laid off!], 24.75 KB, patch, bzbarsky: review+
JS changes, 12 years ago, Brian Hackett [Laid off!], 25.51 KB, patch, billm: review+
block load events during parsing, 12 years ago, Brian Hackett [Laid off!], 3.52 KB, patch, bzbarsky: review+
blacklist for os x orange, 12 years ago, Brian Hackett [Laid off!], 1.64 KB, patch, billm: review+

Brian Hackett [Laid off!]

Assignee

Description

•

12 years ago

Attached patch patch (868ce514bba7) (obsolete) — Details — Splinter Review

With bug 875125 JS can now be parsed off the main thread, and since XUL documents already read in scripts asynchronously it would be good if they parsed the scripts asynchronously once reading is done. This both reduces main thread time on cold startups and helps with potentially removing the need for XDR'ed bytecode (instead always reading in scripts directly, as source is more compact than XDR).

A cold startup on my Mac spends about 200ms parsing/emitting scripts. About 1/3 of this parsing is triggered by scripts loaded from XUL documents. The attached patch changes these parses to happen off thread. Unfortunately, the 70ms of parsing that used to be on the main thread now takes about 310ms off thread. 220ms of the overhead, almost everything, is due to contention on the lock protecting the atoms table, and almost all this contention is because on Macs the browser triggers two parses for several rather large scripts (including browser.js) at almost the same time.

There are several options for dealing with this contention:

- Ignore it. Most of the contention is happening off the main thread and the delay in getting scripts parsed may not affect how long the browser takes to start. Also, the problem with parsing the same scripts multiple times doesn't affect non-Mac platforms (see bug 392650).

- Only allow one helper thread at a time (plus the main thread) to parse scripts.

- Modify the JS parser and VM so that locking is not required when using the atoms table. This would require using a fake thread local atoms table when parsing off thread and fixing up atom pointers and state when merging back onto the main thread.

I need to do more measurements to see which of these approaches is best, but posting the complete patch as is for now.
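To make the third option more concrete, here is a minimal sketch of the idea, not code from any of the attached patches. It assumes a toy atoms table that hands out integer ids instead of atom pointers; `ToyAtomTable` and `mergeAtoms` are hypothetical names invented for illustration. The helper thread fills a private table without locking, and the main thread later re-atomizes those strings into the shared table and builds a remapping used to fix up references.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical toy atoms table: maps each distinct string to a stable id.
class ToyAtomTable {
  public:
    // Returns the existing id for `s`, or assigns the next free id.
    size_t atomize(const std::string& s) {
        auto it = index_.find(s);
        if (it != index_.end())
            return it->second;
        size_t id = atoms_.size();
        atoms_.push_back(s);
        index_.emplace(s, id);
        return id;
    }

    const std::vector<std::string>& atoms() const { return atoms_; }

  private:
    std::vector<std::string> atoms_;
    std::unordered_map<std::string, size_t> index_;
};

// Merge step, intended to run on the main thread once parsing has finished:
// re-atomize every thread-local atom into the shared table and return a
// mapping from local id to shared id for fixing up references.
std::vector<size_t> mergeAtoms(const ToyAtomTable& local, ToyAtomTable& shared) {
    std::vector<size_t> remap;
    remap.reserve(local.atoms().size());
    for (const std::string& s : local.atoms())
        remap.push_back(shared.atomize(s));
    return remap;
}
```

The cost of this design is the fixup pass: every reference produced by the off-thread parse has to be rewritten through the remapping before the script can be used on the main thread.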

Brian Hackett [Laid off!]

Assignee

Comment 1

•

12 years ago

Attached patch updated (868ce514bba7) — Details — Splinter Review

The second option above seems reasonable. When only one helper thread, and possibly the main thread, are allowed to parse there is much less contention for the exclusive access lock and the total amount of time blocked goes from 220ms to 17ms.
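As a rough illustration of restricting parsing to one helper thread at a time, the sketch below uses a small gate built from a mutex and a condition variable. This is an assumption about how such a limit could be expressed in standard C++, not the mechanism used in the patch; `ParseGate` and `peakConcurrentParses` are hypothetical names.

```cpp
#include <algorithm>
#include <cassert>
#include <condition_variable>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical gate: at most `limit` helper threads may parse at once.
class ParseGate {
  public:
    explicit ParseGate(int limit) : limit_(limit) {}

    // Blocks until a parse slot is free, then claims it.
    void enter() {
        std::unique_lock<std::mutex> lock(mutex_);
        cv_.wait(lock, [this] { return active_ < limit_; });
        ++active_;
        maxActive_ = std::max(maxActive_, active_);
    }

    // Releases the slot and wakes one waiting helper.
    void leave() {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            --active_;
        }
        cv_.notify_one();
    }

    // Highest number of helpers that were ever inside the gate together.
    int maxActive() {
        std::lock_guard<std::mutex> lock(mutex_);
        return maxActive_;
    }

  private:
    std::mutex mutex_;
    std::condition_variable cv_;
    const int limit_;
    int active_ = 0;
    int maxActive_ = 0;
};

// Runs `tasks` simulated parses on separate helper threads and returns the
// peak number of parses that ran concurrently.
int peakConcurrentParses(int tasks, int limit) {
    ParseGate gate(limit);
    std::vector<std::thread> helpers;
    for (int i = 0; i < tasks; ++i) {
        helpers.emplace_back([&gate] {
            gate.enter();
            std::this_thread::yield();  // stand-in for the actual parsing work
            gate.leave();
        });
    }
    for (std::thread& t : helpers)
        t.join();
    return gate.maxActive();
}
```

With a limit of 1, helpers queue at the gate instead of contending for the exclusive access lock while parsing.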

Attachment #780571 - Attachment is obsolete: true

Brian Hackett [Laid off!]

Assignee

Comment 2

•

12 years ago

Attached patch boring parts — Details — Splinter Review

Splitting the above patch up for review. This has some boring changes allowing flattening and concatenation on ExclusiveContext threads (which own their zone, so this is fine) and a couple bugfixes.

Assignee: nobody → bhackett1024

Attachment #780668 - Flags: review?(wmccloskey)

Brian Hackett [Laid off!]

Assignee

Comment 3

•

12 years ago

Attached patch boring rooting changes — Details — Splinter Review

Some more boring stuff to add new rooting APIs and relax JS_AddRoot* roots so that they can hold null pointers. This means that while the main thread still needs to add/remove roots, they can be changed by helper threads (so long as those threads are paused during GC).

Attachment #780670 - Flags: review?(wmccloskey)

Brian Hackett [Laid off!]

Assignee

Comment 4

•

12 years ago

Attached patch remove vestigial code — Details — Splinter Review

Remove some vestigial parser code that is unnecessary after CPG.

Attachment #780673 - Flags: review?(luke)

Brian Hackett [Laid off!]

Assignee

Comment 5

•

12 years ago

Attached patch XUL changes — Details — Splinter Review

Changes to XUL interfaces so that XUL documents can parse their scripts off thread after the source has fully loaded.

Attachment #780676 - Flags: review?(bzbarsky)

Brian Hackett [Laid off!]

Assignee

Comment 6

•

12 years ago

Attached patch JS changes — Details — Splinter Review

Changes to off thread parsing to support the new API for triggering off thread parses and merging the finished scripts into the target compartment/zone after parsing finishes.

Attachment #780678 - Flags: review?(wmccloskey)

Luke Wagner [:luke]

Comment 7

•

12 years ago

Comment on attachment 780673 [details] [diff] [review] remove vestigial code my favorite kind of review

Attachment #780673 - Flags: review?(luke) → review+

Boris Zbarsky [:bzbarsky]

Comment 8

•

12 years ago

I'm sorry for the lag here. I should get to this tomorrow....

Luke Wagner [:luke]

Comment 9

•

12 years ago

I was looking at the other JS patches and thinking that it would be useful to JS_ASSERT(parseFinishedList.empty()) when we can to ensure that we aren't leaking ParseTasks. A first place to assert would be ~JSRuntime (since otherwise we'll leak them). Other than that, the JS engine doesn't have a good pinch point, but Gecko should know when there are no outstanding scripts compiling and be able to call some JS_AssertNoPendingScriptCompilations. (I see nsIOffThreadScriptReceiver isn't implemented yet, so this is a future request.)
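A minimal sketch of the kind of leak check being suggested, using a toy runtime rather than the real JSRuntime. `ToyRuntime`, `ToyParseTask` and the method names are hypothetical; only the idea of asserting that the finished list is empty at destruction comes from the comment above.

```cpp
#include <cassert>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical stand-in for a finished off-thread parse.
struct ToyParseTask {
    int scriptId;
};

class ToyRuntime {
  public:
    // Any task still queued here would be leaked, so assert there is none.
    ~ToyRuntime() { assert(parseFinishedList_.empty()); }

    // Called when a helper finishes parsing a script.
    void finishParse(int scriptId) {
        parseFinishedList_.push_back(
            std::unique_ptr<ToyParseTask>(new ToyParseTask{scriptId}));
    }

    // Called by the embedder to consume a finished parse.
    // Returns null when nothing is pending.
    std::unique_ptr<ToyParseTask> takeFinished() {
        if (parseFinishedList_.empty())
            return nullptr;
        std::unique_ptr<ToyParseTask> task = std::move(parseFinishedList_.back());
        parseFinishedList_.pop_back();
        return task;
    }

    bool hasPendingFinishedParses() const { return !parseFinishedList_.empty(); }

  private:
    std::vector<std::unique_ptr<ToyParseTask>> parseFinishedList_;
};
```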

Brian Hackett [Laid off!]

Assignee

Comment 10

•

12 years ago

(In reply to Luke Wagner [:luke] from comment #9)
> I was looking at the other JS patches and thinking that it would be useful
> to JS_ASSERT(parseFinishedList.empty()) when we can to ensure that we aren't
> leaking ParseTasks. A first place to assert would be ~JSRuntime (since
> otherwise we'll leak them). Other than that, the JS engine doesn't have a
> good pinch point, but Gecko should know when there are no outstanding
> scripts compiling and be able to call some
> JS_AssertNoPendingScriptCompilations. (I see nsIOffThreadScriptReceiver
> isn't implemented yet, so this is a future request.)

Actually, XULDocument is an nsIOffThreadScriptReceiver. I'm not sure where a good place would be to assert this in gecko since the script loading is already done asynchronously so we'd basically end up with some counter somewhere that needs to be updated in a threadsafe manner and would be updated at similar points to where parseFinishedList changes, which doesn't seem to add much.

Luke Wagner [:luke]

Comment 11

•

12 years ago

(In reply to Brian Hackett (:bhackett) from comment #10)
> Actually, XULDocument is an nsIOffThreadScriptReceiver.

Oops, I missed that. I agree just adding some on-the-side counter won't add much.

bz: by any chance do you know of any existing conditions in Gecko where we know there are no outstanding documents being loaded?

Boris Zbarsky [:bzbarsky]

Comment 12

•

12 years ago

Comment on attachment 780676 [details] [diff] [review]
XUL changes

>+++ b/content/xul/content/src/nsXULElement.cpp
>+ nsIOffThreadScriptReceiver *aOffThreadReceiver /* = NULL */)

nullptr, please.

>+++ b/content/xul/content/src/nsXULElement.h
>+ nsIOffThreadScriptReceiver *aOffThreadReceiver = NULL);

And here.

>+++ b/content/xul/document/src/XULDocument.cpp
>+ // We will be notified via OnOffThreadCompileComplete when the
>+ // compile finishes. Keep the contents of the compiled script
>+ // alive until the compilation finishes.
>+ mOffThreadCompileString = stringStr;

This isn't actually safe: depending on exactly how stringStr was filled with data this assignment may copy. It's probably better to assert that mOffThreadCompileString is empty, then assign it before the Compile() call, and pass its .get() and .Length(). Then if the Compile call returns an error, Truncate() it.

>+XULDocument::OnScriptCompileComplete(JSScript *aScript, nsresult aStatus)

The first arg should probably be a JS::Handle<JSScript*>. Or is it a non-handle on purpose? If so, it's worth documenting that and why that is...

>+ mOffThreadCompileString = nullptr;

mOffThreadCompileString.Truncate(). Or if you want the "null string" behavior then SetIsVoid(), I guess, but just making it empty should be fine.

>+ // Clear mCurrentScriptProto now, but save it first for use below in
>+ // the compile/execute code

There is no compile code below, just execute.

You should probably add nsIOffThreadScriptReceiver to the QueryInterface implementation for XULDocument... See more on this below.

>+++ b/dom/base/nsIScriptContext.h

Please give the new interface an IID, and maybe put it and its IID after nsIScriptContext? You can forward-declare it for use in CompileScript.

>+ nsIOffThreadScriptReceiver* aOffThreadReceiver = NULL,

nullptr.

>+++ b/dom/base/nsJSEnvironment.cpp
>+ nsRefPtr<nsIOffThreadScriptReceiver> mReceiver;

nsCOMPtr, please.

>+ JS::FinishOffThreadScript(nsJSRuntime::sRuntime, mScript);
>+ return mReceiver->OnScriptCompileComplete(mScript, mScript ? NS_OK : NS_ERROR_FAILURE);

The first call effectively unroots mScript, according to the comments where mScript is defined. What guarantees that it won't die during the middle of the second call? I guess the callee is expected to immediately root it? If so, that's worth documenting on the nsIOffThreadScriptReceiver interface.

>+ nsIOffThreadScriptReceiver* aOffThreadReceiver /* = NULL */,

nullptr.

>+ aOffThreadReceiver->AddRef();

NS_ADDREF(aOffThreadReceiver);

>+++ b/dom/base/nsJSEnvironment.h
>+ nsIOffThreadScriptReceiver* aOffThreadReceiver = NULL,

nullptr.

r=me with the above nits fixed.
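The buffer-lifetime pattern suggested above for mOffThreadCompileString can be sketched in standard C++ roughly as follows. This is only an analogy under stated assumptions: std::string stands in for the Gecko string class, data()/size()/clear() stand in for .get()/.Length()/Truncate(), and `ToyCompiler` and `ToyDocument` are hypothetical types, not the real XULDocument or script context API.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>

// Hypothetical off-thread compiler: it keeps the raw pointer and length it
// was given, so the caller must keep that buffer alive until completion.
struct ToyCompiler {
    const char* chars = nullptr;
    size_t length = 0;
    bool failNext = false;

    bool compile(const char* aChars, size_t aLength) {
        if (failNext)
            return false;
        chars = aChars;
        length = aLength;
        return true;
    }
};

class ToyDocument {
  public:
    // Store the source in the member first, then hand the compiler a pointer
    // into the member's own buffer, so no later copy can invalidate it.
    bool startCompile(ToyCompiler& compiler, std::string source) {
        assert(mOffThreadCompileString.empty());
        mOffThreadCompileString = std::move(source);
        if (!compiler.compile(mOffThreadCompileString.data(),
                              mOffThreadCompileString.size())) {
            mOffThreadCompileString.clear();  // analogous to Truncate() on error
            return false;
        }
        return true;
    }

    // Completion callback: the source is no longer needed.
    void onScriptCompileComplete() { mOffThreadCompileString.clear(); }

    const std::string& buffer() const { return mOffThreadCompileString; }

  private:
    std::string mOffThreadCompileString;
};
```

The point of the ordering is that the pointer passed to the compiler always refers to storage owned by the long-lived member, never to a temporary that might be copied or freed.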

Attachment #780676 - Flags: review?(bzbarsky) → review+

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 13

•

12 years ago

Comment on attachment 780668 [details] [diff] [review]
boring parts

Review of attachment 780668 [details] [diff] [review]:
-----------------------------------------------------------------

::: js/src/jsatom.cpp
@@ -212,5 @@
> /* We treat static strings as interned because they're never collected. */
> if (StaticStrings::isStatic(atom))
> return true;
>
> - AtomSet::Ptr p = cx->runtime()->atoms.lookup(atom);

Oops. Can we make this private or something so that you have to access it through the ExclusiveContext getter?

::: js/src/vm/String.cpp
@@ +400,5 @@
> }
>
> template <AllowGC allowGC>
> JSString *
> +js::ConcatStrings(ExclusiveContext *cx,

Shu is already making this a ThreadSafeContext. I think the right thing to do is to change ScopedThreadSafeStringInspector::ensureChars so that it calls ensureLinear even for ExclusiveContexts. Right now it will copy the chars out, which is probably less efficient.

Attachment #780668 - Flags: review?(wmccloskey) → review+

Brian Hackett [Laid off!]

Assignee