Closed Bug 761723 (savesource) Opened 12 years ago Closed 12 years ago

implement toString of function objects by saving source

Categories

(Core :: JavaScript Engine, enhancement)

enhancement
Not set
normal

Tracking


RESOLVED FIXED
mozilla17

People

(Reporter: Benjamin, Assigned: Benjamin)

References

Details

(Keywords: addon-compat, Whiteboard: [js:t])

Attachments

(6 files, 36 obsolete files)

470.38 KB, patch
Details | Diff | Splinter Review
5.01 KB, patch
8.17 KB, patch
10.72 KB, patch
6.87 KB, patch
2.58 KB, patch
I think this can be implemented by having frontend::CompileScript saving the whole source in the toplevel script object. Individual function scripts can simply store an offset. The source should also be compressed if that makes it smaller.
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → DUPLICATE
Not a duplicate of bug 747831, but a follow-up, IMHO.
Status: RESOLVED → REOPENED
Depends on: 747831
Resolution: DUPLICATE → ---
Status: REOPENED → NEW
Depends on: 763651
Attached patch save and compress source (obsolete) — Splinter Review
Here's a grand, unified patch to do this. It includes
- compression of source on a background thread with zlib
- nastiness required for toString on functions that inherit their strict mode lexically
- nastiness needed for the function constructor
- redirecting the jsapi to the decompiler
The build system changes are a total hack and actually applying compression will be blocked on bug 763651.
Attachment #632032 - Flags: review?(jorendorff)
Do other browsers implement a similar strict-mode kludge?  Be bold in removing it and attempting to reduce complexity if not!  (Probably will need to tell the fuzzpeople about the change, tho.)  Compatibility with the decompiler is not a priority here, because the entire change is a compatibility break in and of itself.  And web compatibility's probably negligible too, since strict mode doesn't get used on the web very much yet.
Comment on attachment 632032 [details] [diff] [review]
save and compress source

Review of attachment 632032 [details] [diff] [review]:
-----------------------------------------------------------------

I've done a light, partial review of the parts I'm familiar with -- JSScript, Parser, BytecodeEmitter, TreeContext.

Overall it seems quite impressive -- well done.

The most important thing missing is an idea of how much extra memory this causes us to use.  After all, memory savings were the entire motivation for the decompiler, AIUI.  

To do those measurements:

- Create a function JSScript::sizeOfSource() that is just like JSScript::sizeOfData().
- Use it in StatsCellCallback() within MemoryMetrics.cpp.
- In XPCJSRuntime.cpp, add a "script-source" reporter, similar to the existing "script-data" one.  You'll need to add a scriptSource field to JS::CompartmentStats.
- In the same file, update rtStats.totalScripts as well to include the new number.

I'm happy to formally review these memory reporter changes once you implement them, because I'm very familiar with that code.

With that in place, you'll be able to consult about:memory in the browser to see how much memory is being used.

::: js/src/frontend/Parser.cpp
@@ +1271,5 @@
>          return false;
>      }
>  
> +    tc->sc->funbox()->bufStart = tokenStream.offsetOfToken(tokenStream.currentToken());
> +

I would set this when the FunctionBox is created in functionDef().  You could even do it by passing the offset to newFunctionBox() and then onto the FunctionBox constructor.

::: js/src/frontend/TokenStream.h
@@ +670,5 @@
>          bool atStart() const {
>              return ptr == base;
>          }
>  
> +        const jschar *getBase() const {

Just call it base(), and rename the field as base_.  In SpiderMonkey, a "get" prefix to a method usually indicates that it's fallible.

::: js/src/jsapi.cpp
@@ +818,5 @@
>  #ifdef JS_METHODJIT_SPEW
>      JMCheckLogging();
>  #endif
>  
> +    scriptSources = NULL;

Initialize this in the constructor?

::: js/src/jsfun.cpp
@@ +563,5 @@
> +    *bodyStart = p - chars;
> +    for (; unicode::IsSpaceOrBOM2(end[-1]); end--)
> +        ;
> +    if (end[-1] == '}') {
> +        end--;

I suspect Waldo will have a heart attack with all the raw pointer arithmetic going on here.  Can it be avoided somehow?  TokenStream::TokenBuf hides the pointer arithmetic when we're doing vanilla tokenizing -- can it be reused here?

@@ +592,5 @@
> +            return NULL;
> +        const jschar *chars = src->getChars(cx);
> +        if (!chars)
> +            return NULL;
> +        bool funCon = !(flags & JSFUN_EXPR_CLOSURE) && chars[0] != '(';

Is the chars[0] bit fragile?  Feels like the Parser should instead set a flag somewhere and this code should read that flag.

@@ +593,5 @@
> +        const jschar *chars = src->getChars(cx);
> +        if (!chars)
> +            return NULL;
> +        bool funCon = !(flags & JSFUN_EXPR_CLOSURE) && chars[0] != '(';
> +        bool addUseStrict = script()->strictModeCode && !script()->explicitUseStrict;

I'm having real trouble understanding this function.  I don't understand why |explicitUseStrict| is needed in the first place, and then |addUseStrict| is used multiple times, which is unexpected.

The control flow is quite hard to follow.  Look at the conditions in isolation:

 if (!bodyOnly && funCon)
 if ((bodyOnly && !funCon) || addUseStrict)
 if (addUseStrict && !out.append(chars + bodyEnd, realLength - bodyEnd))
 if (!bodyOnly && funCon && !out.append(" }"))
 if (!bodyOnly && !pretty && flags & JSFUN_LAMBDA)
 else if (bodyOnly && flags & JSFUN_EXPR_CLOSURE)

Whoa.  It's really unclear to me the interplay between bodyOnly, funCon and addUseStrict, even after I saw the |JS_ASSERT(!funCon || !addUseStrict)|.  And there is only one comment for the entire function.

I suspect that some refactoring could make this a lot clearer, even if that results in a small amount of duplicated code.  Moving some of the code into separate functions might also help.

@@ +632,5 @@
> +            src = js_NewDependentString(cx, src, bodyStart, bodyEnd - bodyStart);
> +            if (!src)
> +                return NULL;
> +            if (addUseStrict &&
> +                (((flags & JSFUN_EXPR_CLOSURE) && !out.append("/* use strict */")) ||

Why is the |/* use strict */| necessary?

::: js/src/jsscript.cpp
@@ +1534,5 @@
>          if (!script->filename)
>              return NULL;
>      }
> +    script->source = bce->source;
> +    JS_ASSERT(script->source);

Note that bug 758509 is about to change the way JSScripts are initialized.  You'll be able to initialize the source-related fields in CreateScript(), which happens before the BytecodeEmitter is created.  The change will be fairly simple.

@@ +1594,5 @@
>          bce->regexpList.finish(script->regexps());
>      if (bce->constList.length() != 0)
>          bce->constList.finish(script->consts());
>      script->strictModeCode = bce->sc->inStrictMode();
> +    script->explicitUseStrict = bce->sc->hasExplicitUseStrict();

Bug 758509 will affect this as well, but the change will be simple.

@@ +2075,5 @@
>  
>      /* Script filenames are runtime-wide. */
>      dst->filename = src->filename;
>  
> +    dst->source = src->source;

Bug 758509 will affect this as well, but the change will be simple.

::: js/src/jsscript.h
@@ +451,5 @@
>  
>      js::HeapPtrFunction function_;
>  
> +    size_t sourceStart;
> +    size_t sourceEnd;

We have *lots* of JSScripts, so we want them to be as small as possible.  Could these two fields be uint32_t instead?

Similarly, does every JSScript need a pointer to ScriptSource?  A lot of JSScripts will point to the same ScriptSource -- can you take advantage of that to save space?

Maybe you could just store a jschar* pointer and a uint32_t length?

::: js/src/jsutil.h
@@ +337,5 @@
>  
> +/*
> + * Attempt to compress some bytes. Return true if compression produced a
> + * string smaller than the input. The caller is responsible for allocating
> + * |out| to the length of input.

"The caller is responsible for allocating |out| to the length of input."  I don't understand that sentence.

@@ +344,5 @@
> +                       unsigned char *out, size_t *outlen);
> +
> +/*
> + * Decompress a string. The caller must know the length of the output and
> + * allocate |out| to that length.

"allocate |out| to that length" -- ditto.
(In reply to Jeff Walden [:Waldo] (busy, try to prefer other reviewers if possible) from comment #4)
> Do other browsers implement a similar strict-mode kludge?  Be bold in
> removing it and attempting to reduce complexity if not!  (Probably will need
> to tell the fuzzpeople about the change, tho.)  Compatibility with the
> decompiler is not a priority here, because the entire change is a
> compatibility break in and of itself.  And web compatibility's probably
> negligible too, since strict mode doesn't get used on the web very much yet.

I agree with this. I "implemented" this because we have a test for it. Does that change your opinion?
Benjamin, since you've already taken the trouble of implementing this "use strict" hack, I say we keep it for now.

All other things equal, it's slightly better to do a massive implementation overhaul in one patch, and change visible behavior in a separate patch after that. However I agree with Waldo we want to rip that out if other engines aren't doing it. Easy follow-up patch if so.
Tests are more like "guidelines" than "rules" when you're as far off in the weeds as this.  You should always give a thought to whether tests for our own non-standard behaviors (Function.prototype.toString is very under-specified, so its exact output counts here) can just be modified, if they're getting in the way of goodness.  What other engines do is also worth considering on the point.  Tests can make a good argument, definitely, but we'll change the tests if it's worth it.
Comment on attachment 632032 [details] [diff] [review]
save and compress source

Clearing review, since njn already did a quick pass here. Benjamin, please ponder those comments and r?me on a new patch.
Attachment #632032 - Flags: review?(jorendorff)
Fuzzers: This change will basically leave the decompiler attack surface exposed, but only via js_DecompileValueGenerator (used to generate various error messages), which our existing tests (and perhaps our existing fuzzers) don't cover well.
(In reply to Nicholas Nethercote [:njn] from comment #5)
> Comment on attachment 632032 [details] [diff] [review]
> save and compress source
> 
> Review of attachment 632032 [details] [diff] [review]:
> -----------------------------------------------------------------

> The most important thing missing is an idea of how much extra memory this
> causes us to use.  After all, memory savings were the entire motivation for
> the decompiler, AIUI.  
> 
> To do those measurements:
> 
> - Create a function JSScript::sizeOfSource() that is just like
> JSScript::sizeOfData().
> - Use it in StatsCellCallback() within MemoryMetrics.cpp.
> - In XPCJSRuntime.cpp, add a "script-source" reporter, similar to the
> existing "script-data" one.  You'll need to add a scriptSource field to
> JS::CompartmentStats.
> - In the same file, update rtStats.totalScripts as well to include the new
> number.
> 
> I'm happy to formally review these memory reporter changes once you
> implement them, because I'm very familiar with that code.

Great. I'll submit a patch doing this on top of a revised one I'm churning up now.

> 
> With that in place, you'll be able to consult about:memory in the browser to
> see how much memory is being used.
> 
> ::: js/src/frontend/Parser.cpp
> @@ +1271,5 @@
> >          return false;
> >      }
> >  
> > +    tc->sc->funbox()->bufStart = tokenStream.offsetOfToken(tokenStream.currentToken());
> > +
> 
> I would set this when the FunctionBox is created in functionDef().  You
> could even do it by passing the offset to newFunctionBox() and then onto the
> FunctionBox constructor.

It's quite convenient to do it here as we've just seen the left paren of the arguments (see the getToken() above); this is where I want the source data to start.

> 
> ::: js/src/jsfun.cpp
> @@ +563,5 @@
> > +    *bodyStart = p - chars;
> > +    for (; unicode::IsSpaceOrBOM2(end[-1]); end--)
> > +        ;
> > +    if (end[-1] == '}') {
> > +        end--;
> 
> I suspect Waldo will have a heart attack with all the raw pointer arithmetic
> going on here.  Can it be avoided somehow?  TokenStream::TokenBuf hides the
> pointer arithmetic when we're doing vanilla tokenizing -- can it be reused
> here?

TokenBuf is private to TokenStream. I can understand that it's a useful abstraction amid the complexity of the tokenizer, but this is a fairly vanilla search through the string. Here, IMO, calling functions like getRawChar() instead of *p++ just obscures the flow.

> 
> @@ +592,5 @@
> > +            return NULL;
> > +        const jschar *chars = src->getChars(cx);
> > +        if (!chars)
> > +            return NULL;
> > +        bool funCon = !(flags & JSFUN_EXPR_CLOSURE) && chars[0] != '(';
> 
> Is the chars[0] bit fragile?  Feels like the Parser should instead set a
> flag somewhere and this code should read that flag.

The parser ensures the source data always starts with a '(' except in the case of function constructors. The !(flags & JSFUN_EXPR_CLOSURE) condition is not needed; I merely added it for safety.

> 
> @@ +593,5 @@
> > +        const jschar *chars = src->getChars(cx);
> > +        if (!chars)
> > +            return NULL;
> > +        bool funCon = !(flags & JSFUN_EXPR_CLOSURE) && chars[0] != '(';
> > +        bool addUseStrict = script()->strictModeCode && !script()->explicitUseStrict;
> 
> I'm having real trouble understanding this function.  I don't understand why
> |explicitUseStrict| is needed in the first place, and then |addUseStrict| is
> used multiple times, which is unexpected.
> 
> The control flow is quite hard to follow.  Look at the conditions in
> isolation:
> 
>  if (!bodyOnly && funCon)
>  if ((bodyOnly && !funCon) || addUseStrict)
>  if (addUseStrict && !out.append(chars + bodyEnd, realLength - bodyEnd))
>  if (!bodyOnly && funCon && !out.append(" }"))
>  if (!bodyOnly && !pretty && flags & JSFUN_LAMBDA)
>  else if (bodyOnly && flags & JSFUN_EXPR_CLOSURE)

Yes, it took quite a bit of testing to get that right. :)

> 
> Whoa.  It's really unclear to me the interplay between bodyOnly, funCon and
> addUseStrict, even after I saw the |JS_ASSERT(!funCon || !addUseStrict)|. 
> And there is only one comment for the entire function.
> 
> I suspect that some refactoring could make this a lot clearer, even if that
> results in a small amount of duplicated code.  Moving some of the code into
> separate functions might also help.
> 
> @@ +632,5 @@
> > +            src = js_NewDependentString(cx, src, bodyStart, bodyEnd - bodyStart);
> > +            if (!src)
> > +                return NULL;
> > +            if (addUseStrict &&
> > +                (((flags & JSFUN_EXPR_CLOSURE) && !out.append("/* use strict */")) ||
> 
> Why is the |/* use strict */| necessary?

It merely imitates what the decompiler does.

> ::: js/src/jsscript.h
> @@ +451,5 @@
> >  
> >      js::HeapPtrFunction function_;
> >  
> > +    size_t sourceStart;
> > +    size_t sourceEnd;
> 
> We have *lots* of JSScripts, so we want them to be as small as possible. 
> Could these two fields be uint32_t instead?

It's entirely possible. I used size_t because that's what we get from the tokenizer, and I didn't know if it was safe to downcast. I suppose JS source files larger than 4GB are pretty rare...

WRT script size, it's also important to remember that killing the decompiler will allow the removal of many bytes of source notes and extra bytecode.

> 
> Similarly, does every JSScript need a pointer to ScriptSource?  A lot of
> JSScripts will point to the same ScriptSource -- can you take advantage of
> that to save space?

Do scripts participate in a parent/children relationship? It might be possible to store the pointer at the top of the hierarchy then. The JSScript struct will still need a pointer, though.

> 
> Maybe you could just store a jschar* pointer and a uint32_t length?

We need metadata for compression and GCing. Note only one ScriptSource is created per compiler call.

> 
> @@ +344,5 @@
> > +                       unsigned char *out, size_t *outlen);
> > +
> > +/*
> > + * Decompress a string. The caller must know the length of the output and
> > + * allocate |out| to that length.
> 
> "allocate |out| to that length" -- ditto.

I'm not sure what's confusing here. "that length" refers to the length of the output provided by the caller.
(In reply to Jason Orendorff [:jorendorff] from comment #10)
> Fuzzers: This change will basically leave the decompiler attack surface
> exposed, but only via js_DecompileValueGenerator (used to generate various
> error messages), which our existing tests (and perhaps our existing fuzzers)
> don't cover well.

Please add a testing function to debug shells so we can test the exposed surface more directly:

* decompile(fun)
* decompileValueGenerator(fun, bytecodeOffset) ???

Do you still want the fuzzer to find "round-trip" bugs, when decompile() returns something bogus?
What does this change win us?  The worst complexity of the decompiler is also exposed by js_DecompileValueGenerator, isn't it?
Have you considered HTTP cache pinning as an alternative to saving (and compressing!?) an extra copy of the function source?
In current shells, js_DecompileValueGenerator can cause an entire function to be decompiled:

js> ((function(x) { return null })()).p
typein:4: TypeError: function (x) {return null;}() is null

What will happen with this patch?
> TokenBuf is private to TokenStream. I can understand it is a useful
> abstraction in the complexity of the tokenizer, but this is a fairly vanilla
> search through the string. Here, IMO, calling functions like getRawChar()
> for *p++ just obscures the flow.

Waldo will still have a heart attack, trust me!  Maybe you can use mfbt/RangedPtr.h somehow to get some overflow checks?

> > Why is the |/* use strict */| necessary?
> 
> It merely imitates what the decompiler does.

Can you explain what the decompiler does?


> WRT to script size, it's also important to remember that killing the
> decompiler will allow the removal of many bytes of source notes and extra
> bytecode.

Mmm, good point.  Still, if you can think of a way to make it a bit smaller that doesn't hurt.


> Do scripts participate in a parent/children relationship?

No.


> > > + * Decompress a string. The caller must know the length of the output and
> > > + * allocate |out| to that length.
> > 
> > "allocate |out| to that length" -- ditto.
> 
> I'm not sure what's confusing here. "that length" refers to the length of
> the output provided by the caller.

I'm confused by the turn of phrase, i.e. what it means to "allocate something to a length".  Do you mean that "the caller must ensure |out| is big enough to hold the output"?

(BTW, that's a strange condition for a decompression function... I guess the original size must be saved somewhere?)


Like Jesse, I'm keen to know whether js_DecompileValueGenerator() can be removed, and if it can't, how much of the decompiler this will leave in place.
(In reply to Jesse Ruderman from comment #13)
> What does this change win us?  The worst complexity of the decompiler is
> also exposed by js_DecompileValueGenerator, isn't it?

js_DecompileValueGenerator is not long for this world either. I will be submitting patches to deal with it once I've run my plan by Dave. I don't think this patch should be landed without a patch removing js_DecompileValueGenerator usage right after it.
(In reply to Jesse Ruderman from comment #14)
> Have you considered HTTP cache pinning as an alternative to saving (and
> compressing!?) an extra copy of the function source?

While that could work, it mixes many browser levels and would require participation through the JS API. We'll need something like my patch anyway, since we need to save sources embedded in webpages, too.
> > Have you considered HTTP cache pinning as an alternative to saving (and
> > compressing!?) an extra copy of the function source?
> 
> While that could work, it mixes many browser levels and would require
> participation through the JS API.

True.

> We'll need something like my patch anyway,
> since we need to save sources embedded in webpages, too.

Inline script tags should be the cheapest, because we already pin the main page for View Source to work.

I'm hoping we can rely on the HTTP cache for most source recovery, and only use extra side storage for dynamically constructed scripts.
One step at a time, please!  Let's not let perfect be the enemy of good here.  :-)
Attached patch address initial comments (obsolete) — Splinter Review
Here's a revised patch. It generally addresses the comments made above. Most significantly, it stores offsets as 32-bit values and refactors the toString() method, adding comments.
Attachment #632032 - Attachment is obsolete: true
Attachment #632963 - Flags: review?(jorendorff)
> What other engines do is also worth
> considering on the point.  Tests can make a good argument, definitely, but
> we'll change the tests if it's worth it.

Yep.  Assuming the spec doesn't specify the correct behaviour here...  I can see arguments for both behaviours.  So I'd take into consideration (a) ease of implementation, and (b) what other browsers do.
Attachment #632973 - Flags: review?(n.nethercote)
Comment on attachment 632973 [details] [diff] [review]
add memory accounting for script sources

Review of attachment 632973 [details] [diff] [review]:
-----------------------------------------------------------------

Looks good!  Only minor changes needed.

Do you have any numbers to share?

::: js/src/jscntxt.cpp
@@ +104,5 @@
>      runtime->scriptFilenames = scriptFilenameTable.sizeOfExcludingThis(mallocSizeOf);
>      for (ScriptFilenameTable::Range r = scriptFilenameTable.all(); !r.empty(); r.popFront())
>          runtime->scriptFilenames += mallocSizeOf(r.front());
>  
> +    runtime->scriptSources = sizeOfScriptSources(mallocSizeOf);

This is the only callsite of sizeOfScriptSources();  just inline it and avoid the separate function.

@@ +118,5 @@
> +{
> +    size_t size = 0;
> +    for (ScriptSource *ss = scriptSources; ss; ss = ss->next)
> +        size += ss->sizeOf(mallocSizeOf);
> +    return size;

Nit:  I've been consistently using |n| as the accumulator name in functions like this.  Might as well do the same here.

::: js/src/jsscript.cpp
@@ +1162,5 @@
>      Foreground::free_(this);
>  }
>  
> +size_t
> +ScriptSource::sizeOf(JSMallocSizeOfFun mallocSizeOf)

Please rename this sizeOfIncludingThis(), for consistency with other reporters.  (See https://wiki.mozilla.org/Memory_Reporting for an explanation why, along with tons of other background info about writing memory reporters.)

@@ +1166,5 @@
> +ScriptSource::sizeOf(JSMallocSizeOfFun mallocSizeOf)
> +{
> +    JS_ASSERT(attached);
> +    JS_ASSERT(ready);
> +    return mallocSizeOf(this) + mallocSizeOf(this->data.compressed);

|compressed| is in a union with |source|.  This will measure correctly either way, but it might be worth a brief comment.

::: js/xpconnect/src/XPCJSRuntime.cpp
@@ +1573,5 @@
>                   "Memory used for the math cache.");
>  
> +    REPORT_BYTES(pathPrefix + NS_LITERAL_CSTRING("script-sources"),
> +                  nsIMemoryReporter::KIND_HEAD, rtStats.runtime.scriptSources,
> +                  "Memory allocated to storing JavaScript source code.");

Use "Memory used for..." for consistency with other reports.  And it might be worth adding "compressed" in there?
Attachment #632973 - Flags: review?(n.nethercote) → review+
Attached patch add memory accounting (v2) (obsolete) — Splinter Review
Attachment #632973 - Attachment is obsolete: true
Attachment #633208 - Attachment is obsolete: true
Whiteboard: [js:t]
Regressing the error messages SpiderMonkey gives, e.g. for: 

js> a = [{p:{q:null}}]
[{p:{q:null}}]
js> i=0
0
js> a[i].p.q.r
typein:3: TypeError: a[i].p.q is null

should not be done lightly.

This needs more discussion than what's in this bug, and saying "v8 has sucky errors" is no excuse. We need v8-like perf and better errors, and the decompiler is not standing in the way of perf work that I can see.

Source recovery also breaks built-in, self-consistent (regarding extensions in progress) pretty-printing. This was used by Venkman, not sure if it matters now. Jason and Jim may know.

We need a thread on dev-tech-js-engine-internals@lists.mozilla.org about the decompiler.

Separately, sorry if I missed it: Nick's question about memory cost/benefit should be estimated now. What is the net-net, roughly?

/be
An expression decompiler would be enough to keep the better error message shown in comment 27 and it would be relatively simple. Most of the mess in the decompiler comes from having to recover statement syntax from lowered control flow bytecode.

Source recovery vs. the decompiler as an either/or decision is a false dilemma. Column- as well as line-wise error blame (SourceMaps) will need source notes or equivalent mechanism. An expression decompiler should be considered no matter what happens with Function toString.

/be
Attachment #632963 - Flags: review?(jorendorff)
Zeroing bufStart and bufEnd in FunctionBox's constructor would be good.
Attached patch various rebasing/fixes/cleanups (obsolete) — Splinter Review
Attachment #632963 - Attachment is obsolete: true
Attachment #635070 - Flags: review?(jorendorff)
Memory usage doesn't seem to be a problem. With the current patch, Gmail script sources use about 1MB. Facebook is about the same. I even tried emscripten Python, which used about 2MB.
Attached patch part 2 (mem stats) (obsolete) — Splinter Review
Attachment #633210 - Attachment is obsolete: true
No longer depends on: 747831
Attached patch part 1b (fix browser tests) (obsolete) — Splinter Review
Separate patch for fixing browser tests. This is separate only for the purposes of review; I think it should be folded into the main patch for landing.
Attached patch part 1a (source saving) (obsolete) — Splinter Review
Attachment #635070 - Attachment is obsolete: true
Attachment #635070 - Flags: review?(jorendorff)
Attachment #635552 - Flags: review?(jorendorff)
Attachment #635203 - Attachment description: fix browser tests → part 1b (fix browser tests)
Attachment #635180 - Attachment description: mem stats (v4) → part 2 (mem stats)
Attached patch part 1a (saving source) (obsolete) — Splinter Review
Rebased.
Attachment #635552 - Attachment is obsolete: true
Attachment #635552 - Flags: review?(jorendorff)
Attachment #636019 - Flags: review?(jorendorff)
Blocks: 718969
Comment on attachment 636019 [details] [diff] [review]
part 1a (saving source)

Review of attachment 636019 [details] [diff] [review]:
-----------------------------------------------------------------

::: js/src/frontend/BytecodeCompiler.cpp
@@ +124,5 @@
> +        return NULL;
> +    ScriptSource *ss = ScriptSource::createFromSource(cx, chars, length);
> +    if (!ss)
> +        return NULL;
> +    AutoAttachToRuntime attacher(cx->runtime, ss);

It'd be nice to have a comment in ~AutoAttachToRuntime pointing out that it's always okay to attach scripts, even if the compilation fails, because the GC will clean them up when it realizes no live scripts refer to them.
(In reply to Brendan Eich [:brendan] from comment #27)
> Source recovery also breaks built-in, self-consistent (regarding extensions
> in progress) pretty-printing. This was used by Venkman, not sure if it
> matters now. Jason and Jim may know.

Neither we nor Firebug use it at the moment. We could pretty print sans decompiler with Reflect.parse and an unparser that generates a source map. It could all be JS.
(In reply to Benjamin Peterson from comment #17)
> js_DecompileValueGenerator is not long for this world either. I will be
> submitting patches to deal with it once I've run my plan by Dave. I don't
> think this patch should be landed without a patch removing
> js_DecompileValueGenerator usage right after it.

I'd like to hear more about this --- especially, how it would affect messages for errors like the one shown in comment 27. It'd take a lot of memory to record the relationship between bytecodes that could fail and source positions down to the column level, which is what we'd need to be as good as decompilation.

I wonder if jsanalyze doesn't gather enough data about the origins of each bytecode's operands to decompile expressions nicely.
(In reply to Jim Blandy :jimb from comment #39)
> (In reply to Benjamin Peterson from comment #17)
> > js_DecompileValueGenerator is not long for this world either. I will be
> > submitting patches to deal with it once I've run my plan by Dave. I don't
> > think this patch should be landed without a patch removing
> > js_DecompileValueGenerator usage right after it.
> 
> I'd like to hear more about this --- especially, how it would affect
> messages for errors like the one shown in comment 27. It'd take a lot of
> memory to record the relationship between bytecodes that could fail and
> source positions down to the column level, which is what we'd need to be as
> good as decompilation.

For the current plan, see bug 767274.

> 
> I wonder if jsanalyze doesn't gather enough data about the origins of each
> bytecode's operands to decompile expressions nicely.

Are you referring to jsanalyze.cpp?
(In reply to Jim Blandy :jimb from comment #37)
> It'd be nice to have a comment in ~AutoAttachToRuntime pointing out that
> it's always okay to attach scripts, even if the compilation fails, because
> the GC will clean them up when it realizes no live scripts refer to them.

Will do in next patch iteration.
Comment on attachment 636019 [details] [diff] [review]
part 1a (saving source)

Review of attachment 636019 [details] [diff] [review]:
-----------------------------------------------------------------

::: js/src/frontend/TokenStream.cpp
@@ +1104,5 @@
> +TokenStream::stripRight(size_t off)
> +{
> +    const jschar *base = userbuf.base();
> +    while (IsSpaceOrBOM2(base[off]))
> +        off--;

Nick mentioned this, invoking the spectre of threats to Waldo's well-being (I suspect Waldo is quite rugged, myself), but seeing this loop without a test that 'off' is positive makes me uncomfortable, security-wise, too.

For example, if there were ever added a constructor like 'Function' that produced expression closures (passing ExpressionBody instead of StatementListBody), then applying that to the empty string would have us walking off the beginning of the buffer, right? But adding such a constructor should be a change one can make without vetting everything for security holes.

You're effectively adding a precondition to Parser::functionBody (that the text not be empty) that is kind of obscure, and on which security depends.
(In reply to Benjamin Peterson from comment #40)
> Are you referring to jsanalyze.cpp?

That's the one. It includes a bunch of different analyses; the SSA analysis, I think, links opcodes to their inputs; that should be a tree one can walk almost directly (modulo ?:, perhaps?) to produce the expression that produced a given value --- or one equivalent to it, at least.
By the way, I'm really psyched to see this work; it's exactly what we need for Debugger. We'll just put a pretty front end on it, say, Debugger.Source, and lots of problems Firebug and Mozilla's script debugger face now will disappear.
(In reply to Jim Blandy :jimb from comment #42)
> Nick mentioned this, invoking the spectre of threats to Waldo's well-being
> (I suspect Waldo is quite rugged, myself), but seeing this loop without a
> test that 'off' is positive makes me uncomfortable, security-wise, too.

:-)

I explained this in-person, but yeah, it's lack-of-test (which might be a debug-only assert) that worries me.  If userbuf stored its pointers as RangedPtr instances, and exposed derived pointers as RangedPtrs, you could write nearly the same code and get validity-asserting for free.  I bet it'd probably be fairly easy to make that change in a separate underlying patch.
Attached patch part 1a (saving source) (obsolete) — Splinter Review
Attachment #636019 - Attachment is obsolete: true
Attachment #636019 - Flags: review?(jorendorff)
Attachment #636358 - Flags: review?(jorendorff)
Blocks: 755661
Comment on attachment 636358 [details] [diff] [review]
part 1a (saving source)

Review of attachment 636358 [details] [diff] [review]:
-----------------------------------------------------------------

::: js/src/frontend/Parser.cpp
@@ +1626,5 @@
> +        // No braces, so walk backwards from the next token to find the end of
> +        // the expression.
> +        tokenStream.getToken();
> +        size_t off = tokenStream.offsetOfToken(tokenStream.currentToken());
> +        funbox->bufEnd = tokenStream.stripRight(off - 1) + 1;

Will this include trailing comments in the function body?

x = function (y) y*2
// and here is a fine specimen of ...
xlerb;

Would it work to use tokenStream.currentToken().pos.end? It seems like functionBody leaves currentToken referring to the last token of the expression. That, together with .ptr and .pos.begin, might give you the ending offset directly.

::: js/src/jsscript.h
@@ +1000,5 @@
> +      source(NULL),
> +      chars(NULL),
> +      thread(NULL),
> +      lock(NULL),
> +      wakeup(NULL) {}

You're missing 'done' in this initializer.
Comment on attachment 636358 [details] [diff] [review]
part 1a (saving source)

Review of attachment 636358 [details] [diff] [review]:
-----------------------------------------------------------------

Picking some nits:

::: js/src/jsscript.cpp
@@ +2393,5 @@
> +    if (IS_GC_MARKING_TRACER(trc)) {
> +        if (filename)
> +            MarkScriptFilename(trc->runtime, filename);
> +
> +        if (trc->runtime->gcIsFull && source && source->attached)

Under what conditions would the GC encounter an unattached source? GC never runs during compilation (that used to be true, at least...), so it seems to me that AutoAttachToRuntime should always have attached all the sources that were created. This could be an assertion, and 'attached' could become DEBUG-only state.

::: js/src/jsscript.h
@@ +951,5 @@
> +    ScriptSource *next;
> +    union {
> +        jschar *source;
> +        unsigned char *compressed;
> +    } data;

It would be nice to have a comment atop this union like:

/* When we're done attempting compression: if compressedLength > 0, then 'compressed' holds the compressed data; otherwise, 'source' holds the uncompressed source code. */

It's not hard to figure out, but there's no need to make people research it. :)
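As a rough illustration of the convention such a comment would document (SourceData is a hypothetical, cut-down stand-in for ScriptSource, not the real type):

```cpp
#include <cassert>
#include <cstdint>

// compressedLength == 0 means data.source (uncompressed jschars) is live;
// otherwise data.compressed (zlib output of that many bytes) is live.
struct SourceData {
    union {
        char16_t *source;
        unsigned char *compressed;
    } data;
    uint32_t compressedLength;

    bool isCompressed() const { return compressedLength > 0; }
};
```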

@@ +955,5 @@
> +    } data;
> +    uint32_t length_;
> +    uint32_t compressedLength;
> +    bool marked:1;
> +    bool attached:1;

See further comments; this could be DEBUG-only.

@@ +956,5 @@
> +    uint32_t length_;
> +    uint32_t compressedLength;
> +    bool marked:1;
> +    bool attached:1;
> +    bool ready:1;

'ready' seems only to be used in assertions. This is completely fine, but it also means you could move it, and 'inLimbo' conditional on DEBUG. Perhaps a #ifdef DEBUG section at the end of the class?

@@ +959,5 @@
> +    bool attached:1;
> +    bool ready:1;
> +    bool argumentsNotIncluded:1;
> +
> +    static ScriptSource *createFromSource(JSContext *cx, const jschar *src, uint32_t length);

This could be a constructor, called via cx->new_<ScriptSource>(...), I think.

@@ +965,5 @@
> +    void attachToRuntime(JSRuntime *rt);
> +    void mark() { JS_ASSERT(ready); JS_ASSERT(attached); marked = true; }
> +    void destroy(JSRuntime *rt);
> +    uint32_t length() const { return length_; }
> +    bool inLimbo() { return !ready; }

I like the image, but naming this 'ready' and naming the flag 'ready_' is more consistent with the rest of SM, and thus a lighter burden on readers.

@@ +1009,5 @@
> +};
> +#endif
> +
> +extern void
> +SweepScriptSources(JSRuntime *rt);

This could be a static member function of ScriptSource.
Comment on attachment 636358 [details] [diff] [review]
part 1a (saving source)

Review of attachment 636358 [details] [diff] [review]:
-----------------------------------------------------------------

::: js/src/jsfun.cpp
@@ +596,3 @@
>      }
> +    if (interpreted) {
> +        bool funCon = (script()->sourceStart == 0 &&

The declaration of funCon needs a comment. Perhaps:

// The saved source for functions created by calling the Function constructor
// is only the function's body --- the last argument to the constructor.

@@ +607,2 @@
>  
> +        // Functions created with the constructor should be using the expression

They "should not" be using the extension, right?
(In reply to Jim Blandy :jimb from comment #48)
> Comment on attachment 636358 [details] [diff] [review]
> part 1a (saving source)
> 
> Review of attachment 636358 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> Picking some nits:
> 
> ::: js/src/jsscript.cpp
> @@ +2393,5 @@
> > +    if (IS_GC_MARKING_TRACER(trc)) {
> > +        if (filename)
> > +            MarkScriptFilename(trc->runtime, filename);
> > +
> > +        if (trc->runtime->gcIsFull && source && source->attached)
> 
> Under what conditions would the GC encounter an unattached source? GC never
> runs during compilation (that used to be true, at least...), so it seems to
> me that AutoAttachToRuntime should always have attached all the sources that
> were created. This could be an assertion, and 'attached' could become
> DEBUG-only state.

GC can run when a script is being initialized.
> 
> @@ +959,5 @@
> > +    bool attached:1;
> > +    bool ready:1;
> > +    bool argumentsNotIncluded:1;
> > +
> > +    static ScriptSource *createFromSource(JSContext *cx, const jschar *src, uint32_t length);
> 
> This could be a constructor, called via cx->new_<ScriptSource>(...), I think.

Except that I want to be able to return an error if I can't allocate memory in createFromSource.
Attached patch part 1a (saving source) (obsolete) — Splinter Review
Addressed Jim's review comments.
Attachment #636358 - Attachment is obsolete: true
Attachment #636358 - Flags: review?(jorendorff)
Attachment #636774 - Flags: review?(jimb)
Attached patch part 2 (mem stats) (obsolete) — Splinter Review
Attachment #635180 - Attachment is obsolete: true
Attached patch part 2 (mem stats) (obsolete) — Splinter Review
Attachment #636775 - Attachment is obsolete: true
Attached patch part 1a (saving source) (obsolete) — Splinter Review
This is growing all sorts of bells and whistles. Now with XDR support.
Attachment #636774 - Attachment is obsolete: true
Attachment #636774 - Flags: review?(jimb)
Attachment #636938 - Flags: review?(jorendorff)
Attached patch part 1a (saving source) (obsolete) — Splinter Review
Fix a few XDR things.
Attachment #636938 - Attachment is obsolete: true
Attachment #636938 - Flags: review?(jorendorff)
Attachment #636942 - Flags: review?(jorendorff)
Partial review; the rest coming shortly. This review is based on a patch from last Thursday or Friday.


I'm pretty sure you can break the invariant that only one script is
being compressed at a time by using eval in a Debugger.onNewScript hook.
Need a test and a fix for that; it's probably as simple as calling
ensureReady() before triggering the hook.

I have no experience with zlib, but offhand Z_BEST_SPEED seems more
appropriate than Z_BEST_COMPRESSION. Thoughts?

Given that, how much speed does SourceCompressorThread win? I would love
to do without it.

In JS_THREADSAFE builds, AutoAttachToRuntime's real job is to join with
the compression thread. So its name is wrong. I say call it
ScriptSourceCompressor or something like that, and use the C++ type
system to make it impossible to forget to use it.  (For example, add a
ScriptSourceCompressor * argument to createFromSource... I hope it's
clear what I'm getting at.)

ensureReady() and attachToRuntime() shouldn't be separate methods.

In js::frontend::CompileFunctionBody:
>+    if (!CheckLength(cx, length))
>+        return false;
>+    ScriptSource *ss = ScriptSource::createFromSource(cx, chars, length);
>+    if (!ss)
>+        return NULL;
>+    ss->argumentsNotIncluded = true;

Weird nit: For some reason these lines have a ton of trailing whitespace! Please get rid of that.

In frontend/Parser.cpp, Parser::functionDef:
>+        // No braces, so walk backwards from the next token to find the end of
>+        // the expression.

Jim noted that this leaves trailing comments, except whitespace,
attached to functions. Two possible routes to fix that,
uninvestigated:

- We do somehow get column and line numbers for the expression body
  that stops at the end of the last token:

    js> var fun_decl = Reflect.parse("function f() 0\n//x\n").body[0];
    js> fun_decl.body.loc
    ({start:{line:1, column:13}, end:{line:1, column:14}, source:null})

  So that's a possible lead, but I'm doubtful.

- Or, check and see if we always preserve the preceding token. It seems
  like it may have come up before; ES only needs 1 token lookahead, but
  I have vague memories of it not quite being implemented that way.

If it's not easy to fix, followup bug.

>+        tokenStream.getToken();
>+        size_t off = tokenStream.offsetOfToken(tokenStream.currentToken());
>+        funbox->bufEnd = tokenStream.stripRight(off - 1) + 1;
>+        tokenStream.ungetToken();
>+        if (kind == Statement && !MatchOrInsertSemicolon(context, &tokenStream))
>+            return NULL;

Style nit: Consider a line break after ungetToken(), since we're moving
on to such an unrelated thing.

In jscntxt.h, class SourceDataCache:
>+    JSFixedString *lookup(ScriptSource *fun);
>+    void put(ScriptSource *fun, JSFixedString *);

You kept the argument name 'fun' for these; but they are not functions
at all, right?

In jsfun.cpp, FindBody:
>+    for (;; p++) {
>+        bool done = false;
>+        switch (*p) {
>+        case '(':
>+            nest++;
>+            break;
>+        case ')':
>+            if (--nest == 0)
>+                done = true;
>+            break;
>+        default:
>+            break;
>+        }
>+        if (done)
>+            break;
>+    }

Won't this be defeated by stray parentheses in comments or strings?

Either way, follow-up bug to kill the bodyOnly argument and the
JS_DecompileFunctionBody API.  The only user is JSD's "CreatePPLineMap"
which anyway depends on the decompiler "pretty-printing" the function,
which makes no sense (and it assumes that recompiling the script
preserves the same bytecode instructions, which is crazy-- if anyone's
using this, they're nuts, the whole thing should be replaced with some
JS source-map-fu).
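For the record, a toy reduction of that loop is indeed defeated by a ')' in a comment (NaiveCloseParen is a hypothetical stand-in for FindBody's scan, not the real code):

```cpp
#include <cassert>
#include <cstddef>

// Starting at an opening '(', count parens until the nesting level
// returns to zero. The scan is blind to comments and strings, so a
// ')' inside either one defeats it.
static size_t NaiveCloseParen(const char *p) {
    int nest = 0;
    size_t i = 0;
    for (;; i++) {
        if (p[i] == '(')
            nest++;
        else if (p[i] == ')' && --nest == 0)
            break;
    }
    return i;
}
```

On `"(a /* ) */, b)"` this stops at index 6, the ')' inside the comment, rather than at the real closing paren at index 13.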

In jsfun.cpp, JSFunction::toString:
>+    // You may ask: What kind of self-respecting interpreted function doesn't
>+    // have source? Function.protoype for one. (See GlobalObject.cpp)
>+    bool interpreted = isInterpreted() && script()->sourceDataAvailable();

I think if you implement XDR, Function.prototype will be the only one;
fix up the comment.

It's kind of silly for only Function.prototype.toString() to change
behavior, so consider preserving the current behavior. I think it only
costs 2 lines of code plus a comment.

Either way, Function.prototype.toString() needs a test.

>+    if (!bodyOnly) {
>+        // If we're not in pretty mode, put parentheses around lambda functions.

Since this is the only thing the "pretty" flag controls anymore, please
rename it.

Follow-up bug to update the JSAPI; the JS_Decompile* functions shouldn't
have "indent" parameters anymore.

>+        if (interpreted && !pretty && flags & JSFUN_LAMBDA)
>+            out.append("(");

Style micro-nit: Please add extra parens around (flags & JSFUN_LAMBDA).

>+        if (interpreted && !pretty && flags & JSFUN_LAMBDA)
>+            out.append("(");
>+        out.append("function ");
>+        if (atom && !out.append(atom))
>+            return NULL;

Nit: check the return value every time out.append() is called.

The StringBuffer won't remember if an earlier call failed with OOM, so
it might report OOM more than once on cx, or worse, return true the
second time (so that we both report OOM and proceed, returning a wrong
result). Admittedly unlikely.

Looks like only 2 places to correct here.

----


In jsfun.cpp, JSFunction::toString:
>+        } else if (!pretty && flags & JSFUN_LAMBDA) {

Style nit: extra parens around (flags & JSFUN_LAMBDA) here too.

In jsscript.cpp, ConsiderCompressing:
>+    // Someone should probably determine what the optimal size to try to
>+    // compress is. We could also play with the zlib compression settings used
>+    // in TryCompressString.

It's pretty easy to test this just by compiling a bunch of trivial scripts:

  for (var i = 0; i < 100000; i++)
      eval("var k = " + i);

...making the scripts various sizes using comments etc.

I think it's worth measuring just to avoid putting a "we didn't look
into the perf here at all" comment into the source.

I see that we do the thread handoff even if the script is tiny and
ConsiderCompressing is going to be a no-op. That might be worth fixing.

>+    if (ss->length() >= 128 &&
>+        TryCompressString(reinterpret_cast<const unsigned char *>(src), memlen,
>+                          ss->data.compressed, &compressedLength)) {

Style nit: { on its own line here, indented to the same depth as "if".

>+void
>+SourceCompressorThread::finish()
>+{
>+    if (thread) {
>+        PR_Lock(lock);
>+        JS_ASSERT(state == IDLE);

Comment here that state == IDLE and not COMPRESSING because source
compression only ever races with script compilation, never with anything
else.

(I want a comment because I'm so used to python's Queue.Queue and
HTML5's Worker.postMessage; this simpler arrangement seems atypical
enough to warrant a few scattered one-liners.)

In SourceCompressorThread::threadLoop:
>+            ConsiderCompressing(rt, source, chars);
>+            JS_ASSERT(state == COMPRESSING);

Same thing here; it's unusual to hold a lock across this much work, so I
think it deserves a comment.

In SourceDataCache::put:
>+        map_ = OffTheBooks::new_<Map>();
>+        if (!map_)
>+            return;
>+        map_->init();

init() is fallible. Check the return value, delete and NULL out map_ on
failure.

In ScriptSource::substring:
>+    return js_NewStringCopyN(cx, chars + start, stop - start);

This is all right; DependentString::new_ seems better to me at the
moment; though both have ugly memory usage characteristics in some
cases.

    >+ScriptSource *
    >+ScriptSource::createFromSource(JSContext *cx, const jschar *src, uint32_t length)
    >+{
    >+    ScriptSource *ss = (ScriptSource *)cx->malloc_(sizeof(*ss));
    >+    if (!ss)
    >+        return NULL;
    >+    const size_t memlen = length * sizeof(jschar);
    >+    ss->data.compressed = (unsigned char *)cx->malloc_(memlen);
    >+    if (!ss->data.compressed) {
    >+        cx->free_(ss);
    >+        return NULL;
    >+    }
    >+    ss->next = NULL;
    >+    ss->length_ = length;
    >+    ss->marked = ss->attached = ss->ready = ss->argumentsNotIncluded = false;
    >+
    >+#ifdef JSGC_INCREMENTAL
    >+    /*
    >+     * During the IGC we need to ensure that source is marked whenever it is
    >+     * accessed even if the name was already in the table. At this point old
    >+     * scripts pointing to the source may no longer be reachable.
    >+     */
    >+    if (cx->runtime->gcIncrementalState == MARK && cx->runtime->gcIsFull)
    >+        ss->marked = true;
    >+#endif
    >+
    >+#ifdef JS_THREADSAFE
    >+    cx->runtime->sourceCompressorThread.compress(ss, src);
    >+#else
    >+    ConsiderCompressing(cx->runtime, ss, src);
    >+#endif
    >+
    >+    return ss;
    >+}

    >+void
    >+ScriptSource::destroy(JSRuntime *rt)
    >+{
    >+    JS_ASSERT(ready);
    >+    JS_ASSERT(attached);
    >+    rt->free_(data.compressed);
    >+    ready = attached = marked = false;
    >+    rt->free_(this);
    >+}

    >+void
    >+js::SweepScriptSources(JSRuntime *rt)
    >+{
    >+    ScriptSource *next = rt->scriptSources, *prev = NULL, *cur;
    >+    while (next) {
    >+        cur = next;
    >+        next = cur->next;
    >+        JS_ASSERT(!cur->inLimbo());
    >+        JS_ASSERT(cur->attached);
    >+        if (cur->marked) {
    >+            cur->marked = false;
    >+            prev = cur;
    >+        } else {

In js::SweepScriptSources:
>+            if (prev)
>+                prev->next = next;
>+            else
>+                rt->scriptSources = next;

One way we sometimes avoid this pattern is to say at the beginning
    ScriptSource **prev = &rt->scriptSources;
and then this becomes simply
    *prev = next;
but, it's up to you which one looks cleaner.

Incidentally, feel free to make this a method of JSRuntime and make
scriptSources private, if you think it helps.
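For comparison, the pointer-to-pointer shape of that sweep on a toy list (Node and Sweep are illustrative stand-ins, not the real ScriptSource chain):

```cpp
#include <cassert>

struct Node {
    bool marked;
    Node *next;
};

// prevp always points at the link to patch, so unlinking the head and
// unlinking an interior node are the same case.
static Node *Sweep(Node *head) {
    Node **prevp = &head;
    while (Node *cur = *prevp) {
        if (cur->marked) {
            cur->marked = false;   // clear the mark for the next GC
            prevp = &cur->next;
        } else {
            *prevp = cur->next;    // unlink; a real sweep would free cur
        }
    }
    return head;
}
```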

In jsscript.h:
>+#ifdef JS_THREADSAFE
>+class SourceCompressorThread

Definitely need a comment explaining how this thread interacts with the
main thread, like:

/*
 * Background thread to compress JS source code while parsing is
 * happening in the main thread.
 *
 * All ScriptSource objects are created by ScriptSourceBuilder, which
 * should be declared as a local variable; in JS_THREADSAFE builds,
 * ScriptSourceFactory::init posts the source code to the background
 * thread and its destructor waits for the thread to be finished.
 *
 * Since only one ScriptSourceBuilder exists at a time, the
 * SourceCompressorThread only has to support one to-do item at a time;
 * there is no queue.
 */
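To make the no-queue handoff concrete, here is a toy version of that protocol using standard C++ threads in place of NSPR primitives (OneSlotWorker and the doubling "compression" are stand-ins, not the real SourceCompressorThread API):

```cpp
#include <cassert>
#include <condition_variable>
#include <mutex>
#include <thread>

class OneSlotWorker {
    std::mutex lock;
    std::condition_variable wakeup;
    enum State { IDLE, COMPRESSING, SHUTDOWN };
    State state = IDLE;
    int *job = nullptr;
    std::thread thread;

    void threadLoop() {
        std::unique_lock<std::mutex> guard(lock);
        for (;;) {
            if (state == SHUTDOWN)
                return;
            if (state == COMPRESSING) {
                *job *= 2;            // stand-in for zlib compression
                state = IDLE;
                wakeup.notify_all();  // wake waitOnCompression()
            } else {
                wakeup.wait(guard);
            }
        }
    }

  public:
    OneSlotWorker() : thread(&OneSlotWorker::threadLoop, this) {}

    ~OneSlotWorker() {
        {
            std::lock_guard<std::mutex> guard(lock);
            state = SHUTDOWN;
        }
        wakeup.notify_all();
        thread.join();
    }

    // Post the single in-flight job; there is no queue to need.
    void compress(int *data) {
        std::lock_guard<std::mutex> guard(lock);
        assert(state == IDLE);
        job = data;
        state = COMPRESSING;
        wakeup.notify_all();
    }

    // Join with the worker before the source becomes visible elsewhere.
    void waitOnCompression() {
        std::unique_lock<std::mutex> guard(lock);
        while (state == COMPRESSING)
            wakeup.wait(guard);
    }
};
```

The wait loops re-check state, so spurious wakeups are harmless; the assert in compress() encodes the one-job-at-a-time invariant discussed above.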

>+struct ScriptSource
>+{
>+    ScriptSource *next;
>+    union {
>+        jschar *source;
>+        unsigned char *compressed;
>+    } data;
>+    uint32_t length_;
>+    uint32_t compressedLength;
>+    bool marked:1;
>+    bool attached:1;
>+    bool ready:1;
>+    bool argumentsNotIncluded:1;

Please make it a class with private data, and friend class
js::SourceCompressorThread if necessary.

I think 'attached' and 'ready' are confusing names, particularly because
the main thread must not test the 'ready' bit to see if compression is
done (it must wait for the signal). 'ready' could be made DEBUG-only,
since it's only used in assertions, and called 'compressionDone' or
'compressionDone_'. Then 'attached' could be called 'ready'.

'attached' is just vague.

And instead of inLimbo(), consider inverting the places where it is used
and just call it compressionDone()?

In jsutil.cpp:
>+namespace js {
>+
>+static void *
>+zlib_alloc(void *cx, uInt items, uInt size)
>+{
>+    return OffTheBooks::malloc_(items * size);
>+}
>+
>+static void
>+zlib_free(void *cx, void *addr)
>+{
>+    Foreground::free_(addr);
>+}
>+
>+

These belong either in the global namespace with a 'static' qualifier,
or in the js::detail namespace. Not in js.

Remove the extra blank line.

>+bool
>+TryCompressString(const unsigned char *inp, size_t inplen, unsigned char *out, size_t *outlen)

Style nit: I think the preferred way now is, instead of wrapping the
definition in a 'namespace js {}' block, to add 'js::' to the name, like
this:

    bool
    js::TryCompressString(const unsigned char *inp, siz...

>+    switch (inflateInit(&zs)) {
>+      case Z_OK:
>+        break;
>+      case Z_MEM_ERROR:
>+        return false;
>+      default:
>+        // Zlib misuse.
>+        JS_ASSERT(0);
>+        return false;
>+    }

Nit: Above, you wrote:

    if (ret != Z_OK) {
        JS_ASSERT(ret == Z_MEM_ERROR);
        return false;
    }

which seems more to the point.

In shell js.cpp, DecompileFunctionSomehow:
>+    if (args.length() < 1 || !args[0].isObject() ||
>+        !args[0].toObject().isFunction()) {

Style nit: Put all that on one line, please.

In ThisFilename:
>+    JSScript *script = js_GetCurrentScript(cx);
>+    JS_ASSERT(script);
>+    JS_ASSERT(script->filename);

It's better style to check for an empty stack and throw, rather than
just assert, since it is possible in the DOM to have a function called
from an empty stack (just make it an event handler). It may also be
possible to get js_GetCurrentScript to return NULL in the shell, thanks
to cross-compartment wrappers and/or Debugger.

Same thing in DecompileThisScript.


In jit-test/tests/basic/function-tosource-getset.js:
>+var o = {get prop() a + b, set prop(x) a + b};
>+var prop = Object.getOwnPropertyDescriptor(o, "prop");
>+assertEq(prop.get.toString(), "function () a + b");
>+assertEq(prop.get.toSource(), "(function () a + b)");
>+assertEq(prop.set.toString(), "function (x) a + b");
>+assertEq(prop.set.toSource(), "(function (x) a + b)");

The other thing to test is o.toSource(). It should produce something like the source ObjectLiteral.

In jit-test/tests/basic/testLet.js:
>-
>-    // test xdr by cloning a cross-compartment function
>-    otherGlobal.str = str;
>-    var c = clone(otherGlobal.eval("new Function('x', str)"));
>-    assertEq(c.toSource(), fun.toSource());
>-
>-    var got = fun(arg);
>-    var expect = result;
>-    if (got !== expect) {
>-        print("GOT:" + got);
>-        print("EXPECT: " + expect);
>-        assertEq(got, expect);
>-    }
> }

We talked about it on IRC and you decided to implement XDR, so I hope to
find this part of this test undeleted in a later revision.
Sorry for the cut and paste fail in the previous comment.

These comments are about the latest "part 1a" patch.


In TokenStream::endOffset:
>+        for (; lineno < tok.pos.end.lineno; lineno++) {
>+            jschar c;
>+            do {
>+                JS_ASSERT(buf.hasRawChars());
>+                c = buf.getRawChar();
>+            } while (!TokenBuf::isRawEOLChar(c));
>+            if (c == '\r' && buf.hasRawChars())
>+                buf.matchRawChar('\n');
>+        }

Well this is kind of gross. How bad is it to store the length of each
token when we scan it? We never have many tokens at a time, so it's not
like space is a big concern.

>+assertEq(f7.toString(), "function (foo, bar) foo + bar + '\\\nlong\\\nstring\\\ntest\\\n'");

Sweet test.

>+BEGIN_TEST(testXDR_source)

This one too.

In jsapi.cpp:
>+    return script->sourceDataAvailable() ? script->sourceData(cx) : cx->runtime->emptyString;

From IRC, it sounds like you're possibly removing sourceDataAvailable().
Right?

In jsfun.cpp, JSFunction::toString:
>+        // Functions created with the constructor should not be using the
>+        // expression body extension.
>+        JS_ASSERT(!funCon || !exprBody);

Micro-nit: perhaps JS_ASSERT_IF(funCon, !exprBody);
(In reply to Jason Orendorff [:jorendorff] from comment #57)
> >
> I'm pretty sure you can break the invariant that only one script is
> being compressed at a time by using eval in a Debugger.onNewScript hook.
> Need a test and a fix for that; it's probably as simple as calling
> ensureReady() before triggering the hook.

Good catch.

> 
> How much speed does SourceCompressorThread win? I would love to do
> without it.

It's not about winning speed. It's about not losing any. :) Parsing is already a bottleneck for large Emscripten-type things, and this is an easy thing to parallelize.

> 
> I have no experience with zlib, but offhand Z_BEST_SPEED seems more
> appropriate than Z_BEST_COMPRESSION.

Since it's on a background thread that invariably completes before parsing (I checked), we might as well get the best memory savings we can. After a few experiments, I've changed it to Z_DEFAULT_COMPRESSION, which achieves nearly the compression ratio of Z_BEST_COMPRESSION at a much lower time cost.

> 
> In JS_THREADSAFE builds, AutoAttachToRuntime's real job is to join with
> the compression thread. So its name is wrong.

I claim its real job is to make the source visible to the GC, for which the compression being completed is only a precondition.

> I say call it
> ScriptSourceCompressor or something like that, and use the C++ type
> system to make it impossible to forget to use it.  (For example, add a
> ScriptSourceCompressor * argument to createFromSource.)

> 
> ensureReady() and attachToRuntime() shouldn't be separate methods.

The idea is they do two different things. ensureReady() is only needed with background compressing, and attachToRuntime() is for GC. See in my new patch.


> I think it's worth measuring just to avoid putting a "we didn't look
> into the perf here at all" comment into the source.

True.

> 
> In ScriptSource::substring:
> >+    return js_NewStringCopyN(cx, chars + start, stop - start);
> 
> This is all right; DependentString::new_ seems better to me at the
> moment; though both have ugly memory usage characteristics in some
> cases.

I have a hard time caring about the memory usage/performance of toString() in general. Also, note it's not as simple as using a dependent string, since in the case that it's not compressed, there's no JSString. (Fixable if whacked for a while.)

> In js::SweepScriptSources:
> >+            if (prev)
> >+                prev->next = next;
> >+            else
> >+                rt->scriptSources = next;
> 
> One way we sometimes avoid this pattern is to say at the beginning
>     ScriptSource **prev = &rt->scriptSources;
> and then this becomes simply
>     *prev = next;
> but, it's up to you which one looks cleaner.
> 
> Incidentally, feel free to make this a method of JSRuntime and make
> scriptSources private, if you think it helps.

Well, Jim told me to make it a static method of ScriptSource. :) It knows about ScriptSource guts, so it's nice to have close to other ScriptSource code.

(In reply to Jason Orendorff [:jorendorff] from comment #58)
> Sorry for the cut and paste fail in the previous comment.
> 
> These comments are about the latest "part 1a" patch.
> 
> 
> In TokenStream::endOffset:
> >+        for (; lineno < tok.pos.end.lineno; lineno++) {
> >+            jschar c;
> >+            do {
> >+                JS_ASSERT(buf.hasRawChars());
> >+                c = buf.getRawChar();
> >+            } while (!TokenBuf::isRawEOLChar(c));
> >+            if (c == '\r' && buf.hasRawChars())
> >+                buf.matchRawChar('\n');
> >+        }
> 
> Well this is kind of gross. How bad is it to store the length of each
> token when we scan it? We never have many tokens at a time, so it's not
> like space is a big concern.

Space is not a concern. tok.pos.end is just set in a lot of places; I was afraid it would be tricky to get right in all cases, and I'd miss some subtlety. The nastiness is at least fairly contained here. Followup bug?
Attachment #637294 - Flags: review?(jorendorff)
Attached patch part 1a (saving source) (obsolete) — Splinter Review
Attachment #636942 - Attachment is obsolete: true
Attachment #636942 - Flags: review?(jorendorff)
> Since it's on a background thread that invariably completes before parsing
> (I checked), we might as well get the best memory savings we can. After a
> few experiments, I've changed it to Z_DEFAULT_COMPRESSION, which achieves
> nearly the compression ratio of Z_BEST_COMPRESSION at a much lower time cost.

Please write a comment about this :)



> Space is not a concern. tok.pos.end is just set in a lot of places; I was
> afraid it would be tricky to get right in all cases, and I'd miss some
> subtlety. The nastiness is at least fairly contained here. Followup bug?

I agree with Benjamin -- contained complexity is much better than spread-out complexity.  We've had bugs with tok.pos.end before, esp. with tricky cases like multi-line tokens.
Attached patch part 1b (fix browser tests) (obsolete) — Splinter Review
Every time you call toString on a function using ASI, stick the result in a data url, and try to load it, I dump a cute panda in acid.
Attachment #635203 - Attachment is obsolete: true
Should we put all new scripts in the decompressed cache immediately, instead of waiting until the first time they're decompressed? That's essentially free, since we still have the uncompressed text.

This could certainly be a follow-up.
(In reply to Benjamin Peterson from comment #59)
> > How much speed does SourceCompressorThread win? I would love to do
> > without it.
> 
> It's not about winning speed. It's about not losing any. :) Parsing is
> already a bottleneck for large Emscripten-type things, and this is an easy
> thing to parallelize.

Even if parsing is a bottleneck, the question stands. What's the difference in total startup time for a largeish emscripten image (since you bring it up) with the background thread vs. without?

If it's less than 1%, I claim it's not worth the complexity, because there are simpler hacks we could do for the same gain. I could be convinced in two ways: if this is actually the most straightforward way to win 1%, or if it wins 20%, then obviously we keep it.

> > In JS_THREADSAFE builds, AutoAttachToRuntime's real job is to join with
> > the compression thread. So its name is wrong.
> 
> I claim its real job is to make the source visible to the GC, for which the
> compression being completed is only a precondition.

I disagree. Waiting for compression to be finished is crucial for two reasons totally unrelated to GC: (1) the locking scheme between the main thread and the compression thread depends on it; and (2) once compilation returns to script, we may call fn.toString() and we don't want that to race with compression.

> > In ScriptSource::substring:
> > >+    return js_NewStringCopyN(cx, chars + start, stop - start);
>
> I have a hard time caring about the memory usage/performance of toString()
> in general.

Yeah, OK.
(In reply to Jason Orendorff [:jorendorff] from comment #65)
> (In reply to Benjamin Peterson from comment #59)
> > > How much speed does SourceCompressorThread win? I would love to do
> > > without it.
> > 
> > It's not about winning speed. It's about not losing any. :) Parsing is
> > already a bottleneck for large Emscripten-type things, and this is an easy
> > thing to parallelize.
> 
> Even if parsing is a bottleneck, the question stands. What's the difference
> in total startup time for a largeish emscripten image (since you bring it
> up) with the background thread vs. without?
> 
> If it's less than 1%, I claim it's not worth the complexity, because there
> are simpler hacks we could do for the same gain. I could be convinced in two
> ways: if this is actually the most straightforward way to win 1%, or if it
> wins 20%, then obviously we keep it.

Compiling and running the emscripten Python interpreter is 30% slower without the background thread.

> 
> > > In JS_THREADSAFE builds, AutoAttachToRuntime's real job is to join with
> > > the compression thread. So its name is wrong.
> > 
> > I claim its real job is to make the source visible to the GC, for which the
> > compression being completed is only a precondition.
> 
> I disagree. Waiting for compression to be finished is crucial for two
> reasons totally unrelated to GC: (1) the locking scheme between the main
> thread and the compression thread depends on it; and (2) once compilation
> returns to script, we may call fn.toString() and we don't want that to race
> with compression.

Okay. This has changed in the latest patch, anyway. See what you think.
QA Contact: general
Attached patch part 1a (saving source) (obsolete) — Splinter Review
Rebased.
Attachment #637295 - Attachment is obsolete: true
(In reply to Benjamin Peterson from comment #66)
> Compiling and running the emscripten Python interpreter is 30% slower
> without the background thread.

Sold!

Reviewing now.
Comment on attachment 637294 [details] [diff] [review]
interdiff between latest and last reviewed

OK, the only thing I'd still like to see improved is the comment on SourceCompressorThread.

It should at least explain what's going on here: that this exists solely to parallelize source compression with bytecode compilation, that the bytecode compiler always joins before returning a finished JSScript, and so the thread is always idle when compilation isn't happening.

It might also be nice to include things like:

- what data is meant to be protected by 'lock': all the data members, plus the data in *tok if tok is non-null.

- a one-liner on each CondVar: either "this one is signaled when status becomes COMPRESSING or SHUTDOWN, and this one when status becomes IDLE" or else "this one is signaled when the main thread wants to talk to the worker; this one when the worker replies"

- a one-liner on each status, just saying which side is responsible for setting that status code and what it means to the other side
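
For reference, the handshake those one-liners describe can be modeled with standard threading primitives. This is a hedged, minimal sketch — the real SourceCompressorThread uses the engine's own lock/condvar types and a real source token, and actual compression is replaced here by byte counting:

```cpp
#include <condition_variable>
#include <cstring>
#include <mutex>
#include <thread>

// 'lock' protects all data members, plus *tok while tok is non-null.
// 'wakeup' is signaled when the main thread wants the worker's attention
// (state becomes COMPRESSING or SHUTDOWN); 'done' is signaled when the
// worker replies (state becomes IDLE again).
class SourceCompressor {
    enum State { IDLE, COMPRESSING, SHUTDOWN };
    std::mutex lock;
    std::condition_variable wakeup; // main thread -> worker
    std::condition_variable done;   // worker -> main thread
    State state = IDLE;             // set by both sides, always under 'lock'
    const char *tok = nullptr;      // stands in for the real source token

    void threadLoop() {
        std::unique_lock<std::mutex> guard(lock);
        for (;;) {
            while (state == IDLE)
                wakeup.wait(guard);
            if (state == SHUTDOWN)
                return;
            // Real code would compress *tok here; we just count its bytes.
            compressedBytes += std::strlen(tok);
            tok = nullptr;
            state = IDLE;
            done.notify_one();
        }
    }

  public:
    size_t compressedBytes = 0;

    SourceCompressor() : worker(&SourceCompressor::threadLoop, this) {}

    // Called by the bytecode compiler to hand off a source token.
    void compress(const char *source) {
        std::unique_lock<std::mutex> guard(lock);
        tok = source;
        state = COMPRESSING;
        wakeup.notify_one();
    }

    // The compiler always joins before returning a finished JSScript, so
    // the worker is guaranteed idle whenever compilation isn't happening.
    void waitOnCompression() {
        std::unique_lock<std::mutex> guard(lock);
        while (state == COMPRESSING)
            done.wait(guard);
    }

    ~SourceCompressor() {
        waitOnCompression();
        {
            std::unique_lock<std::mutex> guard(lock);
            state = SHUTDOWN;
            wakeup.notify_one();
        }
        worker.join();
    }

  private:
    std::thread worker; // declared last so all other members are initialized
};
```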
Attachment #637294 - Flags: review?(jorendorff) → review+
I've drafted some documentation for presenting this via Debugger, linked to from bug 637572 comment 22.
Comment on attachment 638084 [details] [diff] [review]
part 1b (fix browser tests)

jst: These tests are in browser/, content/, and dom/, but really they're just testing the output of function.toString(). Still, I thought it'd be good to get another set of eyes on the changes.
Attachment #638084 - Flags: superreview?(jst)
Comment on attachment 638084 [details] [diff] [review]
part 1b (fix browser tests)

Looks good to me, but this also touches Ms2ger's tests (which I believe are imported from elsewhere, so changing them could cause merge pain down the road), so he should have a look at this as well.
Attachment #638084 - Flags: superreview?(jst)
Attachment #638084 - Flags: superreview+
Attachment #638084 - Flags: review?(Ms2ger)
Comment on attachment 638084 [details] [diff] [review]
part 1b (fix browser tests)

Review of attachment 638084 [details] [diff] [review]:
-----------------------------------------------------------------

Thanks for asking; the JSON files under failures/ are fair game.
Attachment #638084 - Flags: review?(Ms2ger) → review+
So, supporting the source with XDR has consequences. The startup cache serializes scripts with XDR. With the source included, this makes the binaries bigger than 100MB! I think we'll have to disable the source for these scripts. I think this will be fine; I didn't have any problems with it before XDR support was added.
Attached patch part 1a (saving source) (obsolete) — Splinter Review
Attachment #639369 - Attachment is obsolete: true
Attached patch part 1b (fix browser tests) (obsolete) — Splinter Review
Attachment #638084 - Attachment is obsolete: true
Attached patch part 2 (mem stats) (obsolete) — Splinter Review
Attachment #636781 - Attachment is obsolete: true
Attachment #637294 - Attachment is obsolete: true
Attachment #642093 - Flags: review?(jorendorff)
Attachment #642093 - Attachment description: part 3 (context option to only save compileAndGo scrip[ts) → part 3 (context option to only save compileAndGo scripts)
Here's the plan: All functions in chrome will appear to be native when toSource()/toString() is called on them.
(In reply to Benjamin Peterson from comment #80)
> Here's the plan: All functions in chrome will appear to be native when
> toSource()/toString() is called on them.

Yeah, that's a good point.

For XDR'ed scripts, the URL should be reliable for the debugger to retrieve anyway (it's a local filesystem reference, right?) so such scripts missing source won't affect the debugger much.
(In reply to Benjamin Peterson from comment #80)
> Here's the plan: All functions in chrome will appear to be native when
> toSource()/toString() is called on them.

Rather than making them indistinguishable from C++ functions, how about returning something that indicates the filename and line number?
(In reply to Jesse Ruderman from comment #82)
> (In reply to Benjamin Peterson from comment #80)
> > Here's the plan: All functions in chrome will appear to be native when
> > toSource()/toString() is called on them.
> 
> Rather than making them indistinguishable from C++ functions, how about
> returning something that indicates the filename and line number?

That's fine, too, but probably a followup bug.
> > > Here's the plan: All functions in chrome will appear to be native when
> > > toSource()/toString() is called on them.
> > 
> > Rather than making them indistinguishable from C++ functions, how about
> > returning something that indicates the filename and line number?
> 
> That's fine, too, but probably a followup bug.

Can you at least distinguish them from native functions by using a different keyword, e.g. "chrome" instead of "native" (or whatever it is).
Depends on: 773934
> > > > Here's the plan: All functions in chrome will appear to be native when
> > > > toSource()/toString() is called on them.
> > > 
> > > Rather than making them indistinguishable from C++ functions, how about
> > > returning something that indicates the filename and line number?
> > 
> > That's fine, too, but probably a followup bug.
> 
> Can you at least distinguish them from native functions by using a different
> keyword, e.g. "chrome" instead of "native" (or whatever it is).

In light of bug 462300, maybe a more general solution along the lines of "self-hosted" should be considered?
(In reply to Till Schneidereit [:till] from comment #85)
> In light of bug 462300, maybe a more general solution along the lines of
> "self-hosted" should be considered?

How would self-hosting help this problem?
We don't want self-hosted functions to be distinguishable from [native code] functions implemented by JSNatives.  Ideally it shouldn't be observable whether a function is [native code] or self-hosted, except perhaps in performance characteristics or something.  Whether that ideal will always hold, who knows, but we definitely shouldn't go out of our way to violate it.
This collided with Jeff's comment, so I'm posting it just for posterity.

(In reply to Benjamin Peterson from comment #86)
> How would self-hosting help this problem?

Sorry, I was a bit unclear: it wouldn't. It would, however, introduce another type of function that invoking toSource on should give something other than the actual source code.

Maybe "native" is still the right term for those, though, as the fact that they're self-hosted is an implementation detail? That seems to hold for chrome scripts, too, though.
Comment on attachment 642093 [details] [diff] [review]
part 3 (context option to only save compileAndGo scripts)

Review of attachment 642093 [details] [diff] [review]:
-----------------------------------------------------------------

I like it.

Generally JSContext options that affect compilation give us headaches. We want to get rid of them pretty badly, replacing them with a JS_Compile function that takes a const JSCompileOptions * argument. But that's still in the future.
Attachment #642093 - Flags: review?(jorendorff) → review+
As mentioned in bug 773934 comment 15, breaking chromeFunction.toSource() is going to impact a lot of add-ons who use it for monkey patching. It's a pretty wide-spread add-on "trick".

https://mxr.mozilla.org/addons/search?string=.toSource%28%29.replace%28&find=\.js
(Use your LDAP password. That query doesn't catch all things that would break, and contains some false positives, but it identifies several examples.)
"...for I, the LORD your God, am a jealous God, punishing the children for the sin of the fathers to the third and fourth generation of those who hate me..."
(In reply to :Gavin Sharp (use gavin@gavinsharp.com for email) from comment #90)
I was worried about this; thanks for confirming.

Looking at the callers of (Read|Write)CachedScript, it looks like we could define some generic "XDR toSource handler" JSRuntime callback which, given a JSScript, returns the associated chars.  This function could be factored out of mozJSComponentLoader::GlobalForLocation, which currently loads the chars.  One question is whether one of script->filename, script->principals or script->originPrincipals is good enough to do the lookup, or whether we also need to save the 'uri'.

There is also one use of JS_EncodeScript/JS_DecodeScript that isn't associated with the startup cache; inquiring about this in bug 774706...
Attachment #642095 - Attachment is obsolete: true
Attachment #643185 - Flags: review?(jorendorff)
Could the reviewer in particular please check that the chrome and scheme checks look correct?
Attachment #643191 - Attachment is obsolete: true
Attachment #643272 - Flags: review?(bzbarsky)
Benjamin, what behavior is that patch aiming for?  In what cases will SourceHook be called?
(In reply to Boris Zbarsky (:bz) from comment #96)
> Benjamin, what behavior is that patch aiming for?  In what cases will
> SourceHook be called?

When someone tries to call toSource() on a function that was defined in chrome code. Specifically, code that was compiled with JS_Compile* instead of directly run with JS_Evaluate*.
Hmm.  Why is there special behavior for chrome?  Is this because we use XDR to create those function objects and the xdr of a function is not storing the source?

To be quite honest, the sync reads on the UI thread that this patch is adding give me the willies.  Especially because last I checked nothing says that chrome:// is backed by something that can be easily read without spinning the event loop.  And I rather doubt that spinning the event loop inside SourceHook is safe.  So I'm trying to understand why we need this code at all, and if we need it trying to think about how we can make it sane.
Yeah, it's because of the XDR. XDR can save the source, but it takes up tons of space and just sits around in memory when it's loaded. Nobody in the browser actually tries to use toSource on chrome functions. It's only addons that like to monkeypatch the browser by calling toSource(), running a regexp, and replacing the function. Presumably that's mostly on startup, if that makes the sync I/O more palatable. Is everywhere you can get chrome JavaScript backed by files?
> Is everywhere you can get chrome javascript backed by files?

It's not guaranteed, sadly.  chrome:// is just a redirector, effectively; nothing prevents an extension from pointing it to an http:// or https:// url last I checked.

I'll defer to Taras on the startup situation.

Do we have any numbers for the on-disk usage and/or in-memory usage numbers for XDR if we did save the source?  Any idea whether we have lots of small functions or a few big ones?

Another thought: Once we have lazy bytecode compilation, will we still need to XDR stuff in chrome?  For that matter, how will XDR handle lazily compiled stuff?
(In reply to Boris Zbarsky (:bz) from comment #100)
> > Is everywhere you can get chrome javascript backed by files?
> 
> It's not guaranteed, sadly.  chrome:// is just a redirector, effectively;
> nothing prevents an extension from pointing it to an http:// or https:// url
> last I checked.

Could we ask the chrome protocol handler to resolve it before actually doing the load? Then check if it's local?

> 
> I'll defer to Taras on the startup situation.
> 
> Do we have any numbers for the on-disk usage and/or in-memory usage numbers
> for XDR if we did save the source?  Any idea whether we have lots of small
> functions or a few big ones?

omni.ja was over 100 MB with full XDR.

> 
> Another thought: Once we have lazy bytecode compilation, will we still need
> to XDR stuff in chrome?  For that matter, how will XDR handle lazily
> compiled stuff?

My assumption is that we would eagerly compile browser js and put it in the startup cache anyway. I defer to njn, who is actually implementing this, though.
> Could we ask the chrome protocol handler to resolve it before actually doing the load?
> Then check if it's local?

Possibly....  Benjamin would know better.

> omni.ja was over 100 MB with full XDR.

Hrm.  How much is it if you don't save the source in the XDR?  Was the source being saved as UTF-8 or UTF-16?
(In reply to Boris Zbarsky (:bz) from comment #102)
> > Could we ask the chrome protocol handler to resolve it before actually doing the load?
> > Then check if it's local?
> 
> Possibly....  Benjamin would know better.

I have a better idea: Just check if the channel is a nsIFileChannel.

> 
> > omni.ja was over 100 MB with full XDR.
> 
> Hrm.  How much is it if you don't save the source in the XDR?  Was the
> source being saved as UTF-8 or UTF-16?

It's less than 10MB otherwise. The source is saved as compressed UTF-16. Perhaps the duplicate compression is causing horrible things.
Attachment #643185 - Attachment is obsolete: true
Attachment #643185 - Flags: review?(jorendorff)
Attachment #643609 - Flags: review?(jorendorff)
Here I check if the channel is local.
Attachment #643272 - Attachment is obsolete: true
Attachment #643272 - Flags: review?(bzbarsky)
Attachment #643610 - Flags: review?(bzbarsky)
> > Another thought: Once we have lazy bytecode compilation, will we still need
> > to XDR stuff in chrome?  For that matter, how will XDR handle lazily
> > compiled stuff?
> 
> My assumption is that we would eagerly compile browser js and put it in the
> startup cache anyway. I defer to njn, who is actually implementing this,
> though.

I don't know.  Lazy bytecode is only working for trivial examples; there is still quite a lot of work to do even for non-trivial vanilla JS code.  I haven't even thought about chrome code and the startup cache and all that.  This bug is much closer to being finished, so if we can avoid "lazy bytecode will fix it" thinking in this bug, that would be good.
(In reply to Benjamin Peterson from comment #103)

> It's less than 10MB otherwise. The source is saved as compressed UTF-16.
> Perhaps the duplicate compression is causing horrible things.

It probably is causing horrible things.

The omni.ja XDR landed before njn sped up JS parsing. I suspect we can get rid of XDR if it's no longer a significant win. Someone just needs to compare time to load chrome JS via XDR vs time to load from JS source.

I'm not worried about synchronous chrome IO for monkeypatching. Most of the chrome IO is already on the main thread.
Fixed on jars.
Attachment #643610 - Attachment is obsolete: true
Attachment #643610 - Flags: review?(bzbarsky)
Attachment #643739 - Flags: review?(bzbarsky)
I'm not sure whether my guidance here is still needed. You can certainly check the channel once you've opened it, although I don't think that omnijarred URLs have a useful nsIFileChannel, so I'm not sure what that gets you. You can also special-case the chrome protocol and call convertChromeURL for them if that helps.
I tested several addons from Gavin's search with my patches. They seem to work correctly.
Attached patch part 6: test for chrome toSource (obsolete) — Splinter Review
Here's a test for chrome toSource. I put it in xpconnect/test; I'm open to better suggestions.
Attachment #643992 - Flags: review?(bzbarsky)
So JSOPTION_ONLY_CNG_SOURCE means to only save source for JS_CompileFunction or for compile-and-go scripts, right?  Contrary to what the comments in attachment 642093 [details] [diff] [review] say?

And we're relying on chrome _not_ compiling its scripts with compile-and-go, right?  Might be worth documenting that somewhere.
(In reply to Boris Zbarsky (:bz) from comment #112)
> So JSOPTION_ONLY_CNG_SOURCE means to only save source for JS_CompileFunction
> or for compile-and-go scripts, right?  Contrary to what the comments in
> attachment 642093 [details] [diff] [review] say?

Correct; I'll fix the comment.

> 
> And we're relying on chrome _not_ compiling its scripts with compile-and-go,
> right?  Might be worth documenting that somewhere.

I'll add a comment in nsJSEnvironment.cpp.
Comment on attachment 643739 [details] [diff] [review]
part 5: let the browser load chrome source if someone wants it

>+++ b/dom/base/nsJSEnvironment.cpp
>+ReadSourceFromFilename(JSContext *cx, const char *filename, char **buf, PRUint32 *len)
>+  nsCOMPtr<nsIIOService> ioService = do_GetIOService(&rv);
>+  NS_ENSURE_SUCCESS(rv, rv);
>+
>+  // Get the URI.
>+  nsCOMPtr<nsIURI> uri;
>+  nsDependentCString fn(filename);
>+  rv = ioService->NewURI(fn, nsnull, nsnull, getter_AddRefs(uri));
>+  NS_ENSURE_SUCCESS(rv, rv);
>+
>+  nsCOMPtr<nsIChannel> scriptChannel;
>+  rv = ioService->NewChannelFromURI(uri, getter_AddRefs(scriptChannel));

  nsCOMPtr<nsIURI> uri;
  rv = NS_NewURI(getter_AddRefs(uri), filename));
  NS_ENSURE_SUCCESS(rv, rv);

  nsCOMPtr<nsIChannel> scriptChannel;
  rv = NS_NewChannel(getter_AddRefs(scriptChannel), uri)
  
>+  /* read the file in one swoop */

There's no guarantee this will work.  You can get short reads.  You should probably read in a loop until an error happens or 0 bytes are read; the latter would indicate EOF.
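
The read loop bz asks for can be sketched like this. It is illustrative only, using plain stdio in place of the necko input stream the real patch reads from:

```cpp
#include <cstdio>
#include <cstdlib>

// Read an entire stream, tolerating short reads: keep reading until a
// call returns 0 bytes (EOF) or an error occurs, growing the buffer as
// needed. Returns a malloc'd buffer the caller must free, or nullptr.
static char *ReadAll(FILE *fp, size_t *len) {
    size_t cap = 4096, used = 0;
    char *buf = static_cast<char *>(malloc(cap));
    if (!buf)
        return nullptr;
    for (;;) {
        if (used == cap) {
            cap *= 2;
            char *grown = static_cast<char *>(realloc(buf, cap));
            if (!grown) {
                free(buf);   // free right next to the failure point
                return nullptr;
            }
            buf = grown;
        }
        size_t n = fread(buf + used, 1, cap - used, fp);
        used += n;
        if (n == 0)          // 0 bytes means EOF or error, not a short read
            break;
    }
    if (ferror(fp)) {
        free(buf);
        return nullptr;
    }
    *len = used;
    return buf;
}
```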

>+  if (bytesRead != *len)

I'd rather the JS_free were here, closer to the alloc and where we know that buf is non-null.  At least in part because free(null) is not universally safe (the spec notwithstanding).

>+SourceHook(JSContext *cx, JSScript *script, char **src, uint32_t *length)

Probably best to set *src to null and *length to 0 up front here.

r=me with the above addressed.
Attachment #643739 - Flags: review?(bzbarsky) → review+
Attachment #643609 - Attachment is obsolete: true
Attachment #643609 - Flags: review?(jorendorff)
Attachment #644158 - Flags: review?(jorendorff)
Addressed review comments; added long comment.
Attachment #643739 - Attachment is obsolete: true
Forgot one small change.
Attachment #644159 - Attachment is obsolete: true
That looks great, thanks!
(In reply to Benjamin Peterson from comment #113)
> > And we're relying on chrome _not_ compiling its scripts with compile-and-go,
> > right?  Might be worth documenting that somewhere.
> 
> I'll add a comment in nsJSEnvironment.cpp.

bhackett mentioned that compile-and-go will most likely be activated for all scripts, including chrome ones, relatively soon. This might need an explicit flag after that.
Attachment #644158 - Flags: review?(jorendorff) → review?(luke)
Comment on attachment 644158 [details] [diff] [review]
part 4: add a hook to allow retrieving sources we didn't save

Review of attachment 644158 [details] [diff] [review]:
-----------------------------------------------------------------

::: js/src/jsscript.cpp
@@ +1215,5 @@
>          tok->chars = src;
>          cx->runtime->sourceCompressorThread.compress(tok);
>      } else
>  #endif
> +        ss->considerCompressing(cx->runtime, src, ownSource);

Indentation off
Attachment #644158 - Flags: review?(luke) → review+
Comment on attachment 643992 [details] [diff] [review]
part 6: test for chrome toSource

r=me
Attachment #643992 - Flags: review?(bzbarsky) → review+
Attached patch part 1Splinter Review
Attachment #642090 - Attachment is obsolete: true
Attachment #642091 - Attachment is obsolete: true
Attached patch part 2Splinter Review
Attachment #642092 - Attachment is obsolete: true
Attached patch part 3Splinter Review
Attachment #642093 - Attachment is obsolete: true
Attached patch part 4Splinter Review
Attachment #644158 - Attachment is obsolete: true
Attached patch part 5Splinter Review
Attachment #644162 - Attachment is obsolete: true
Attached patch part 6 (obsolete) — Splinter Review
Attachment #643992 - Attachment is obsolete: true
Attached patch part 6Splinter Review
Attachment #644399 - Attachment is obsolete: true
With this patch almost every call to eval("(" + foo.toString() + ")") will fail.

toString() returns the function with all comments and //@line remarks, and many functions are missing the last }.

for example warnAboutClosingWindow.toString() ends with:
> //@line 6108 "e:\builds\moz2_slave\m-cen-w32-ntly\build\browser\base\content\browser.js"
>  return true;
> //@line 6110 "e:\builds\moz2_slave\m-cen-w32-ntly\build\browser\base\content\browser.js"

without the } at the end of the function
Depends on: 776283
Depends on: 776290
This is an addon compat issue; see bug 776290.
Keywords: addon-compat
Depends on: 776317
Depends on: 776389
Depends on: 776430
Depends on: 776439
Blocks: 776489
Depends on: 776741
Depends on: 776484
Reopening since it got pref'd off. Also note the memory regression in bug 776741, which needs to be addressed somehow before turning back on.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
This patch was never fully disabled; the compression part was disabled in bug 776700 but has been re-enabled in bug 777190.
Status: REOPENED → RESOLVED
Closed: 12 years ago12 years ago
Depends on: 777190
Resolution: --- → FIXED
(In reply to Matt Brubeck (:mbrubeck) [away until Aug 6] from comment #134)
> This patch was never fully disabled; the compression part was disabled in
> bug 776700 but has been re-enabled in bug 777190.

Benjamin did fully disable it in order to cure the SunSpider regression. Not that it matters now.
Depends on: 779694
Depends on: 787703
Alias: savesource
I'm a little confused.  This is resolved fixed, yet the attached patches indicate no reviews.  A little housekeeping?
Click the "Show Obsolete" link.  ;)
Blocks: 845713
You need to log in before you can comment on or make changes to this bug.