Closed Bug 54743 Opened 24 years ago Closed 24 years ago

Our JavaScript is 3x slower than IE's

Categories

(Core :: JavaScript Engine, defect, P1)


Tracking


RESOLVED FIXED
mozilla0.8

People

(Reporter: kandrot, Assigned: brendan)

References

Details

Attachments

(50 files)

Our implementation of JavaScript appears to take 3 times as long to do many tasks as IE's. This was also noticed in bug #40988, but it is not the focus of that bug. Here are the numbers I found.

In a pure looping JavaScript (1 million loops), we spend (function only, not children):

  ftol              25%  (2,000,000 calls)
  js_Interpret      25%  (1 call)
  js_UnlockScope    10%  (7,000,000 calls)
  js_LockScope1      9%  (7,000,000 calls)
  js_FindProperty    5%  (2,000,000 calls)

It looks like some things are called multiple times for each pass through a loop, which we might be able to change. I did not list them all here, but can add them to the bug if they are needed.

In the Luhn credit card JS (30,000 iterations), we spend (function only):

  malloc                  55%  (332,630 calls, mostly allocating 2 bytes for character access)
  js_Interpret             5%  (1 call)
  ftol                      4%  (917,000 calls, mostly from js_Interpret and js_NewNumberValue)
  js_GetSlotWhileLocked     4%  (4,000,000 calls, spread out through a few functions)

I hope these numbers help someone who knows our JavaScript engine better than I do to narrow down the problem. If IE can do these same operations in 1/3 the time, so can we. (I have more information, which I can add later, once I find out the format needed to be helpful.)
cc'ing Brendan, jband, Patrick -
Edward, see bug 43902 for discussion of the 3x cost of doing threadsafe locking in the JS engine. And bug 50859 for discussion of why this locking is necessary, and numbers showing that this locking overhead is a small and acceptable cost in real-world situations in the browser (i.e. no one does 30k iterations of credit card validations).
Also, it would be good if you could attach the test case(s).
Bug 40988 is not about JavaScript speed; it's about the speed of manipulating the DOM. Note the non-linear aspect of that bug. I'd bet a lot this is limited by the speed of manipulating the content model, and probably cascading notifications and reflows or some such thing outside of the core JS engine.
Lock and Unlock only account for 19%, and only in the looping case. Also, they are called 7 times for each pass through a loop. Is this as designed? Most of the time is spent in malloc, or doing things more times than I think the author intended. It is good to know that locking and unlocking were designed into the code, but were they designed to be called as many times as they are in that case? Did we design to malloc 2-byte strings every time JavaScript accesses a char? I have attached the Luhn script, the looping script, and an abs script, all of which run at 1/3 the speed under our JavaScript as compared to MS. There are also benchmarks on different websites, which I will try to find again, that show the same thing across the board, as does the referenced bug #40988. We are slower; why is what I want to know. This bug has some information, and if anyone has more relevant info on the "why?", I think it should be added.
Attached file A Simple math script
Under Linux, 4.75 vs the latest Moz6 (in seconds), with the attached scripts:

  Luhn    - 6.113 (M6) vs 3.528 (4.75)
  looping - 4.72 (M6) vs 6.777 (4.75)
  abs     - 15.938 (M6) vs 15.339 (4.75)

So Moz6 seems to be much faster at looping, about the same for simple math, and slower on strings. Obviously small test cases, so if there are any more to add, please add them. Thanks.
objects are locked for property access, not based on passes through loops - other threads in other loops could be accessing the objects. The credit card test does a lot of calls to String.charAt. This has to make a new string (one char+null terminator) each time. A better allocator or a recycler for small strings would improve this. The looping is heavily dominated by the locked access to the counter value. This can be quicker when done in a function with the counter being local rather than a property of the global object. I'm not seeing why there would be calls to ftol. The numbers are small enough to use the integer comparison optimization in RELATIONAL_OP. This is curious. The math test is just a combination of this looping and the speed of calling Math.abs. whatever.
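The "recycler for small strings" idea mentioned here, as a generic sketch (invented names, no locking, not SpiderMonkey's actual allocator): keep freed one-character buffers on a freelist so a charAt-heavy loop stops paying malloc/free per character.

    #include <stdlib.h>

    typedef struct SmallStrNode {
        struct SmallStrNode *next;
    } SmallStrNode;

    static SmallStrNode *freelist;   /* single-threaded sketch: no locking here */

    /* Get a buffer big enough for one char + NUL (and for the link while free). */
    static char *AllocCharString(void)
    {
        if (freelist) {
            SmallStrNode *n = freelist;
            freelist = n->next;
            return (char *)n;
        }
        return (char *)malloc(sizeof(SmallStrNode));
    }

    /* Instead of free(), push the buffer back for the next charAt result. */
    static void RecycleCharString(char *s)
    {
        SmallStrNode *n = (SmallStrNode *)s;
        n->next = freelist;
        freelist = n;
    }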
Mine, cuz I've been doing all the major hacking on SpiderMonkey lately, and I started jsjit.c during my sabbatical. /be
Assignee: rogerl → brendan
incrementing a global variable is not optimized to use jsints -- sorry. It uses doubles to do the deed, incurring some overhead. I'm not sure what ftol is; it sounds like a helper function in the C runtime that's called by code generated by the compiler for things like JSDOUBLE_IS_INT. I'll confirm this bug and use it to help build up a suite of benchmarks for the JIT. Keep 'em coming. Kandrot, please do test a counterpart script that uses a local variable. None of this means that time spent in js_Interpret dominates any critical path, of course, but that's another issue. If js_Interpret doesn't dominate or even show up high in Self-time on mail startup and other important user-level task benchmarks, then I think we should spend more time analyzing other parts of the system than we do benchmarking JS. But I welcome input on this bug, because I am going to finish the JS JIT. /be
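For reference, a hedged approximation (not quoted from the engine's headers) of the kind of int-in-a-double test described above; the (long) cast in such a macro is what MSVC lowers to a call to its _ftol runtime helper, which is why ftol shows up in the profile. The macro and function names here are invented for illustration.

    /* Hypothetical macro, for illustration only. */
    #define DOUBLE_FITS_IN_INT(d, i)  ((d) == (double)((i) = (long)(d)))

    /* Returns 1 and stores the integer in *out if d holds an exact int value. */
    static int DoubleToIntIfExact(double d, long *out)
    {
        long i;
        if (DOUBLE_FITS_IN_INT(d, i)) {   /* the (long) cast is where _ftol gets called */
            *out = i;
            return 1;
        }
        return 0;                          /* caller falls back to double arithmetic */
    }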
Status: UNCONFIRMED → NEW
Ever confirmed: true
There are optimizations beyond native machine code compilation that can be done in the JS engine. I'll list some here soon. /be
Status: NEW → ASSIGNED
Keywords: mozilla1.0
Target Milestone: --- → mozilla0.9
The patch just attached needs your code review and testing help. I'm preparing to run jband's test from bug 20357 on optimized versions of xpcshell built with and without the patch. /be
Ugh -- cyclic dependency problem with jslock.h and jsscope.h (latent bug, #ifdef DEBUG, escalated by the big patch and fixed by that last jslock.h-only patch). I will fix this soon, but first, here are my results (RH6.1 Dell PIII 500MHz 256MB) from jband's timing.js testcase run through optimized xpcshell, averaged over three trials after warming OS caches with a "0th" trial:

  < global average .71
  < object average .75
  < local average .32
  ---
  > global average .36
  > object average .39
  > local average .23

Not quite as good as without JS_THREADSAFE defined, which I measured with an optimized js shell (not with xpcshell) at .21/.24/.16. There are still global locks to take and release that JS_THREADSAFE enables, and there are still those scope->ownercx == cx tests, and any deoptimizations caused by the conditional expressions that fall into js_LockObj, js_LockScope, or js_[GS]etSlotWhileLocked calls if scope->ownercx != cx.

In Mozilla, DOM JS on the main thread does not run within requests. So the code in the patched JS_EndRequest that shares a single-threaded scope once the single thread (request on a context) is done using it, and a second thread wants to lock it, won't ever run for objects in DOM JS (scripts and event handlers, both XUL and HTML). XPConnected JS and component JS does run within requests.

Is this a problem? Not for DOM JS objects, which have single-threaded native code underlying their classes. But vanilla JS objects could be shared among the main and other threads, and if the non-main thread accesses a single-threaded scope, that thread will wait for a notify from JS_EndRequest that won't happen until (unless) a non-DOM script or function call happens on the main thread, on the same JSContext. Should we bite the bullet and make all JS run within requests, in view of the rule that says JS_THREADSAFE embeddings must do so, a rule which this patch relies upon? /be
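For readers unfamiliar with the request model under discussion: in a JS_THREADSAFE embedding, every stretch of JSAPI activity on a context is supposed to be bracketed by JS_BeginRequest/JS_EndRequest. A minimal sketch; the function name and script text are invented for illustration, the API calls are real JSAPI of this era.

    #include <string.h>
    #include "jsapi.h"

    static void RunScriptInRequest(JSContext *cx, JSObject *global)
    {
        const char *src = "var i = 0; while (i < 1000) i++;";
        jsval rval;

        JS_BeginRequest(cx);      /* tell the engine this thread is actively using cx */
        if (!JS_EvaluateScript(cx, global, src, (uintN)strlen(src),
                               "inline-test", 1, &rval)) {
            /* compile or runtime error; the error reporter has already run */
        }
        JS_EndRequest(cx);        /* GC -- and, with this patch, scope sharing -- may proceed */
    }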
Brendan, I'm just starting to look at these. Doing the easy stuff first... jsshell compiled and ran the test cases with only the octal junk failing. For the JS_THREADSAFE stuff on Win32 you need to add JS_FRIEND_API to the js_InitContextForLocking declaration in jslock.h to match the definition in jslock.c...

  extern JS_FRIEND_API(void) js_InitContextForLocking(JSContext *);

...otherwise it will not compile on Win32. And... running mozilla I get a deadlock in jslock.c line 694 during object finalization after the main window has come up but not finished loading content. I only see one thread actually doing anything having to do with JS. Maybe some faulty signaling logic? I'll attach the stack. Anyway, I'll start actually looking at the changes.
Attached file deadlock stacktrace
For the record... On my Dell Inspiron 7500 PIII 400 256meg using xpcshell I get:

  < global 0.12 seconds.
  < object 0.12 seconds.
  < local 0.01 seconds.
  ---
  > global 0.05 seconds.
  > object 0.05 seconds.
  > local 0.01 seconds.

Note that this is with the 'loc' function fixed as in the body of that bug to make it truly avoid repeated lookups on the global object. I'll attach the timing.js that I used.
jband: my mozilla builds never hit that "deadlock" (really, just a logic bug in js_LockScope1; but your stack also smoked out a bogus scope lock/unlock pair in js_DestroyScope, there just to quell an assertion in the clear-scope implementation). Are you selecting from multiple profiles or using some other dialog on startup? Thanks, please try the latest diff -u patch and let me know how it works. Anyone else have time to help test? I feel my Linux mozilla usage does not cover enough cases after this round! /be
brendan: yes a dialog had come up - just dumb luck... I'd turned on NS6 association with launching .html files and on launch of mozilla it asked me if I wanted to reassociate with mozilla. I'll look at the new stuff. I get the general idea; I'll look closer at specifics. I like what I see so far. Have you rounded up another reviewer?
Cc'ing other potential reviewers (Shaver's gone AWOL ;-). I'm running my dogfood optimized Linux build with this patch, works great. Alecf is kindly helping to test it too. Maybe waterson will give an r=? /be
Ok, I'm still working up to reading the code closer. I applied the recent patch to a fresh js/src in my trunk build and set it loose on chofmann's browser buster. It eventually locked up in js_LockScope1. It was calling in to JS via xpconnect. The method was nsIXULKBrowserWindow::onStateChange. I'm iffy on the url. The particular docshell only showed "_blank". I think it was on (or going away from) www.apple.com. This is based on the window title in the app icon showing the one in the top100 sites with url.56 in the name. I failed to check all other thread stacks before killing it. Just starting up and going to that url did not trigger the lockup for me. I'll attach the stack.
Attached file lockup stack
I'm thinking I'd like to see some tests where we stress actually transitioning into "accessed from more than one thread" mode. I'm thinking that it ought to be possible to implement nsIThread in JS (with or without some native helper code) and write some JS tests that are multithreaded using xpcshell. I'll look into that.
Cc'ing alecf. Alec, can you tell from jband's "lockup stack" what object might be in the midst of a lock attempt? It looks like a method implementing the onStateChange, maybe http://lxr.mozilla.org/mozilla/source/xpfe/browser/resources/content/navigator.js#282 in which case the object in question could be the channel, which is a threadsafe object accessed on two threads (right?). jband, if you can make this happen again, please leave it in the debugger and I'll come by. /be
attached a stack showing lockup in the multi-threaded case in xpcshell. This one happens to have two threads stuck. But, I've seen this stick here with the 'other' thread having finished and only the main thread stuck. (There is also a memory flusher thread doing its thing, but out of the picture I think.) Without brendan's changes this works fine.
brendan: re our conversation... When JS_DestroyContext is called in the thread dtor cx->requestDepth == 0 as it should be. The calls to the JS object via the wrapper have balanced Begin/EndRequest.
Divide and conquer.... Thanks to jband for pointing out the ugly, easy case of a context being destroyed without anyone trying to share scopes owned by that context. Rather than attempt to find such scopes (via the GC heap, presumably) and clear their ownercx pointers, I chose to make js_LockScope1 pay the price: it searches linearly in rt->contextList for the alleged scope->ownercx, using a new js_LiveContext predicate. This minimizes code size at the cost of O(N**2) growth rate for N contexts in the runtime, growing with scope share events. I think scope sharing will be rare enough, and N small enough, that we can live with the cost.

Note that memory recycling could result in a new cx whose address is the same as an old, destroyed one recorded in various scopes' ownercx pointers -- but all is well, because either the new live context is not in a request, so the call to js_LockScope1 on such a "dangling ownercx" scope will assume ownership for the calling cx; or js_LockScope1 will wait on rt->scopeSharingDone after inserting the scope on rt->scopeSharingTodo, and JS_EndRequest will interlock properly via rt->gcLock and notify the waiter.

Thanks again for all the testing help -- more needed, but I think we're almost there. We might need to put all "DOM JS" inside requests, as I mentioned in an earlier comment, but I haven't come up with a thought experiment that requires that yet. /be
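A sketch of what such a liveness predicate might look like, assuming the runtime keeps its contexts on the circular rt->contextList and the caller already holds the lock protecting that list. This is an illustration built from the names used in this bug, not the checked-in code.

    static JSBool
    js_LiveContext(JSRuntime *rt, JSContext *cx)
    {
        JSCList *cl;

        /* Walk every context in the runtime; the list head is the sentinel. */
        for (cl = rt->contextList.next; cl != &rt->contextList; cl = cl->next) {
            if ((JSContext *)cl == cx)   /* links is the first JSContext member */
                return JS_TRUE;
        }
        return JS_FALSE;                 /* cx has been destroyed (or never existed) */
    }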
So, I'm still having problems in my xpcshell testcase. I added the patch I'll attach to make xpcshell use Begin/EndRequest for the main thread. With that I hang at that same 'ol place. The ownercx is that of the main thread and since ownercx->requestDepth == 1 the quick out is not taken. The main thread is waiting in my call to t1.join(). The two other threads get stuck in PR_WaitCondVar(rt->scopeSharingDone, PR_INTERVAL_NO_TIMEOUT); I don't see that the main thread should still be holding this. Clearly the first other thread hit this while I was trying to join it and got stuck and the second other thread got some time and hit the same blockage. I'm not seeing anything odd in the main thread stack that indicates it would have an object's scope locked. Ideas?
jband: you've created a simple AB-BA deadlock by blocking the main thread (in join) without suspending the request it's in. The rule for JS_BeginRequest and JS_EndRequest is that they should bracket maximal non-blocking sequences of JS API calls and native calling code. It's an awkward model for request-unaware blocking-native-method implementations, obviously. But it's the best we have right now for keeping the GC at bay. Note that you could easily deadlock even without the scope sharing patch, if memory was low and the thread you were trying to join with ran a last-ditch GC. /be
Ah right. Thanks Brendan. See bug 60303 for a proposed fix to xpconnect for just such a situation.
Note dependency. Can I get r= and sr= on the last patch? Or is more testing needed (in which case can I persuade others to help test the patch)? /be
Depends on: 60303
Now that I'm done tripping over the changes and my own confusion I'll actually review the code. I do think it would be good to push forward with some multithreaded testing. But, that need not necessarily preceed the checkin. Another code reviewer or two would be real nice.
I promise not to monkey with this, and I'll check in the previous patch if the latest is scary, but it simply unions scope->count and scope->link, based on the observation that the two uses of those words (or the unioned word) are disjoint in time. A single-threaded scope does not use count (and can assert that count is zero) in the non-union case, and a multi-threaded scope does not use link (and can assert that link == 0). /be
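The storage sharing being described, sketched with an invented struct name (only the ownercx/count/link members come from this bug; the real JSScope has more fields). The point is that link is used only while the scope is still exclusively owned and queued on rt->scopeSharingTodo, and count only once the scope is shared, so the two never overlap in time.

    struct JSContext;                      /* opaque for this sketch */

    struct JSScopeSketch {
        struct JSContext *ownercx;         /* exclusively owning context, or NULL once shared */
        union {
            long                  count;   /* reentrancy count: used only after the scope is shared */
            struct JSScopeSketch *link;    /* rt->scopeSharingTodo link: used only while still owned */
        } u;
        /* ...lock and property-map members elided... */
    };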
Small change, last one for a while in this direction (I hope!): the previous patch, which unified storage for scope->count and scope->link, assumed that NULL's representation as a jsword was 0. The latest patch does not abuse the union (which exists merely to share storage, while providing strong types for the two arms of the union) to "pun" from 0 integral to null pointer type value. No diff -wu this time, enough patches for you! /be
[mid-air collision - posting anyway...] Brendan, I'm just starting to look at the code (again) and also running heavier multi-threaded tests. After fixing some locking and JS Request bugs in xpconnect I have it so I can run a test that loops through creating/running/joining tens and hundreds of threads that each run some JS and touch a couple of objects. Without your patch I'm to the point where this runs well - except for an occasional crash that looks like it can probably be attributed to memory corruption. I'm trying to track that in Purify. With your patch it runs a lot, but has just hit the assert:

    if (!js_LiveContext(rt, ownercx) ||
        !ownercx->requestDepth ||
        ownercx->thread == cx->thread) {
        JS_ASSERT(scope->u.count == 0);

ownercx->requestDepth is 0
scope->u.count is 1

It is unclear to me if that is really a count of 1 or the magic number for the link. Are you certain that the union is safe (at least with respect to this assertion)? Thoughts?
jband: try the union-less patch, *or* make NO_SCOPE_SHARING_TODO be 0xcafebabe cast to JSScope*, or some such. Also, if you have this in the debugger still, or can, what is rt->scopeSharingTodo? How many scopes are on it? I worried about this case last night but convinced myself it was a can't happen, because any multithreaded access that caused the scope to be put on rt->scopeSharingTodo must have been succeeded by a JS_EndRequest before this three-part || condition could be true. Now I'm rethinking my proof. /be
Forgot to add: the union is safe, because scope->u.count is never used until scope->ownercx is null (i.e., till the scope is shared and the JS_LOCK_SCOPE and JS_UNLOCK_SCOPE macros take the slow path into js_LockScope and js_UnlockScope, which worry about reentrancy counting). The hazard here, which I still think is a "can't-happen", is not that some slow path will use scope->u.link as if it were a count; it's that (as you supposed in allowing that the 1 value in scope->u.count was NO_SCOPE_SHARING_TODO) the scope will be on rt->scopeSharingTodo at the point the assertion botches. /be
rt->scopeSharingTodo points to the scope in question - so only one in the list. I'll try a more intersting value for NO_SCOPE_SHARING_TODO and see if I hit the assert again.
Argh, I see a race: JS_EndRequest decrements cx->requestDepth (which is thread local because cx may be used by one thread at a time, only) before testing for zero and then calling JS_LOCK_GC(rt). So if an ownercx is finishing a request, but gets preempted after decrementing ownercx->requestDepth to zero and before taking the scope off rt->scopeSharingTodo, and another thread tries to lock that scope, boom. But I wonder whether you hit this tiny window, or whether there's yet another bug lurking. Ah, mid-air collision joy. Ok, so the scope is on rt->scopeSharingTodo yet its ownercx is not in a request. Since a scope only ever gets put on that list if it is accessed on another thread than ownercx's thread, *and* ownercx at the time of access has a non-zero requestDepth, I don't see how scope got on the list apart from the race I identified above. Comments? I'll come up with a patch that closes that race right away. /be
Hit it again. Same deal. u.count is 0xcafebabe. If I could map from ownercx->thread to msdev's thread ids I could maybe see what the other thread was up to (I have 40 threads in this case).
cx->thread is the PRThread*, per jslock.h's CurrentThreadId() macro. You should be able to get to the windows thread id from there. I don't see another race, so I hope my latest patch does the trick. /be
There was a thread in JS_EndRequest waiting on the GC lock and it has the right cx - so this looks like the source of the problem. I'll try the new patch.
OK. Now I see two problems...

1) hit: JS_ASSERT(scope->u.count == 0); in jslock.c line 712. This is the else clause for Thin_RemoveWait.

2) Full deadlock. Every thread waiting in PR_WaitCondVar (either in js_LockScope1 or JS_BeginRequest etc).
My stupid brain knew this in an early draft, but I deleted it as I juggled the number of condition variables from one per runtime to per context and back: A loop was required in js_LockScope1 given the multiplicity of ownercx and the single rt->scopeSharingDone condition. All waiters on the latter get notified by any ownercx's JS_EndRequest, but only a particular request-ending ownercx's contested scope(s) should be lock-able. The rest must recheck their invariants and wait again if necessary. I also tightened up the 0=>1 cx->requestDepth transition to be protected by rt->gcLock, matching the previous patch's 1=>0 transition, so that js_LockScope1 can be sure that !ownercx->requestDepth means there is no request on that cx. /be
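The fix described here is the classic condition-variable pattern: every waiter rechecks its own predicate in a loop, because a single shared condition variable wakes everyone while satisfying only some. A generic NSPR sketch; the function and parameter names are illustrative, not lifted from jslock.c.

    #include "prlock.h"
    #include "prcvar.h"

    /* Wait until *scopeIsShared becomes true, tolerating spurious wakeups and
     * wakeups meant for other waiters on the same condition variable. */
    static void WaitUntilShared(PRLock *gcLock, PRCondVar *scopeSharingDone,
                                const int *scopeIsShared)
    {
        PR_Lock(gcLock);
        while (!*scopeIsShared) {
            /* PR_WaitCondVar releases gcLock while blocked and reacquires it
             * before returning, so the predicate test is always under the lock. */
            PR_WaitCondVar(scopeSharingDone, PR_INTERVAL_NO_TIMEOUT);
        }
        PR_Unlock(gcLock);
    }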
Seriously looking for r= and sr= now. I'll provide a bunch of notes compiled from the diff -wu attachment soon. /be
Cc'ing jdunn. Jim, I'm hoping for help testing the next-to-last patch (diff -u format, "looks like we have a winner") on the various platforms you care about. Just cd js/src and patch < winner. Thanks, let me know how I can help. /be
- Radical object (scope) locking optimization: don't lock if a scope is accessed on the context that exclusively owns it (initially, the context on which the scope was created). Once a scope becomes shared among more than one owner-context, give it the usual thin or fat lock, per existing jslock.c code. I did this at the memory cost of another word per JSScope, ownercx, which raised scope size from 12 to 13 words if !DEBUG. I also added a linked list head pointer, rt->scopeSharingTodo, and a scopeSharingDone condition variable to JSRuntime, and a scopeToShare pointer to JSContext that's necessary for deadlock avoidance. The rt->scopeSharingTodo list links JSScopes through the scope->u.link union arm, which overlays the pre-existing scope->count (now u.count) member. This list holds scopes still exclusively owned by a context, but wanted by js_LockScope calls active on other threads. Those calls wait on the rt->scopeSharingDone condition, which is notified every time an owner-context ends the request running on it, in which it may use the scope freely (because it still has exclusive ownership). To avoid AB-BA deadlocks, if a js_LockScope attempt on one context finds that the owner-context of the scope is already waiting on a scope owned by the current context (or indirectly depending on such a scope lock), the attempt converts the scope from lock-free exclusive ownership to shared ownership (thin or fat lock).

- Fix js_SetupLocks and the js_LockGlobal/js_UnlockGlobal code to avoid divmod instruction costs, strength-reducing to bit-mask instructions (a generic sketch of this transformation follows these notes).

- The radical lock-free scope change required care in handling the 0=>1 and 1=>0 transitions of cx->requestDepth, which was till now thread-local because part of the JSContext not manipulated by other threads. It's still updated only by cx's thread, but it is read by other threads in the course of attempting to claim exclusive ownership of a scope for more lock-free JS object operations.

- Fixed various nits in jslock.[ch], including using Init/Finish rather than New/Destroy for the methods that take a JSThinLock and initialize and finish/free its members. Another example: JS_ATOMIC_ADDREF is now JS_ATOMIC_INCREMENT and JS_ATOMIC_DECREMENT, so the two cases can be mapped to PR_AtomicIncrement and PR_AtomicDecrement if appropriate.

- Cleaned up gratuitous casts in jscntxt.c by using &cx->links, etc.

- The lock used for mutual exclusion around both request begin and end vs. GC synchronization is rt->gcLock, and this lock now also protects all scope->ownercx pointer changes from non-null (exclusive) to null, the rt->scopeSharingTodo/scope->u.link list operations, and of course the rt->scopeSharingDone condition. But this means that js_GC cannot hold rt->gcLock across the bulk of its body, in particular the mark phase, during which JS_GetPrivate calls, e.g., may need to "promote" scope locks from lock-free to thin or fat, because doing so would double-trip. There never was any good reason to hold rt->gcLock so long, of course -- locks are for mutual exclusion, not for waiting or notifying a thread -- those operations require a condition, rt->gcDone, which we already use along with rt->gcLevel to keep racing GC attempts at bay. So now that rt->gcLock does not protect the mark phase, the enumeration of rt->gcRootsHash can race badly with JS_RemoveRootRT, an API that may legitimately be called outside of a request, without even a context.
It turns out that people may be cheating on the request model even with JS_AddRoot, JS_AddNamedRoot, and JS_RemoveRoot calls, so we must make all of those interlock with the GC using gcLevel and gcDone, unless they are called on the gcThread.

- I added comments to jslock.h making it plain that callers of JS_LOCK_OBJ and JS_UNLOCK_OBJ must either be implementations of js_ObjectOps hooks, or code reachable only from those hooks; or else must be predicated on OBJ_IS_NATIVE tests. It turns out jsinterp.c's CACHED_GET and CACHED_SET macros neglected to do such tests, limiting the ability of JS embeddings to implement JSObjectOps with their own non-JSScope JSObjectMap subclass. Fixed, small performance hit that the lock-free optimization should more than make up for.

- jslock.c now gives a #error if you try to compile it on a platform that lacks a compare-and-swap instruction. The #error says to use NSPR locks. Before this change, some platforms would emulate compare-and-swap using a global PRLock, which is always worse in runtime than using per-scope PRLocks.
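The js_LockGlobal note above mentions strength-reducing a divmod to a bit mask. The generic transformation, for a table size that is a power of two (a sketch, not the patched code; names are invented):

    /* Map a key to one of nbuckets global locks; nbuckets must be a power of two. */
    static unsigned BucketIndex(unsigned long key, unsigned nbuckets)
    {
        /* key % nbuckets needs a divide instruction; when nbuckets is a power
         * of two, key & (nbuckets - 1) yields the same result with a single AND. */
        return (unsigned)(key & (nbuckets - 1));
    }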
Ok, now that everyone's fat and happy on turkey, how about an r= and sr=? /be
Tried these changes on both hpux & aix, debug and optimized, didn't see any problems. I don't think I am qualified to give you a r= brendan... My only question is the change in jscntxt.c:

  @@ -249,7 +261,7 @@
       JS_LOCK_RUNTIME(rt);
       if (!cx)
           cx = (JSContext *)rt->contextList.next;
  -    if ((void *)cx == &rt->contextList)
  +    if (&cx->links == &rt->contextList)
           cx = NULL;

Is there ANY chance cx->links could be NULL? thus having &cx->links go batty? (I know some os's don't like you taking a &(NULL).)
Jim, thanks for the testing, and hpux reminds me: do you have an H-P Precision Architecture manual, or otherwise know of a compare-and-swap atomic instruction we could use there? The AIX code in jslock.c is nice because AIX provides a <sys/atomic_ops.h> header and a _check_lock function/inline/compiler-primitive or whatever it is. The links member of JSContext is the first member, and must be for the JSCList usage in jscntxt.c to work, so &cx->links is the same address as (void *)cx, but of the right type to compare to &rt->contextList (namely, JSCList *). Ye olde "poor man's subclassing from a linked-list element struct". /be
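A sketch of the layout trick being described, with invented struct names (the real JSCList/JSContext definitions have more members): because the list links sit at offset zero, a pointer to the links and a pointer to the containing context are the same address, so comparing against the runtime's embedded list head is a well-typed end-of-list test.

    typedef struct ListLinksSketch {
        struct ListLinksSketch *next;
        struct ListLinksSketch *prev;
    } ListLinksSketch;

    typedef struct ContextSketch {
        ListLinksSketch links;   /* must be the first member: the node overlays the struct */
        /* ...the rest of the context... */
    } ContextSketch;

    /* Iterate contexts on a circular list whose sentinel head lives in the runtime. */
    static ContextSketch *NextContext(ListLinksSketch *head, ContextSketch *cx)
    {
        ListLinksSketch *next = cx ? cx->links.next : head->next;
        return (next == head) ? NULL : (ContextSketch *)next;
    }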
r=jband (but this *really* strains my brain) One thing... I compiled with DEBUG_SCOPE_COUNT defined to try out the logging. setlinebuf is not to be found in NT. I suggest you replace it with: setvbuf(logfp, NULL, _IONBF, 0); /* same as... setlinebuf(logfp) */ The man claims that these are equivalent. http://www.datafocus.com/docs/man3/setlinebuf.3.asp
jband: I changed setlinebuf to setvbuf, thanks. Thanks too for your great testing help, which really strained early, buggy versions of this patch's tiny brain. If I get an r=, will your r= be an sr=? I'm still looking for an r= -- bueller? /be
jband's cautionary "strains my brain" caused me to navel-gaze over this patch some more, and I realized something obvious in hindsight: the PR_WaitCondVar in jslock.c on rt->scopeSharingDone must end any requests nested on the calling cx, per the usual "requests can't call blocking native methods" code. I therefore inlined (because rt->gcLock is held already) and specialized code from JS_EndRequest before the wait and JS_BeginRequest after. This caused me to re-review the GC, and I realized that ever since bug 49816 was fixed, there was no need for a finalize phase distinct from the sweep phase. That used to be necessary when finalizers could allocate new GC-things; the finalize phase would run outside of rt->gcLock. But 49816 prevents allocation from finalizers, so there is no need for gc_finalize_phase() or rt->gcFinalVec. I'm sorry to spark re-review, but it's a smallish increment (if you care to diff the diffs). Attached next is an updated version of my checkin comments. /be
I'm not going for the "most patches in a bug" record, really I'm not! Unless the changes in the last few patches, which I think are necessary (suspending and then resuming the request around the PR_WaitCondVar in ClaimScope) and/or desirable (reuniting the sweep and finalize phases of js_GC), have regressed something subtly, I hope to get r= and sr= so this big change can land. There may be problems later, to be reported by separate bugs. If this most recent patch looks good and tests well, I'd like to get it in now. I'm sorry for morphing kandrot's "3x slower" report -- I'll rerun all the tests reported here and file new bugs, with an appropriate meta-bug depending on them. I hope to collect more and better benchmarks still. /be
Upping to P1 to match my priorities. /be
Priority: P3 → P1
Target Milestone: mozilla0.9 → mozilla0.8
The last substantive change, which I thumbnailed as "suspending and then resuming the request around the PR_WaitCondVar in ClaimScope", left ClaimScope's earlier code, which attempts to claim exclusive ownership of the scope, failing to test scope->u.link before claiming. The magic linked list terminator we use for rt->scopeSharingTodo, threaded through scope->u.link, ensures that we can test whether that pointer is non-null to decide whether the scope is already in the list, and bound to be promoted to shared ownership. We need to test scope->u.link and ensure it's null before we can try to claim exclusive ownership, or we'll grab a scope that's headed for shared ownership and race badly when the request whose context still owns it wakes up from the PR_WaitCondVar in ClaimScope and resumes its request, thinking it still has exclusive access. /be
FWIW... the test I just attached is the one I've been running with a lot. I just added the timing part... 2.83 seconds (on average) without brendan's changes 2.45 seconds (on average) with brendan's changes
Hey Brendan, You'll need another patch... It won't compile ifndef JS_THREADSAFE. You use rt->gcThread 'unprotected' in some places in jsgc.c. Might be some other issues too.
On my older timing test (timing.js) I now see results in xpcshell with JS_THREADSAFE #defined and the new code in place giving results identical to what I see in jsshell non-threadsafe...

  > global 0.03 seconds.
  > object 0.04 seconds.
  > local 0.01 seconds.

This is all more-or-less +/- 0.01. This is like a 3x speedup on this property access intensive code when only one thread is doing anything yet threadsafety is in place. Wow!
Wow, jband's stress test (and his machines, my Linux box won't do it) finds all sorts of rare bugs. js_ContextIterator didn't deal at all well with the next context in the list being destroyed. This is easily fixed by storing the current context in *iterp (it was never ok to destroy the current context in the list). /be
Some r= thoughts from my very cursory look. I understand the basics, but none of the threadsafety risks...

What's js_LockScope1 for? It's in a common call chain; is the module-static call optimized away?

Can WillDeadLock ever fall into a loop not containing the checked-for context? WillDeadLock would have detected that loop when it was created, yes?

As you mention, walking the context list implies O(n**2) runtime behavior, but n is small and walking happens rarely. If it's worth fixing, seems like a context hash would fix this. (From your testing, doesn't look like it's important.)

Could the context still be dead after jslock.c:350? It seems like the code assumes it's live.

jslock.c:ClaimScope -
    /*
     * Inline JS_SuspendRequest before we wait on rt->scopeSharingDone,
     * saving and clearing cx->requestDepth so we don't deadlock if the
     * GC needs to run on ownercx.
     */
Why? We're in JS_LOCK_GC... Or is this accomplishing something different? I don't think I understand the GC interleaving. JS_AWAIT_GC is just below, I must need tutoring.

Is the JS_RemoveRootRT-motivated locking well tested?

Have the jband-tests been run on a non-thinlock platform or when avoiding thinlocks by changing #defines? (can it matter?) Looks like there's a few longish codepaths #ifdef'd on this.
>What's js_LockScope1 for? It's in a common call chain; is the module-static
>call optimized away?

It's from bjorn's code, and it is a common subroutine of js_PromoteScopeLock, js_LockScope, and js_LockObj. It avoids reloading cx->thread each time through the loop in js_LockObj, but that's a hard case -- maybe we could just unify js_LockScope1 and js_LockScope.

>Can WillDeadLock ever fall into a loop not containing the checked-for context?

No, because it must reach a null scopeToShare or there is already a cycle, which means there is already a deadlock, which contradicts the hypothesis.

>WillDeadLock would have detected that loop when it was created, yes?

Right.

>Could the context still be dead after jslock.c:350? It seems like the code
>assumes it's live.

It's safe to assume ownercx is alive there because either scope->u.link is non-null, which implies ownercx has not ended its request and no other context has shared scope to avoid deadlock; or we called js_LiveContext and it returned true (and ownercx was in a request and not on the same thread as cx -- if either of those conditions were false, we'd claim scope and return early at 349).

>jslock.c:ClaimScope -
>    /*
>     * Inline JS_SuspendRequest before we wait on rt->scopeSharingDone,
>     * saving and clearing cx->requestDepth so we don't deadlock if the
>     * GC needs to run on ownercx.
>     */
>Why? We're in JS_LOCK_GC...

Recall that the condition variable releases its lock while it waits.

>Or is this accomplishing something different? I don't think I understand the GC
>interleaving. JS_AWAIT_GC is just below, I must need tutoring.

The GC would deadlock waiting for our request to end while we waited for it to run, if it had to run in order for the scope to become shared (say, as a last-ditch effort when low on memory). Therefore we must temporarily reduce rt->requestCount by one (that counter counts contexts in requests, not total requests; more than one may nest on a cx). We also saveDepth and clear cx->requestDepth just in case another request wants to claim a scope that *we* own, above at 341-349.

>Is the JS_RemoveRootRT-motivated locking well tested?

Yes, jband's testsuite smoked it out and it's provably correct.

>Have the jband-tests been run on a non-thinlock platform or when avoiding
>thinlocks by changing #defines? (can it matter?) Looks like there's a few
>longish codepaths #ifdef'd on this.

Neither of us have worried about that, because the ownercx lock-free path is independent of thin vs. fat. Or so it seems to me!

Thanks -- ask me more questions if you have any.
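The WillDeadLock answer above can be pictured as a walk along "waits-for" edges: context -> scope it is blocked on -> that scope's owning context, and so on. A self-contained sketch with invented struct names (only the ownercx and scopeToShare fields are taken from this bug's comments; the body is an illustration, not the reviewed patch):

    struct ScopeSketch;

    typedef struct CtxSketch {
        struct ScopeSketch *scopeToShare;  /* scope this context is blocked on, or NULL */
    } CtxSketch;

    typedef struct ScopeSketch {
        CtxSketch *ownercx;                /* context that exclusively owns this scope */
    } ScopeSketch;

    /* Would making cx wait for scope close a cycle (and so deadlock)? */
    static int WouldDeadlock(ScopeSketch *scope, CtxSketch *cx)
    {
        CtxSketch *ownercx = scope->ownercx;

        while (ownercx && ownercx != cx) {
            scope = ownercx->scopeToShare;
            if (!scope)
                return 0;                  /* chain ends without reaching cx: safe to wait */
            ownercx = scope->ownercx;
        }
        return ownercx == cx;              /* a cycle through cx means waiting would deadlock */
    }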
Just one more... Have you run in a small-heap runtime?
Running the stress test with the latest patch on the Mac apparently deadlocks before returning from the first call to runThreads(). The value of foo always gets printed as 0. The Mac's threads are software only (setjmp/longjmp variant), so there's no real preemption going on. I'll need to rebuild NSPR with some instrumentation turned on to see where the deadlocks are.
Patrick, Do you have a fully updated build? Changes to xpconnect and xpcom/threads went in a couple of days ago (though it would probably not work at all without this). Also, brendan has a change to jsgc.c that has not shown up here. And, see my patch in bug 61788. But really, I would not expect any of these to cause deadlock.
The testsuite runs into its runtime's gc-limit and survives. I can run mozilla with a tuned down gcSize of 256K, slowly -- until I try to start mail too, and then the GC heap is just too small -- endless last-ditch calls to js_GC from js_AllocGCThing, without finding enough garbage to get out of the ditch. /be
Evidently none of the calls to PR_WaitCondVar() in ClaimScope() (line 405 of jslock.c) are ever returning, leading to eventual deadlock when all the joins are attempted. I'm going to see if this has any connection to the GC leak detector by trying in an optimized build. jband: my build is from a pull made sometime after midnight Thursday morning.
Patrick, if it's a deadlock, can you cite the stacks of the two or more threads who are depending on one another's resources? /be
Patrick: sorry, mid-air collision and I didn't read your last comment fully. If Mac threads get stuck waiting in PR_WaitCondVar in ClaimScope, then can you look at the scope they're waiting for? Has it a null ownercx, or is its ownercx still pointing at a context? If the latter, is that ownercx in a request still (requestDepth != 0)? What is the context-thread's stack? /be
scope->ownercx isn't NULL, and if I set a break point on ShareScope, it never gets hit.
Also put a breakpoint in the inlined, optimized version of ShareScope in JS_EndRequest, and see if that doesn't get hit. If it doesn't, or in any case, how about tracking scope->ownercx->thread and backtracing that thread? Thanks /be (on IRC #mozilla too)
Nope, the inlined version never gets hit either (I set it on line 807 of jsapi.c). I can't get you any more information about which threads own what, just yet, because NSPR on the Mac isn't very cooperative about giving that information. I custom hacked a version a while back that lets another process do a thread dump, but I haven't got that build here. I'll have to integrate it and get back to you.
beard: Sorry, I don't know much about how nspr does this and these may be stupid questions... . Are you certain the other pseudo-threads run at all? I note there is a bug on the fact that xpcshell does not set up an event loop (bug 56389). Is this not required for nspr to run the other pseudo-threads? Or is the call to PR_Join or the other PR_CreateThread calls supposed to yield on their own? Does this work without brendan's changes? Also (sorry if this is way obvious), you can certainly set the number of threads low in threads.js to make it easier to see what's going on once you can see thread stacks.
Threads are running, I see the MemoryFlusher thread and the idle thread running just fine. I have reduced the test case to only run with 10 threads, and that's where I'm seeing the deadlock.
Patrick: with 10 or fewer threads, can't you just switch to each of them in the debugger and backtrace? Or is the problem getting CW to switch to another user-level thread? I can make gdb do that on MIPS, by setting $pc and $sp, for what it's worth. /be
So every thread of interest is waiting under ClaimScope except for thread 00000[16afa938] pri= 1 flags=0x68 condvar=15e2f078 sleep=6217456ms, which is calling PR_JoinThread via nsIThread, via XPConnect? If so, can you verify that the scope(s) being waited on by all the ClaimScope threads have as their ownercx the cx implicated in the PR_JoinThread stack trace? If that's true, then either you don't have jband's patch to suspend and resume a request around the XPTC_InvokeByIndex, or that patch isn't enough. Can you check that you have rev 1.51 of xpcwrappednativeclass.cpp in your tree, and that the object file is up to date? The bug jband fixed was bug 60303, if you need to apply a patch. /be
I definitely have rev 1.51 of mozilla/js/src/xpconnect/src/xpcwrappednativeclass.cpp, and am running with it. When nsXPCWrappedNativeClass::CallWrappedMethod calls JS_SuspendRequest(), cx->requestDepth == 2, so JS_NOTIFY_ALL_CONDVAR(rt->scopeSharingDone) never gets run. I'm loading the script from the already running XPCShell via load('stresstest.js'), if that makes a difference.
And, during the call to PR_JoinThread(), the owning context does turn out to be the main thread's context, and since JS_SuspendRequest didn't perform the appropriate interlock, it's not surprising that the deadlock happens.
Props to beard for testing interactively. I fixed JS_SuspendRequest to suspend any depth request so that the GC can run and scopes can be claimed or shared. This is an old hole in the under-tested suspend/resume API. I also unified js_LockScope1 and js_LockScope based on mccabe's feedback; the only loss is a reload of cx->thread when js_LockObj retries after losing a race with a mutator of an obj that was sharing its prototype's scope. That's rare enough to be a don't-care. Bart: Are we there yet? Homer: Just a little further... Bart: Are we there yet? Homer: Just a little further... Bart: Are we there yet? Homer: Just a little further... /be
[mid-air w/ brendan (three times!)] Patrick. Yes it does make a difference that you are using 'load'. I can reproduce this on NT that way too. This is just my stupid mistake in wrapping the code inside Load in a request. Since it can only be called from the engine, this is just not necessary and should be removed:

Index: xpcshell.cpp
===================================================================
RCS file: /cvsroot/mozilla/js/src/xpconnect/shell/xpcshell.cpp,v
retrieving revision 1.46
diff -u -r1.46 xpcshell.cpp
--- xpcshell.cpp	2000/11/30 06:58:33	1.46
+++ xpcshell.cpp	2000/12/04 00:27:56
@@ -236,7 +236,6 @@
             return JS_FALSE;
         argv[i] = STRING_TO_JSVAL(str);
         filename = JS_GetStringBytes(str);
-        DoBeginRequest(cx);
         script = JS_CompileFile(cx, obj, filename);
         if (!script)
             ok = JS_FALSE;
@@ -244,7 +243,6 @@
             ok = JS_ExecuteScript(cx, obj, script, &result);
             JS_DestroyScript(cx, script);
         }
-        DoEndRequest(cx);
         if (!ok)
             return JS_FALSE;
     }

This does, however, make me concerned about my uses of requests in xpconnect/src. Should I make special provisions to ensure that they never nest? Should the JS Engine supply a public function to get the request depth?
OK, so with brendan's change I don't have to worry about nesting too deep. But the xpcshell change ought to be made anyway I think.
jband: read brendan's commentary above re: JS_SuspendRequest nesting vs. scope sharing. So, you shouldn't have to fix xpcshell.cpp. r=beard
Brendan, what if suspend/resume nests? Why not make the caller to this pair hold the depth on their stack?
I agree with jband that xpcshell need not wrap a request around compile/execute from Load, given the fact that Load can be ultimately (directly or indirectly via scripted functions) called only from a top-level script that's already inside a request that the shell wraps around top-level script execution. xpcshell can be fixed with self-review as far as I am concerned, as it's not part of the Mozilla-the-giant-lizard build and runtime. Jband, how about an sr=? /be
In case it was not clear. I mean:

Begin
Begin
Suspend
Begin
Suspend
Resume
End
Resume
End
End

You should either hold a depth stack internally or make it an out/in param pair to Suspend/Resume.
jband: I was hoping to maintain API source and binary compatibility, even though suspend and resume are obscure. That implies a stack; argh. Patch coming up, hope this one is my final answer -- but thanks for the lifeline! /be
The JS_{Suspend,Resume}Request API incompatibility is regrettable, but unlikely to cause much stress from all I know about the multi-threaded JS embeddings out there. We'll find out for sure, because I see no other way (an internal stack of saved requestDepths would need to allocate stack space somehow, which implies fallibility, which means an API break). Ok, I'm about to declare victory and report any future problems in future bugs. Who is with me? I think I have two r='s on recent versions of the patch that have been successively refined. I don't mind re-review, but don't require it as a rule for any Mozilla review-seeker. I do, however, still crave a jband sr=. /be
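The calling pattern implied by this change: the embedder keeps the saved request depth on its own stack across a blocking call and hands it back on resume. In this sketch the jsrefcount return/parameter types and the join-a-worker example are assumptions; only the save/restore shape is the point.

    #include "jsapi.h"
    #include "prthread.h"

    static void JoinWorkerOutsideRequest(JSContext *cx, PRThread *worker)
    {
        jsrefcount saveDepth = JS_SuspendRequest(cx);  /* suspend however deeply requests nest */

        PR_JoinThread(worker);                         /* blocking call: GC and scope sharing can proceed */

        JS_ResumeRequest(cx, saveDepth);               /* restore the saved nesting */
    }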
The comment in xpconnect and the changed use of suspend/resume claim it is ok to do so even when not in a request. Yet, in suspend you assert that we are in a request.
sr=jband
Checked in (toasted speedracer, the ultrasparc/gcc tinderbox machine -- turns out we never used compare-and-swap on that target architecture/compiler combo, falling back on the global-lock-based c&s emulation!). Many thanks to all who helped, especially to jband for his great stress-test setup and particularly helpful (in terms of reproducing bugs in draft patches) MP and laptop machines. I've filed bug 61897 on the suboptimal allocation behavior (;-) for one-char strings indexed from longer strings, and bug 61898 for the lack of int-domain optimized arithmetic where possible. This bug is dead, sorry again for morphing it but let's let it RIP. Any new probs, open a new bug. /be
Status: ASSIGNED → RESOLVED
Closed: 24 years ago
Resolution: --- → FIXED