Closed Bug 502736 Opened 16 years ago Closed 7 years ago

SunSpider survey: opcode count, type stability, cycles per opcode...

Tracking

()

Status:

RESOLVED INCOMPLETE

People

(Reporter: wagnerg, Assigned: gwagner)

Details

Attachments

(9 files)

detailed txt file for sunspider benchmarks 16 years ago Gregor Wagner 183.77 KB, text/plain		Details
instrumentation code 16 years ago Gregor Wagner 73.98 KB, text/plain		Details
lifetime and dslots size for SunSpider 16 years ago Gregor Wagner 30.10 KB, text/plain		Details
improved version 16 years ago Gregor Wagner 44.89 KB, text/plain		Details
All SunSpider 16 years ago Gregor Wagner 1.01 KB, text/plain		Details
XYChart for sunspider lifetime vs. size 16 years ago Gregor Wagner [:gwagner] 13.93 KB, image/png		Details
Waste for SS benchmarks. 16 years ago Gregor Wagner 517 bytes, text/plain		Details
Each individual benchmark + size relationship 16 years ago Gregor Wagner 54.02 KB, text/plain		Details
XYChart for dslots size without length slot 16 years ago Gregor Wagner 4.75 KB, image/png		Details

Gregor Wagner

Reporter

Description

•

16 years ago

User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9.1) Gecko/20090624 Firefox/3.5 Build Identifier: No bug, some facts about the SunSpider benchmarks: Very few GVars and they are very stable. Only in one test case the type of a GVar changes. Locals are very stable and mostly Integers. Divisions are very rare but mostly with Integer types and the divisor is 2 or 8. By far the most executed opcode is getLocal. Specialized subtraction for integers instead of doubles reduces the cycles per subtraction from about 40 to 20. The same with addition reduces the cost per addition from about 48 to 28. Reproducible: Always

Gregor Wagner

Reporter

Comment 1

•

16 years ago

Attached file detailed txt file for sunspider benchmarks — Details

Brendan Eich [:brendan]

Comment 2

•

16 years ago

Integer meaning 31-bit INT_FITS_IN_JSVAL int? Could you attach the instrumentation patch? Doesn't matter if hacky, just want to see all the details. Thanks, /be

Gregor Wagner

Reporter

Comment 3

•

16 years ago

I changed for example the JSOP_SUB case in the interpreter from BINARY_OP(-) to following: BEGIN_CASE(JSOP_SUB) //BINARY_OP(-); rval = FETCH_OPND(-1); lval = FETCH_OPND(-2); if((lval & rval) & JSVAL_INT) { i = JSVAL_TO_INT(lval); i -= JSVAL_TO_INT(rval); regs.sp--; STORE_INT(cx, -1, i); } else { VALUE_TO_NUMBER(cx, -2, lval, d); VALUE_TO_NUMBER(cx, -1, rval, d2); d -= d2; regs.sp--; STORE_NUMBER(cx, -1, d); } END_CASE(JSOP_SUB) I am looking at the controlflow-recursive testcase from sunspider. With the BINARY_OP(-) code I get about 40 cycles per sub. With this code I get about 16 cycles per sub if the type is integer. The instrumentation details will follow.

Andreas Gal :gal

Comment 4

•

16 years ago

We could try some asm("") magic to check for overflow, or even adding longs and checking for x >> 32 == 0 might do the trick. Worth a try. 16 vs 40 is probably due to the int -> double and then double -> int conversions. Long math is a bit more expensive, but staying in the integer domain saves a lot of i2f/f2i business.

Andreas Gal :gal

Comment 5

•

16 years ago

Actually scrap that. asm("") magic doesn't work since we want 31-bit overflow. So just add the numbers as longs and mask out invalid results.

Gregor Wagner

Reporter

Comment 6

•

16 years ago

If we do the calculation with long integers and perform the proper overflow check we kill the performance win again. We win about 2-3 cycles per sub but I guess that's not really worth it.

Gregor Wagner

Reporter

Comment 7

•

16 years ago

Attached file instrumentation code — Details

Short description: One big data structure -> instrumentationStruct Cycle count for opcodes happens mostly in DO_OP() and BEGIN_CASE() in js_interpret(). Output and calculation in printInstrumentation() Instrumentation add, remove in jscntxt.h

Gregor Wagner

Reporter

Comment 8

•

16 years ago

Attached file lifetime and dslots size for SunSpider — Details

Lifetime analysis and dslots size for objects in SunSpider. Execution time starts at JS_NewRuntime() and ends at JS_DestroyRuntime(). I also use JS_GC_ZEAL to GC at each allocation. The size for each object is defined as if(obj->dslots) totalSize = (((uint32)obj->dslots[-1])*sizeof(obj->dslots[0])); and is stored during FinalizeObject().

Gregor Wagner

Reporter

Comment 9

•

16 years ago

Attached file improved version — Details

improved version where dslots = 0 is separate.

Gregor Wagner

Reporter

Comment 10

•

16 years ago

Attached file All SunSpider — Details

all benchmarks are executed as a single file.

Gregor Wagner [:gwagner]

Assignee

Comment 11

•

16 years ago

Attached image XYChart for sunspider lifetime vs. size — Details

-Axis shows lifetime in cycles for all objects Y-Axis shows dslots size for each object. Note the logarithmic scale.

Brendan Eich [:brendan]

Comment 12

•

16 years ago

Hi Gregor, great to have this -- any way to estimate what objects are stack-like, become garbage no later than return of C/C++ frame in which the allocating JS API call was made? For "objects" read gc-things -- strings and doubles definitely included. Thanks, /be

Gregor Wagner

Reporter

Comment 13

•

16 years ago

Attached file Waste for SS benchmarks. — Details

Waste for all SS benchmarks if Objectsize is increased to 64.

Gregor Wagner

Reporter

Comment 14

•

16 years ago

Attached file Each individual benchmark + size relationship — Details

Maybe we could guess the number of needed dslots. For all SS benchmarks, 76% of the objects have the same dslots size as the previous allocated object. Total Object count: 225637 Object has same dslots size as earlier allocated object: 172843 Relative: 76.6%

Brendan Eich [:brendan]

Updated

•

16 years ago

Attachment #388613 - Attachment description: Waste for SS benachmarks. → Waste for SS benchmarks.

Gregor Wagner

Reporter

Comment 15

•

16 years ago

corrected calculation for dlsots size: if(obj->dslots) { Size = (((uint32)obj->dslots[-1] - JS_INITIAL_NSLOTS + 1)*sizeof(obj->dslots[0])); } New numbers with JS_INITIAL_NSLOTS corrections: Size = 0 :155027 Size<= 8 : 8 Size<= 16: 64544 Size<= 24: 0 Size<= 32: 2248 Size<= 64: 46 Size > 64: 3764

Gregor Wagner

Reporter

Comment 16

•

16 years ago

dslots size without the length slot: if(obj->dslots) { Size = (((uint32)obj->dslots[-1] - JS_INITIAL_NSLOTS)*sizeof(obj->dslots[0])); } Size = 0 :155027 Size<= 8 : 64534 Size<= 16: 18 Size<= 24: 10 Size<= 32: 2238 Size<= 64: 46 Size > 64: 3764

Gregor Wagner

Reporter

Comment 17

•

16 years ago

Attached image XYChart for dslots size without length slot — Details

YChart for dslots size without length slot. Size = (((uint32)obj->dslots[-1] - JS_INITIAL_NSLOTS)*sizeof(obj->dslots[0])); The y-axis is limited to 200.

Brendan Eich [:brendan]

Updated

•

16 years ago

Assignee: general → anygregor

Status: UNCONFIRMED → NEW

Ever confirmed: true

Gregor Wagner

Reporter

Comment 18

•

16 years ago

(In reply to comment #12) > any way to estimate what objects are > stack-like, become garbage no later than return of C/C++ frame in which the > allocating JS API call was made? > > For "objects" read gc-things -- strings and doubles definitely included. I have an estimation now for objects and strings. Doubles will follow. These are the results for opening a spreadsheet on google docs. Startup of Firefox is included. I monitored following API function calls. Other functions like JS_EvaluateScript are included since they call one of the following functions: API_FUNCTION :CALLS JS_CallFunctionValue :5465 JS_CallFunctionName :0 JS_CallFunction :1 JS_ExecuteScript :105 JS_EvaluateUCScriptForPrinc: 234 Objects that become garbage before API return: 35882 out of 153069 allocated Objects. Strings that become garbage before API return: 18553 out of 71230 allocated Strings. I don't have any meaningful numbers for the SunSpider benchmarks since there is just a single JS_ExecuteScript call involved and almost all objects and strings become garbage before returning. Do you want more fine grained measurements for them?

André Bargull [:anba]

Comment 19

•

7 years ago

Resolving as INCOMPLETE, because SunSpider benchmark is no longer of interest in 2018.

Status: NEW → RESOLVED

Closed: 7 years ago

Resolution: --- → INCOMPLETE

You need to log in before you can comment on or make changes to this bug.