Closed Bug 481419 Opened 16 years ago Closed 14 years ago

trace repeated execution of scripts

Tracking

()

Status:

RESOLVED WONTFIX

People

(Reporter: shaver, Unassigned)

References

Details

(Keywords: perf)

Attachments

(3 files, 4 obsolete files)

treat the entry pc for a script as a loop header 16 years ago Mike Shaver (:shaver emeritus) 2.73 KB, patch		Details \| Diff \| Splinter Review
refreshed 16 years ago Brian Crowder 2.83 KB, patch		Details \| Diff \| Splinter Review
crashes my patch 16 years ago Brian Crowder 132 bytes, text/plain		Details
with less crashiness 16 years ago Brian Crowder 3.39 KB, patch		Details \| Diff \| Splinter Review
even less crashy! 16 years ago Brian Crowder 3.38 KB, patch		Details \| Diff \| Splinter Review
sunspider results 16 years ago Brian Crowder 3.80 KB, patch		Details \| Diff \| Splinter Review
closer 16 years ago Brian Crowder 3.46 KB, patch		Details \| Diff \| Splinter Review

Mike Shaver (:shaver emeritus)

Reporter

Description

•

16 years ago

Attached patch treat the entry pc for a script as a loop header (obsolete) — Details — Splinter Review

With the attached patch, we trace evals-within-loop as well as repeated script execution as from setTimeout and event handlers. We also regress unpack-code performance by 10x, so there's something not quite right!

Stuart Parmenter

Updated

•

16 years ago

Flags: wanted-fennec1.0+

Brian Crowder

Comment 1

•

16 years ago

Has anyone got thoughts on the regression shaver mentions in comment 0?

Brendan Eich [:brendan]

Comment 2

•

16 years ago

(In reply to comment #1) > Has anyone got thoughts on the regression shaver mentions in comment 0? Someone get jit stats with and without the patch, post them here. /be

Andreas Gal :gal

Comment 3

•

16 years ago

The patch needs a threshold counter per script. Here is what we do for loops: - the emitter emits a JSOP_LOOP in every loop header - at the backedge if jump target == JSOP_LOOP then we look up the fragment using target pc as key in the fragment cache, and then increment the counter in the fragment or trigger compiled code - if we ever decide to blacklist the location, we change LOOP to NOP. both are single byte opcodes so concurrency is not an issue What this patch needs: - rename JSOP_LOOP to JSOP_TRACE - change emitter to emit TRACE at every function entry, and use its PC as key (it will never be a loop target, if thats the case there is a TRACE under it) - use some reasonable threshold and the mechanics in the patch to trigger tracing

David Anderson [:dvander] - inactive, e-mail if emergency

Updated

•

16 years ago

Depends on: 515806

Brian Crowder

Updated

•

16 years ago

Depends on: tracerecursion
No longer depends on: 515806

Brian Crowder

Comment 4

•

16 years ago

Attached patch refreshed (obsolete) — Details — Splinter Review

Attachment 365435 [details] [diff] is badly rotted, here's a swat at refreshing it. It crashes on my test script.

Attachment #365435 - Attachment is obsolete: true

Brian Crowder

Comment 5

•

16 years ago

Attached file crashes my patch (obsolete) — Details

Assignee: general → crowder

Brian Crowder

Comment 6

•

16 years ago

Attached patch with less crashiness (obsolete) — Details — Splinter Review

Attachment #400591 - Attachment is obsolete: true

Brian Crowder

Comment 7

•

16 years ago

Attached patch even less crashy! — Details — Splinter Review

This one actually runs sunspider, even. Test results shortly.

Attachment #400596 - Attachment is obsolete: true

Brian Crowder

Comment 8

•

16 years ago

Attached patch sunspider results — Details — Splinter Review

unpack-code is no longer 10x slow (just a terrible 2x slower). Nothing else is improved measurably, except perhaps access/binary-trees, alas.

Brian Crowder

Comment 9

•

16 years ago

Attached patch closer — Details — Splinter Review

This is more like the right patch. Unfortunately, for short scripts, we're way worse-off tracing (because of setup/breakdown) than simply running through the interpreter. We need to learn what bug 516614 has to say about this in order to develop a heuristic for determining when it is worthwhile to JIT over such code and when it isn't.

Attachment #400592 - Attachment is obsolete: true

Brian Crowder

Updated

•

16 years ago

Depends on: 516614, 515806
No longer depends on: tracerecursion

Boris Zbarsky [:bzbarsky]

Comment 10

•

16 years ago

So what does the data look like? What does it look like if we restrict to scripts that call a function?

Boris Zbarsky [:bzbarsky]

Comment 11

•

16 years ago

And in particular, how common is this sort of pattern: function foo() { // Not very loopy but somewhat slow body } setInterval(foo, 0); or equivalent with setTimeout?

Brian Crowder

Comment 12

•

16 years ago

bug 516614 has more details about what sorts of callbacks are getting run when.

Boris Zbarsky [:bzbarsky]

Comment 13

•

16 years ago

Yes, I read that bug. I can't make sense of the data there, and specifically it doesn't answer my question from comment 11.

Brian Crowder

Comment 14

•

16 years ago

From what I saw, that pattern is very common, but foo() is often a very short function (a handful of spidermonkey opcodes being executed per-run), and in the case of chrome code foo() is actually often an empty function. The empty-function issue has been patched (I'm not sure how much benefit we saw from that? Anyone else?), and as I understand it, dvander has some work on his TODO list that will make the timer -> foo() setup/shutdown time less expensive (I don't know what bug that is). After dvander's work is complete, it should be easy enough to come up with a heuristic for deciding between traced and interpreted code for callbacks like these. That's my current understanding of our situation, at least. Hopefully gal or dvander can weigh in. Also, can you clarify what you aren't discovering from the data in the other bug that you'd like to know? Thanks

Boris Zbarsky [:bzbarsky]

Comment 15

•

16 years ago

The only data in the other bug is comment 7, right? That tells me the distribution of opcode lengths, but nothing that tells me what we could to to efficiently detect the large opcode lengths, or to efficiently detect things that correlate well with large opcode lengths. Or even what opcode lengths ought to be wins when traced (62 opcodes, I'm looking at you).

Brian Crowder

Comment 16

•

16 years ago

Might come back to this later; but in the meantime, back in the pool!

Assignee: crowder → general

Ryan VanderMeulen [:RyanVM]

Comment 17

•

14 years ago

Obsolete with the removal of tracejit.

Status: NEW → RESOLVED

Closed: 14 years ago

Resolution: --- → WONTFIX

You need to log in before you can comment on or make changes to this bug.