Open Bug 824249 Opened 12 years ago Updated 2 years ago

IonMonkey: inline v8-richards run() call

Tracking

()

Status:

NEW

People

(Reporter: bhackett1024, Unassigned)

References

(Blocks 1 open bug)

Details

(Whiteboard: [leave open])

Attachments

(1 file, 1 obsolete file)

partial patch 12 years ago Brian Hackett [Laid off!] 6.38 KB, patch		Details \| Diff \| Splinter Review
partial patch 12 years ago Brian Hackett [Laid off!] 6.38 KB, patch	dvander : review+	Details \| Diff \| Splinter Review

Brian Hackett [Laid off!]

Reporter

Description

•

12 years ago

Attached patch partial patch (obsolete) — Details — Splinter Review

The v8-richards benchmark is a task scheduler that loops over task blocks and calls the run() method on them. This run() call is a polymorphic dispatch about which we have perfect type information, but the call is currently not inlined and we do a CallGeneric which hurts performance tremendously. If I hack the code to allow inlining this call (js_new below), I get these scores (average of 6 runs): js_old: 10006 js_new: 11621 d8: 12919 So, fixing this closes more than half the gap with v8. There are three things preventing inlining of this call: - One of the inlined functions contains a loop (which doesn't actually run that much). JM+TI can't inline calls with loops because of its overspecialized loop analysis code. This restriction has carried over to Ion, but I don't see any reason why this would actually cause problems. Attached patch removes this restriction. - Inlining the call fails on some heuristics for inlined script length and use counts. Now that we compile off thread I don't think the complexity here is warranted, the attached patch makes simplifications and tweaks to allow this call to be inlined. - There is a type barrier on the call site (one of the targets, IdleTask, is only ever called with a null packet due to invariants in the program beyond TI's purview). This should be fixed by bug 796114, which has had a patch waiting for review the last two months :(

Attachment #695179 - Flags: review?(dvander)

Marco Castelluccio [:marco]

Updated

•

12 years ago

Depends on: 768288

Brian Hackett [Laid off!]

Reporter

Updated

•

12 years ago

Blocks: 824257

Brian Hackett [Laid off!]

Reporter

Comment 1

•

12 years ago

Attached patch partial patch — Details — Splinter Review

Alas, the measurements from the previous patch were with the use count checks totally removed, and that patch gives scores a few hundred points less than was reported due to other call sites not inlining as much as they could. This patch relaxes usesBeforeInlining even further to get the reported scores. It might be worth killing this field entirely, as it is partially redundant with inlineUseCountRatio.

Attachment #695179 - Attachment is obsolete: true

Attachment #695179 - Flags: review?(dvander)

Attachment #695194 - Flags: review?(dvander)

David Anderson [:dvander] - inactive, e-mail if emergency

Comment 2

•

12 years ago

Comment on attachment 695194 [details] [diff] [review] partial patch Review of attachment 695194 [details] [diff] [review]: ----------------------------------------------------------------- Cool, glad to see that restriction removed. re: bug 796114, we couldn't measure any performance improvement at the time, so we decided not to take it. If it will help now that other things are in place, I'll review it.

Attachment #695194 - Flags: review?(dvander) → review+

Hannes Verschore [:h4writer]

Comment 3

•

12 years ago

Please don't commit the "enable inlining loop" part. This will cause a huge regression on earley-boyer. Details are in bug 768288 . Also bug 768288 is about inlining the loops ... I will have a proper patch for that before the end of the week.

Nicolas B. Pierron [:nbp]

Comment 4

•

12 years ago

Hum, I think we want to keep these checks for mono-threaded CPUs, such as mobile phones targeted by b2g. Unless we don't want IonMonkey with B2G at-all ?!

Hannes Verschore [:h4writer]

Comment 5

•

12 years ago

Bug 823884 has landed. Earley-boyer shouldn't cause any problems now when inlining. Now for bug 768288, that bug will introduce one extra heuristic, to not inline functions that invalidate a lot (because the impact will be bigger as the caller will be invalidated too). Now because this bug is about removing these heuristics, I'm not sure, especially because we don't want to decrease performance also on single core computers. Can you make sure this isn't the case? In that case you can just go ahead an push this... because I think it is better than adding more heuristics, like in 768288

Sean Stangl [:sstangl]