772329 - Optimize nested property fetches where possible

Reporter

Description

•

12 years ago

V8 has a useful optimization for nested property fetches, like:

X.Y.Z.W

I'm not entirely sure how it's implemented, but in cases where the properties do not change frequently, the cost of the deeply nested fetch is nearly equivalent to the cost of simply fetching X. This is really really helpful for generated code that uses nesting to reproduce object models (like basically every line of code my translator JSIL generates).

In the case where the JS is being generated by a translator, it could probably locate nested fetches like the above and cache them in advance, but that will eat up valuable local variable slots and potentially incur additional overhead beyond that (for example, having to pre-fetch properties that are only used in certain branched code paths).

Here is a simple test case:
http://jsperf.com/deeply-nested-property-fetches/2

This is a likely contributor to terrible performance for all JSIL test cases in FF (hard to know for sure until JS profiling lands).

Luke Wagner [:luke]

Comment 1

•

12 years ago

That looks like LICM (loop invariant code motion) which will turn

  for (...)
     X.Y.Z.W

into

  t = X.Y.Z
  for (...)
     t.Z

What do you see on IM (which has LICM)?

Katelyn Gadd (:kael)

Reporter

Comment 2

•

12 years ago

What flags would I use to force-enable IM? I tried this in the latest Nightly and it's still very slow there. Is there a set of flags I could pass to a modern js.exe build to test out IM's implementation of LICM to see if it's faster?

Katelyn Gadd (:kael)

Reporter

Comment 3

•

12 years ago

Since my test case actually would have been optimized by *either* really smart LICM or optimized nested fetches, I tried to add a second set of cases that is less likely to get optimized by LICM (I think it would need a combination of inlining and smart LICM, which is much less likely):

http://jsperf.com/deeply-nested-property-fetches/3

It is still blazing fast in Chrome. Unfortunately, it's still possible that this is just really awesome LICM - as long as the benchmark calls a pure function in a loop, I suppose it's possible for them to optimize everything down to 'if (constant !== constant)', which is basically instantaneous. Any suggestions on how to make this test less likely to be inaccurate would be great.

I've seen claims that V8 optimizes for the case of chained property fetches like this, specifically because they show up really often in JS code that's namespaced/abstracted. But I don't know offhand how you would optimize for it in the function call case (where the nested fetch only occurs say, once, in a given scope) - maybe you have a native trampoline for doing chained fetches instead of calling GetProp repeatedly on its own result, and that gets you most of the win? It may also be that they optimize for the best case, where the entire chained fetch is successful, and as a result end up deoptimized horribly in the failure case but it doesn't matter because it's rare?

URL: http://jsperf.com/deeply-nested-prope... → http://jsperf.com/deeply-nested-prope...

Luke Wagner [:luke]

Comment 4

•

12 years ago

(In reply to Kevin Gadd (:kael) from comment #3)
Chrome (and IM) do inlining early in the pipeline so this still looks like LICM.

To try out IM, check out http://hg.mozilla.org/projects/ionmonkey.  dvander: is IM on by default in content in the browser?

David Mandelin [:dmandelin]

Updated

•

12 years ago

Blocks: WebJSPerf

Whiteboard: [js:t]

John Drinkwater (:beta)

Comment 5

•

12 years ago

(In reply to Kevin Gadd (:kael) from comment #3)
> http://jsperf.com/deeply-nested-property-fetches/3

Noticed this was much slower in FF 18 and assumed it was IM, but FF 17 has a similar regression…

Till Schneidereit [:till]

Updated

•

11 years ago

Blocks: 885526

Katelyn Gadd (:kael)

Reporter

Updated

•

11 years ago

Blocks: JSIL

Hannes Verschore [:h4writer]

Comment 6

•

11 years ago

Just checked and v8 is just doing LICM here and that's why the times are the same.

We fail to LICM it, because our current heurstic decide that the "new Error()" could potentially override the values of X.Y.Z.W. So we should reload the value every time. This is caused due to our Alias Analysis not being aware of the graph structure.

There are two possible solutions:
1) bug 844779: Improve our rpo graph. That way AA won't see the the contents of the "if" as part of the loop.
2) Teach Alias Analysis about graph structure.

(Applying the patch from bug 844779 improves score to 110ms, as fast as d8)

Hannes Verschore [:h4writer]

Updated

•

11 years ago

Depends on: 844779

Nobody; OK to take it and work on it

Assignee

Updated

•

10 years ago

Assignee: general → nobody

BMO Automation

Updated

•

2 years ago

Severity: normal → S3

Mayank Bansal

Comment 7

•

1 month ago

the link to jsperf does not work anymore.
Bryan, should this bug be closed?

Flags: needinfo?(bthrall)

Iain Ireland [:iain]

Comment 8

•

1 month ago

Reading through the comments:

The title is misleading, since it's actually just a question of having sufficiently precise alias analysis to enable LICM.
It looks like our alias analysis wasn't flow-sensitive when this was opened, but comment 6 implies that improving our AA fixed the perf gap.

Should be safe to close this.

Status: NEW → RESOLVED

Closed: 1 month ago

Flags: needinfo?(bthrall)

Resolution: --- → WORKSFORME

Mayank Bansal

Comment 9

•

1 month ago

FWIW, flow sensitive alias analysis was implemented in bug 1255008, but was never enabled. It was removed in bug 1455280

Bugzilla

Quick Search

Optimize nested property fetches where possible

Categories

(Core :: JavaScript Engine, defect)

Tracking

()

People

(Reporter: kael, Unassigned)

References

(Blocks 1 open bug,
URL
)

Details

(Whiteboard: [js:t])

Crash Data

Security

(public)

User Story

Description

Comment 1

Comment 2

Comment 3

Comment 4

Updated

Comment 5

Updated

Updated

Comment 6

Updated

Updated

Updated

Comment 7

Comment 8

Comment 9