1191061 - OdinMonkey OOMs on makethingsnow/minecraft

Reporter

Description

9 years ago

makethingsnow.com/minecraft/

The site works when OdinMonkey is disabled, and in Chrome. In Firefox with OdinMonkey enabled, memory usage very quickly jumps by a few GB (far more than it uses with OdinMonkey off), and my machine runs out of memory.

Alon Zakai (:azakai)

Reporter

Updated

9 years ago

URL: https://makethingsnow.com/minecraft/

Luke Wagner [:luke]

Comment 1

9 years ago

That would probably be the function Sf, which is 0.7 MB (1/3 of the file), contains ~2300 local variables, and took about 91s to compile on my machine (it did complete successfully).  Once compilation finished, though, the game ran much smoother (when fully zoomed out) on FF than on Chrome.

The baseline compiler (bug 1169167) should allow us to mitigate the problem.  In addition to producing baseline code fast, the background Ion compilation can monitor LifoAlloc memory usage and, past a threshold, fall back to a baseline compilation of just that function.  This would hurt performance of the final code, though.  It'd be nice to look into optimizations to mitigate the memory usage in these many local slots x huge function cases.
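A rough sketch of the threshold-and-fallback idea, for illustration only. Every name here (`compileWithFallback`, `track`, the toy compilers, the 512 MB budget) is hypothetical and not taken from SpiderMonkey; the real mechanism would watch LifoAlloc usage inside the background Ion compilation.

```javascript
// Hypothetical sketch: none of these names are SpiderMonkey APIs.
class MemoryBudgetExceeded extends Error {}

// Try the optimizing compiler first; if it reports more allocation than
// budgetBytes, abandon it and compile the same function with the baseline tier.
function compileWithFallback(func, { optimizing, baseline, budgetBytes }) {
  let usedBytes = 0;
  const track = (bytes) => {
    usedBytes += bytes;
    if (usedBytes > budgetBytes) {
      throw new MemoryBudgetExceeded(`${usedBytes} > ${budgetBytes} bytes`);
    }
  };
  try {
    return { tier: "optimizing", code: optimizing(func, track) };
  } catch (err) {
    if (!(err instanceof MemoryBudgetExceeded)) throw err;
    return { tier: "baseline", code: baseline(func) };
  }
}

// Toy compilers (assumption): the optimizing one "allocates" one 200-byte
// node per local per loop; the baseline one allocates nothing worth tracking.
const toy = {
  optimizing: (f, track) => {
    track(f.locals * f.loops * 200);
    return `opt(${f.name})`;
  },
  baseline: (f) => `base(${f.name})`,
  budgetBytes: 512 * 1024 * 1024,
};

const small = compileWithFallback({ name: "f", locals: 10, loops: 3 }, toy);
const huge = compileWithFallback({ name: "Sf", locals: 2300, loops: 5000 }, toy);
console.log(small.tier, huge.tier);
```

Only the function that blows the budget drops to the slower tier; everything else still gets optimized code.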

Depends on: 1169167

Jan de Mooij [:jandem]

Comment 2

9 years ago

(In reply to Luke Wagner (PTO) [:luke] from comment #1)
> It'd be nice to look into optimizations to mitigate
> the memory usage in these many local slots x huge function cases.

Yeah, I think it'd be interesting to find out where these GBs go. I'll look into this today; there might be some easy wins that'd also be nice for mobile.

Jan de Mooij [:jandem]

Comment 3

9 years ago

On OS X 64-bit, we have the following LifoAlloc sizes when compiling the big function:

After generating MIR: 4822 MB
After optimizing MIR: 5021 MB
After generating LIR: 5035 MB
After regalloc:       5648 MB
After codegen:        5648 MB

Regalloc uses 600 MB but other than that the backend seems pretty memory-efficient. I'll find out what the initial 4.8 GB is.
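For reference, the per-phase growth can be read off those totals by subtracting each one from the next. A small sketch of that arithmetic, assuming the sizes are cumulative and starting the first delta from zero (so the first entry also includes anything allocated before MIR generation):

```javascript
// LifoAlloc sizes (MB) as reported in this comment, in pipeline order.
const phases = [
  ["generating MIR", 4822],
  ["optimizing MIR", 5021],
  ["generating LIR", 5035],
  ["regalloc", 5648],
  ["codegen", 5648],
];

// Growth attributable to a phase = size after it minus size after the previous one.
function phaseDeltas(rows) {
  let previousMB = 0;
  return rows.map(([name, totalMB]) => {
    const deltaMB = totalMB - previousMB;
    previousMB = totalMB;
    return { name, deltaMB };
  });
}

const deltas = phaseDeltas(phases);
for (const { name, deltaMB } of deltas) {
  console.log(`${name}: +${deltaMB} MB`);
}
```

The regalloc step comes out at +613 MB, which is the "600 MB" figure; everything after MIR generation adds well under 1 GB in total.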

Jan de Mooij [:jandem]

Comment 4

9 years ago

(In reply to Jan de Mooij [:jandem] from comment #3)
> I'll find out what the initial 4.8 GB is.

I think most of this is phis... With > 2300 phis and sizeof(MPhi) a bit more than 200 bytes, that's 500 KB per basic block. Not sure but it seems we have 5082 basic blocks, so just the phis take > 2 GB.
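Spelling out that estimate, using the rounded figures above (2300 phis, 200 bytes each, 5082 blocks). Since the real values are "> 2300" and "a bit more than 200", treat the result as a lower bound:

```javascript
// Rounded inputs from this comment; the actual values are slightly larger.
const PHIS_PER_BLOCK = 2300; // one phi per local slot
const SIZEOF_MPHI = 200;     // bytes
const NUM_BLOCKS = 5082;     // blocks that carry a full set of phis

const bytesPerBlock = PHIS_PER_BLOCK * SIZEOF_MPHI;
const totalBytes = bytesPerBlock * NUM_BLOCKS;

console.log(`${bytesPerBlock / 1000} KB per block`);
console.log(`${(totalBytes / 1e9).toFixed(2)} GB for phis alone`);
```

That is 460 KB per block (the "500 KB" above, rounded) and roughly 2.3 GB overall.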

Luke Wagner [:luke]

Comment 5

9 years ago

That matches what I've seen before when looking into these mega-memory cases.  IIRC, most of these are being introduced for loops (since we pessimistically insert phis for all local slots before entering the loop body, and then only at the end drop useless phis).  What if, at the end of the loop, instead of leaving the useless phis dead, we added them to some free list of phis that was reused?  That is, I'm guessing that 2 GB is mostly full of dead phis.
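A toy model of the free-list idea, to show how much it could save. This is a sketch under stated assumptions, not SpiderMonkey code: `Phi`, `PhiFreeList`, and `simulate` are hypothetical names, loops are assumed to be processed one after another (no nesting), and each loop is assumed to keep only a handful of its phis.

```javascript
// Hypothetical stand-in for a phi node; not SpiderMonkey's MPhi.
class Phi {
  constructor(slot) {
    this.slot = slot;
  }
}

// Free list: released phis are handed out again before allocating new ones.
class PhiFreeList {
  constructor() {
    this.free = [];
    this.freshAllocations = 0;
  }
  allocate(slot) {
    const recycled = this.free.pop();
    if (recycled !== undefined) {
      recycled.slot = slot;
      return recycled;
    }
    this.freshAllocations++;
    return new Phi(slot);
  }
  release(phi) {
    this.free.push(phi);
  }
}

// Each loop pessimistically creates one phi per local, then releases all but
// the first `usedPerLoop` of them (assumed to be the only slots the body writes).
function simulate(numLocals, numLoops, usedPerLoop) {
  const pool = new PhiFreeList();
  for (let loop = 0; loop < numLoops; loop++) {
    const phis = [];
    for (let slot = 0; slot < numLocals; slot++) {
      phis.push(pool.allocate(slot));
    }
    for (let slot = usedPerLoop; slot < numLocals; slot++) {
      pool.release(phis[slot]);
    }
  }
  return pool.freshAllocations;
}

const withReuse = simulate(2300, 100, 3);
const withoutReuse = 2300 * 100;
console.log(withReuse, withoutReuse);
```

Under these assumptions, fresh allocations grow with the number of phis actually kept rather than with locals x loops. Nested loops would hold several sets of pending phis at once, so the real saving would be smaller.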

Alon Zakai (:azakai)

Reporter

Comment 6

9 years ago

(In reply to Jan de Mooij [:jandem] from comment #4)
> (In reply to Jan de Mooij [:jandem] from comment #3)
> > I'll find out what the initial 4.8 GB is.
> 
> I think most of this is phis... With > 2300 phis and sizeof(MPhi) a bit more
> than 200 bytes, that's 500 KB per basic block. Not sure but it seems we have
> 5082 basic blocks, so just the phis take > 2 GB.

Sorry if this is a silly question, but does that mean that memory usage is numPhis*numBasicBlocks*sizeofPhi? In other words, adding one phi adds memory proportional to the number of basic blocks in the function?

Whiteboard: [MemShrink]

Luke Wagner [:luke]

Comment 7

9 years ago

Almost, it's:
 (numLocalVars * numLoops * sizeofPhi) + (numberOfActualPhisNeededForNonLoops * sizeofPhi)

Alon Zakai (:azakai)

Reporter

Comment 8

9 years ago

I see, thanks. Then plugging in the numbers, this implies that to reach 2GB we probably need either

1. Around 5,000 loops (to get the first expression to the right range), or
2. Around 10 million numberOfActualPhisNeededForNonLoops (to get the second)

Both seem surprising?
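The two thresholds follow from solving each term of the formula for 2 GB separately. A quick check, assuming the rounded figures from earlier in the thread (2300 locals, 200-byte phis) and taking 2 GB as 2e9 bytes:

```javascript
// Rounded inputs quoted earlier in the thread.
const NUM_LOCAL_VARS = 2300;
const SIZEOF_PHI = 200;   // bytes
const TARGET_BYTES = 2e9; // "2 GB", decimal

// The estimate from comment 7: loop phis plus non-loop phis.
function phiBytes(numLoops, numNonLoopPhis) {
  return NUM_LOCAL_VARS * numLoops * SIZEOF_PHI + numNonLoopPhis * SIZEOF_PHI;
}

// Solve each term on its own for the target.
const loopsNeeded = Math.ceil(TARGET_BYTES / (NUM_LOCAL_VARS * SIZEOF_PHI));
const nonLoopPhisNeeded = Math.ceil(TARGET_BYTES / SIZEOF_PHI);

console.log(loopsNeeded, nonLoopPhisNeeded);
```

That gives about 4,350 loops for the first term alone, or exactly 10 million phis for the second, which matches the rough figures in the list above.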

Jan de Mooij [:jandem]

Comment 9

9 years ago

(In reply to Alon Zakai (:azakai) from comment #8)
> 1. Around 5,000 loops (to get the first expression to the right range), or

I double checked and there are indeed 5082 "pending loop header" blocks, out of ~28312 blocks... Looking at the asm.js code, there are a *ton* of loops like this one:

do {
    a[qs >> 0] = a[js >> 0] | 0;
    qs = qs + 1 | 0;
    js = js + 1 | 0
} while ((qs | 0) < (rs | 0));
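For anyone who wants to run that pattern in isolation, here is a self-contained version. Assumptions: `a` is taken to be an `Int8Array` view of the heap (the `>> 0` index suggests byte access), and the wrapper `copyBytes` with its parameter names is made up for the example.

```javascript
// Assumption: `a` is a byte view of the asm.js heap.
const a = new Int8Array(16);
a.set([1, 2, 3, 4], 0);

// Hypothetical wrapper around the loop: copies bytes starting at `src`
// into [dst, end). Like any do/while, it always copies at least one byte.
function copyBytes(dst, src, end) {
  let qs = dst | 0;
  let js = src | 0;
  const rs = end | 0;
  do {
    a[qs >> 0] = a[js >> 0] | 0;
    qs = qs + 1 | 0;
    js = js + 1 | 0;
  } while ((qs | 0) < (rs | 0));
}

copyBytes(8, 0, 12);
console.log(Array.from(a.subarray(8, 12)));
```

The body writes only two locals, `qs` and `js`. If the enclosing function has ~2300 locals, a loop header that starts with a phi for every slot creates ~2300 of them even though, for a loop like this, only about two are needed.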

Alon Zakai (:azakai)

Reporter

Comment 10

9 years ago

Wow, thanks. I'll pass that along to the project, maybe they can inline less or something like that.

Nicholas Nethercote [inactive]

Comment 11

9 years ago

Is this something to fix on our side, or on their side?

Alon Zakai (:azakai)

Reporter

Comment 12

9 years ago

Probably more on their side. Also, the site now works fine on nightly, so they may have already done some optimizing.

Status: NEW → RESOLVED

Closed: 9 years ago

Resolution: --- → INVALID

Categories

(Core :: JavaScript Engine: JIT, defect)

People

(Reporter: azakai, Unassigned)
