1279293 - (IPCError_ShutDownKill) [meta] Crash in [@ IPCError-browser | ShutDownKill]

Reporter

Description

9 years ago

This bug was filed from the Socorro interface and is report bp-3bbe367b-ff88-4040-90de-0567a2160609.
=============================================================
new signature in JSStructuredCloneWriter

User Dderss

Comment 1

9 years ago

I had a huge FF session in Safe Mode and got this crash on exit, too: https://crash-stats.mozilla.com/report/index/d0758d13-d0c8-4ccf-985b-bf1522160615

Jim Jeffery not reading bug-mail 1/2/11

Comment 2

9 years ago

The link at the end of this comment seems to freeze the browser, and I just got a 'ShutDownKill' crash: https://crash-stats.mozilla.com/report/index/cfd92e66-a5e1-44be-b18d-33b982160625

Running Win10 x64 and Win32 Nightly builds.
Mozilla/5.0 (Windows NT 10.0; WOW64; rv:50.0) Gecko/20100101 Firefox/50.0

e10s is 'enabled'. Same thing with e10s 'off': the browser goes not responding.

No idea when this started; I just noticed it today while trying to visit: http://www.mayoclinic.org/diseases-conditions/retrograde-ejaculation/basics/definition/con-20030795

Status: UNCONFIRMED → NEW

Ever confirmed: true

Jim Jeffery not reading bug-mail 1/2/11

Comment 3

9 years ago

Just discovered that it does not hang with 'Tracking Protection' off. In Options->Privacy I had it set to 'Always'; flipping it to 'Never' stops the hang.

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 4

9 years ago

This signature covers a lot of possible issues. It looks like we don't have a bug on it, so this can be the one. Jim, I filed bug 1282580 for the issue you're seeing. The shutdown hang is really a side effect of a different problem (that the site is hanging).

Comment 5

9 years ago

20160629030209
Mozilla/5.0 (Windows NT 6.1; rv:50.0) Gecko/20100101 Firefox/50.0

Nightly 50.0a1 crashed with the same signature as this bug, using the following steps:
1. Load http://html.spec.whatwg.org/
2. Press Ctrl+F to open the Find toolbar
3. Scroll the page up and down
4. Close FF

Expected results: The page should be loaded without crashing.
Actual results: The page is loaded, the Find toolbar is not opened, the page is not scrolled, and after the page is closed Firefox crashes: https://crash-stats.mozilla.com/report/index/e013a326-36b4-4cad-829c-8efc42160630

Andrew McCreight [:mccr8]

Comment 6

9 years ago

This is a generic signature, so different steps to cause a crash from this signature should have separate bugs, blocking this bug.

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 7

9 years ago

We'll need a multifaceted approach here. Comment 5 isn't really even a bug, exactly. If a web page is doing a lot of work and you quit, then the content process is going to be slow to respond (slower than 5 seconds). It's probably reasonable to increase the timeout to 30 seconds.

Priority: -- → P2

Benjamin Smedberg

Comment 8

9 years ago

I thought the plan here was to get rid of this entirely by just killing the content process when we didn't need it any more. We cannot afford to let content block shutdown for significant periods of time (5 seconds is already too much in general).

User Dderss

Comment 9

9 years ago

Can I ask something? I have lots of tabs, so most of the time (e.g. last year) closing or restarting Firefox took forever. Sometimes literally, since you would never see the damn thing go away from the task manager process list no matter how long you waited. In recent months there was a change that finally put a time limit on how long FF can stay in memory after you command it to exit, though it still takes an unacceptably long time.

Correct me if I am wrong, but from what I understand, there were no fundamental changes in how FF operates; what was done was literally the implementation of a time limit after which Firefox/PluginContainer gets killed no matter what. As a result, ever since that change, absolutely every exit/restart I do in FF ends with a crash like this one or with another type of ShutDownKill.

Can someone please explain why it is impossible to redesign things in a way that would allow Firefox to exit/restart ***right away***? Or maybe such a development project is already ongoing within Mozilla quarters, and it is supposed to "land" in FF version 55 or something?

Thanks in advance.

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 10

9 years ago

(In reply to Benjamin Smedberg [:bsmedberg] from comment #8)
> I thought the plan here was to get rid of this entirely by just killing the
> content process when we didn't need it any more. We cannot afford to let
> content block shutdown for significant periods of time (5 seconds is already
> too much in general).

The problem I realized only recently is with "beforeunload", "unload", and "pagehide" events. Currently, Firefox fires them on shutdown. They can do sync XHRs, so they have observable side effects. Sync XHRs are "deprecated" (although no one seems very hopeful that we'll ever be able to remove them), but the beacon API is new and we would need to support that at shutdown as well (currently I'm not sure if that even works).

If we stop firing this stuff at shutdown, we're probably going to break a lot of websites. There's a github issue on the topic [1] that links to a bunch of Chrome usage counters, and it seems like a lot of sites are using sync XHR in unload (something like 0.3% of web pages as far as I understand the data).

I think we could probably make an effort to fire these event listeners but not do any of the other teardown activities associated with destroying a docshell. That might save us a good amount of time.

[1] https://github.com/whatwg/xhr/issues/20#issuecomment-185163375

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 11

9 years ago

Setting needinfo to Benjamin in case you have an opinion or ideas on comment 10.

Flags: needinfo?(benjamin)

Benjamin Smedberg

Comment 12

9 years ago

I assumed that shutdown worked like this (and I'm totally terrible for assuming this):

* the Firefox UI code knows that we're quitting.
** It triggers beforeunload handlers before we've actually decided to quit, so that we can support the returnValue/confirmation UI.
** Then we trigger the unload (and pagehide?) events as part of closing the Firefox window
** This process also collects any final session restore information
* Only after we're finished shutting down the user-visible bits do we trigger the content process to quit
** At this point, the content process shouldn't contain any important user data and in non-leakchecking builds we can just kill it (using TerminateProcess/SIGTERM)
** In leakchecking builds we'd do the full/painful shutdown sequence

So you're saying that we don't trigger some of the unload events until we've actually told the content process to quit? That seems like it might be both a UI regression (in case the beforeunload event has quit confirmation prompts) and might cause weird teardown sequence errors in the Firefox UI.

Flags: needinfo?(benjamin) → needinfo?(wmccloskey)

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 13

9 years ago

Your description is correct as I understand things, except maybe for when we start the shutdown timer. A typical sequence is:

1. We run "beforeunload" events before everything else.
2. Parent does session restore, which does spin the event loop waiting for the child. But at this point nothing has been closed and this is pretty fast.
3. Parent closes all the windows, which closes all tabs, which causes an async message to be sent to the child asking it to tear down the docshell for that tab. Destroying a docshell fires "unload"/"pagehide" and also frees memory for the tab (DOM, frame tree, etc.).
4. Parent ends up in ContentParent::Observe("xpcom-shutdown"), at which time it asks the child to shut down. If the child fails to shut down after 5 seconds, it kills it.

We could introduce more waiting to give the child a chance to finish running its "unload"/"pagehide"/docshell destruction code before we start the 5 second timer in step (4). However, that would defeat the purpose of the timer, since AFAIK step (3) is what takes all the time. When you have 20 tabs and we have to free the memory for all of them as well as handle any sync network requests they make, it can easily take > 5 seconds.

To put it another way, once all the docshells are gone, shutting down the content process is trivial (in opt builds). There's a little bit of message traffic with the parent, but basically the child just calls QuickExit.

If we want to save time here, I think the best we can do is avoid freeing the DOM/frame tree/whatever else. I'd be interested in how other browsers handle this. I Googled for "firefox shutdown slow" and "chrome shutdown slow". There are a lot more results for Firefox.
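
The timing argument in this comment can be illustrated with a toy model. This is only a sketch, not Firefox code: the `Tab` class, the `shutdown_outcome` function, and the per-tab teardown times are hypothetical, and it assumes the child handles docshell teardowns one after another and has nearly all of that work still queued when the shutdown request from step 4 arrives.

```python
from dataclasses import dataclass

SHUTDOWN_TIMEOUT_S = 5.0  # the 5-second limit described in step 4


@dataclass
class Tab:
    # Hypothetical cost of firing unload/pagehide and freeing one docshell.
    teardown_s: float


def shutdown_outcome(tabs, timeout_s=SHUTDOWN_TIMEOUT_S,
                     timer_starts_after_teardown=False):
    """Return "clean exit" or "ShutDownKill" for one content process.

    Assumption: the child answers the shutdown request only after it has
    finished every queued docshell teardown.
    """
    teardown_total = sum(tab.teardown_s for tab in tabs)
    # If the parent waited for teardown before starting the timer, only the
    # final exit would be timed; that exit is modelled as instantaneous.
    timed_work = 0.0 if timer_starts_after_teardown else teardown_total
    return "clean exit" if timed_work <= timeout_s else "ShutDownKill"


few_tabs = [Tab(0.3)] * 3
many_tabs = [Tab(0.3)] * 20

print(shutdown_outcome(few_tabs))
print(shutdown_outcome(many_tabs))
print(shutdown_outcome(many_tabs, timer_starts_after_teardown=True))
```

In this model a session with 20 tabs is killed even though no tab is hung, while starting the timer after teardown never kills anything, which is the sense in which it would defeat the purpose of the timer.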

Flags: needinfo?(wmccloskey)

Liz Henry (:lizzard) (relman/hg->git project)

Comment 14

9 years ago

Tracking this for 50; it seems like a high-volume crash that is good to keep an eye on.

status-firefox50: --- → affected

tracking-firefox50: --- → +

The 8472

Comment 15

9 years ago

> However, that would defeat the purpose of the timer, since AFAIK step (3) is what takes all the time.

How about tearing down tabs one by one and giving each one a separate timeout? That way a hang can still be detected, but the timeout scales with the number of tabs.
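
A rough sketch of the difference between one shared timeout and a per-tab timeout follows. The function names and the teardown times are made up for illustration, and the 5-second per-tab budget is an assumption carried over from the existing timer rather than something proposed in this comment.

```python
def global_timeout_kills(teardown_s, timeout_s=5.0):
    """One budget shared by all tabs, as in the current behaviour."""
    return sum(teardown_s) > timeout_s


def first_hung_tab(teardown_s, per_tab_timeout_s=5.0):
    """Per-tab budget: index of the first tab over its budget, else None."""
    for index, seconds in enumerate(teardown_s):
        if seconds > per_tab_timeout_s:
            return index
    return None


def worst_case_wait_s(tab_count, per_tab_timeout_s=5.0):
    """Upper bound on shutdown time when no tab exceeds its budget."""
    return tab_count * per_tab_timeout_s


busy_but_healthy = [0.5] * 20                    # 10 s in total, no hang
one_hung_tab = [0.5] * 7 + [60.0] + [0.5] * 12   # tab 7 never finishes in time

print(global_timeout_kills(busy_but_healthy))  # True: killed without a hang
print(first_hung_tab(busy_but_healthy))        # None: nothing to kill
print(first_hung_tab(one_hung_tab))            # 7
print(worst_case_wait_s(20))                   # 100.0
```

The last line shows the cost of the idea: with 20 tabs the worst-case wait grows to 100 seconds, which runs against the concern in comment 8 that even 5 seconds is too long.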

Calixte Denizet (:calixte)

Comment 16

9 years ago

Crash volume for signature 'IPCError-browser | ShutDownKill':
- aurora (49): 69120
- beta (48): 805
- release (47): 702
- esr (45): 14

Affected platforms: Windows, Mac OS X, Linux

status-firefox47: --- → affected

status-firefox48: --- → affected

status-firefox49: --- → affected

status-firefox-esr45: --- → affected

David Bolter [:davidb] (NeedInfo me for attention)

Comment 17

9 years ago

(In reply to Bill McCloskey (:billm) from comment #13)
> If we want to save time here, I think the best we can do is avoid freeing
> the DOM/frame tree/whatever else. I'd be interested in how other browsers
> handle this. I Googled for "firefox shutdown slow" and "chrome shutdown
> slow". There are a lot more results for Firefox.

Andrew, do you know who might be able to chase this?

Flags: needinfo?(overholt)

Nika Layzell [:nika] (ooo, ni? for response)

Comment 18

9 years ago

Would the idea here be that, instead of closing all of the windows, we fire "beforeunload", do the SessionStore work, fire "unload" and "pagehide" etc., and then just kill the child process outright, without performing any of the usual cleanup? (I presume that the child process would QuickExit() itself)

Flags: needinfo?(wmccloskey)

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 19

9 years ago

(In reply to Michael Layzell [:mystor] from comment #18)
> Would the idea with this be to, instead of closing all of the windows,
> trigger the firing of these "beforeunload", doing SessionStore stuff,
> "unload", and "pagehide" etc. events, and then just kill the child process
> outright, without performing any of the usual cleanup? (I presume that the
> child process would QuickExit() itself)

Yes. We already avoid application-level cleanup (e.g., XPCOM shutdown) by calling QuickExit. We additionally would like to avoid any docshell-level cleanup we're doing now. Probably the first step, though, is to see how expensive that cleanup is.

Flags: needinfo?(wmccloskey)

hung-Nightly.png (5 years ago, alex_mayorga, 17.34 KB, image/png)
sample_spindump.zip (4 years ago, Sam Johnson, 431.08 KB, application/zip)

Rank	IPC shutdown state	#	%
1	SendFinishShutdown (sent)	292	17.70
2	ShutdownInternal entry	6	0.36
3	content-child-shutdown started	2	0.12