Crash in shutdownhang | RtlAnsiStringToUnicodeString | MsgWaitForMultipleObjectsEx | CCliModalLoop::BlockFn
Categories
(Core :: Disability Access APIs, defect, P3)
Tracking
()
| Tracking | Status | |
|---|---|---|
| firefox-esr52 | --- | wontfix |
| firefox-esr60 | --- | wontfix |
| firefox-esr102 | --- | wontfix |
| firefox57 | --- | wontfix |
| firefox58 | --- | wontfix |
| firefox59 | --- | wontfix |
| firefox60 | --- | wontfix |
| firefox61 | --- | wontfix |
| firefox62 | --- | wontfix |
| firefox63 | --- | wontfix |
| firefox64 | --- | wontfix |
| firefox65 | --- | wontfix |
| firefox66 | --- | wontfix |
| firefox67 | - | wontfix |
| firefox68 | + | wontfix |
| firefox69 | --- | wontfix |
| firefox70 | --- | wontfix |
| firefox113 | --- | fixed |
| firefox114 | --- | fixed |
| firefox115 | --- | fixed |
People
(Reporter: davidb, Unassigned)
References
(Depends on 1 open bug, Blocks 1 open bug)
Details
(Keywords: crash, regression, Whiteboard: a11y:crash-win)
Crash Data
| Reporter | ||
Comment 1•8 years ago
|
||
| Reporter | ||
Comment 2•8 years ago
|
||
Comment 3•8 years ago
|
||
Comment 4•8 years ago
|
||
Comment 5•8 years ago
|
||
Comment 6•8 years ago
|
||
Updated•8 years ago
|
Updated•8 years ago
|
Comment 7•8 years ago
|
||
Updated•8 years ago
|
Updated•8 years ago
|
Updated•8 years ago
|
Comment 8•7 years ago
|
||
Comment 9•7 years ago
|
||
Updated•7 years ago
|
Comment 10•7 years ago
|
||
Comment 11•7 years ago
|
||
Comment 12•7 years ago
|
||
Updated•7 years ago
|
Comment 13•7 years ago
|
||
Updated•7 years ago
|
Updated•7 years ago
|
Comment 15•7 years ago
|
||
Comment 16•7 years ago
|
||
Comment 17•7 years ago
|
||
Not going to make 65 at this point, but leaving it set to fix-optional as a possible dot release ride-along should a low-risk patch be available.
Comment 18•7 years ago
|
||
Tracking for 66 to keep an eye on this in beta.
Comment 19•7 years ago
|
||
Changing the priority to p2 as the bug is tracked by a release manager for the current nightly.
See How Do You Triage for more information
Comment 20•7 years ago
|
||
Marking 67 as affected. #6 overall browser crash in 65 release.
Comment 21•7 years ago
|
||
Adding another signature, Windows 10.
| Reporter | ||
Comment 22•7 years ago
|
||
NI Aaron in case he has any quick hunches.
Comment 23•7 years ago
•
|
||
It would be really helpful if we also had dumps from the content process side of things.
By virtue of the fact that we already know which child process we're waiting on (since we're resolving a child id that has a content process id encoded into it), it would be nice if we had some kind of RAII mechanism to say, "Yo, hang reporter, if you get any hangs while I'm on the stack, take a paired minidump with this process, okay?"
Sorry I don't have a better answer atm, but we need more data :-)
Comment 24•7 years ago
|
||
Aaron, do you know who might work on getting us that sort of information? is that something for work on the crash reporter itself?
Comment 25•7 years ago
|
||
(In reply to Liz Henry (:lizzard) (use needinfo) from comment #24)
Aaron, do you know who might work on getting us that sort of information? is that something for work on the crash reporter itself?
I think it probably involves the hang reporter more than anything. IIRC Gabriele has recently worked on the hang monitor.
Gabriele, is my proposal in Comment 23 reasonable? Do you or somebody you know have cycles to implement such an API?
Comment 26•7 years ago
|
||
Yes, we could modify the shutdown terminator with something like that. The only issue that would need to be addressed is that we're currently just calling MOZ_CRASH() to generate the crash report; we would need to issue the action to take a paired minidump manually. It's not a huge deal but I'm swamped with urgent stuff and won't have free cycles before the end of next week.
Comment 27•7 years ago
|
||
(In reply to Gabriele Svelto [:gsvelto] from comment #26)
Yes, we could modify the shutdown terminator with something like that. The only issue that would need to be addressed is that we're currently just calling MOZ_CRASH() to generate the crash report; we would need to issue the action to take a paired minidump manually. It's not a huge deal but I'm swamped with urgent stuff and won't have free cycles before the end of next week.
I filed bug 1527668 for this part.
Comment 28•7 years ago
|
||
Discussed with Gabriele in email; other projects have priority for this week.
Updated•7 years ago
|
Updated•7 years ago
|
| Reporter | ||
Comment 29•7 years ago
|
||
Jamie we need an assignee on tracked bugs.
Comment 30•7 years ago
|
||
Since this needs an owner, I'll "own" this, but it's important to note that right now, it isn't actionable without more data/a reproduceable test case.
Updated•7 years ago
|
Comment 31•7 years ago
|
||
We just had 50+ crashes over the week end on Nightly 67, Jamie is there something that landed recently that could explain this sudden spike on Nightly?
Comment 32•7 years ago
|
||
This is only a hunch, but next gen storage just got re-enabled on Nightly in bug 1517090. It was disabled originally due to a11y hangs; see bug 1516136. The hang was believed to be fixed (and I couldn't reproduce it any more with brief testing), but perhaps there's still an edge case?
Comment 33•7 years ago
|
||
This is now the #2 browser crash on 67 nightly (shutdownhang | NtUserMsgWaitForMultipleObjectsEx | CCliModalLoop::BlockFn).
Comment 34•7 years ago
|
||
The deadlock described in comment 32 was fixed in bug 1534208. However, there seems to be another one: bug 1535221.
It'd be interesting to know whether the spike is caused by builds without the fix for bug 1534208 or whether the spike is still occurring even with that fix.
Comment 35•7 years ago
|
||
(In reply to James Teh [:Jamie] from comment #34)
It'd be interesting to know whether the spike is caused by builds without the fix for bug 1534208 or whether the spike is still occurring even with that fix.
the crash volume went down considerably in builds 20190311215435 & later that have the fix (from 20-50 crashes per build to 1-10 now)
Comment 36•7 years ago
|
||
(In reply to James Teh [:Jamie] from comment #32)
This is only a hunch, but next gen storage just got re-enabled on Nightly in bug 1517090. It was disabled originally due to a11y hangs; see bug 1516136. The hang was believed to be fixed (and I couldn't reproduce it any more with brief testing), but perhaps there's still an edge case?
The volume is very high on beta with about 100 crashes per day, so setting as P1. LSNG is delayed to 68 and will be deactivated in the next beta (67.0b9), which should tell you if the theory is correct.
Comment 37•7 years ago
|
||
Crashes went down by more than an order of magnitude on beta after we deactivated LSNG (310 crashes in beta 8 vs 22 in beta 10) so no need to track it anymore for 67 and marking as fix-optional for the release, but tracking for 68.
Comment 38•7 years ago
|
||
Hi Jamie, looks like this is related to LSNG (which is probably being enabled again for 68). Any updates? (tracking this in release triage for 68). Thanks!
Comment 39•7 years ago
|
||
LSNG issue was fixed in bug 1535221.
Comment 40•7 years ago
|
||
I think we're now back to the original situation here; i.e. not LSNG, thus much lower volume, but without repro or data and thus little hope of a diagnosis or fix. :( See comment 10, comment 23, comment 26, comment 27.
Updated•7 years ago
|
Comment 41•7 years ago
|
||
I might be able to reproduce.
Updated•7 years ago
|
Comment 42•7 years ago
|
||
Martijn: Were you able to reproduce this issue? Thanks.
Comment 43•6 years ago
|
||
wontfix for 68 based on comment 40.
Comment 44•6 years ago
|
||
Also marking this stalled based on comment 40. Happy to take a patch though or otherwise help out.
Updated•6 years ago
|
Updated•6 years ago
|
Updated•6 years ago
|
Updated•5 years ago
|
Comment 46•3 years ago
|
||
Problem with Firefox 101.0: Crash Report [@ shutdownhang | NtUserMsgWaitForMultipleObjectsEx | CCliModalLoop::BlockFn ]
https://crash-stats.mozilla.org/report/index/daf246d5-6b58-4387-b612-5e8340220606#tab-bugzilla
Comment 47•3 years ago
|
||
Based on the topcrash criteria, the crash signatures linked to this bug are not in the topcrash signatures anymore.
For more information, please visit auto_nag documentation.
Updated•3 years ago
|
Updated•3 years ago
|
Comment 48•3 years ago
|
||
This is resolved by Cache the World, which is enabled by default in Firefox 113.
Comment 49•3 years ago
|
||
Since the bug is closed, the stalled keyword is now meaningless.
For more information, please visit BugBot documentation.
Updated•3 years ago
|
Description
•