Crash in shutdownhang | mozilla::net::detail::BlockingIOWatcher::WatchAndCancel
Categories
(Core :: Networking: Cache, defect, P3)
Tracking
()
| Tracking | Status | |
|---|---|---|
| firefox-esr52 | --- | wontfix |
| firefox-esr60 | --- | wontfix |
| firefox-esr102 | --- | wontfix |
| firefox55 | --- | wontfix |
| firefox56 | --- | wontfix |
| firefox57 | --- | wontfix |
| firefox58 | --- | wontfix |
| firefox59 | --- | wontfix |
| firefox66 | --- | wontfix |
| firefox67 | --- | wontfix |
| firefox68 | --- | verified disabled |
| firefox108 | --- | fixed |
People
(Reporter: yoasif, Assigned: valentin)
References
Details
(Keywords: crash, regression, Whiteboard: [necko-triaged] [qa-not-actionable])
Crash Data
| Reporter | ||
Updated•8 years ago
|
| Reporter | ||
Updated•8 years ago
|
Comment 1•8 years ago
|
||
Comment 2•8 years ago
|
||
Updated•8 years ago
|
Updated•8 years ago
|
Comment 3•8 years ago
|
||
| Reporter | ||
Comment 4•8 years ago
|
||
Comment 5•8 years ago
|
||
Comment 6•8 years ago
|
||
Comment 7•6 years ago
|
||
This signature and the one in bug 1435343 are both spiking as of Thursday April 12. That could correlate with the 66.0.3 release or could be unrelated. There are also crashes now showing up in 67 beta 9.
While this still isn't a very high volume crash, I'd like to keep an eye on this and it may warrant investigation.
Comment 8•6 years ago
|
||
the recent crash spike seems to be skewed towards users with german builds - 80% of crash reports have antivirus software from "Avira" showing up in the telemetry environment.
Comment 9•6 years ago
|
||
I left a comment on the Avira support site - let's see if they answer.
Comment 10•6 years ago
|
||
Selena - since this crash volume is now quite high for release and beta, can you help find someone to investigate these crashes? Thanks!
Comment 11•6 years ago
|
||
Selena is out - Dragana, could you take a look at this one
Updated•6 years ago
|
Comment 12•6 years ago
|
||
This is not a new bug. There were always shutdown hangs caused by the cache code.
We are trying to join the IO thread at shutdown that may be blocked doing a sync io. The code where this crash happens is trying to interrupt that blocking IO, which we strongly suspect doesn't really work for these extreme cases when e.g. A/V software is involved. If we remove that interruption code, the crash will not be removed, we will still hang joining the thread.
If we remove the cache2 io thread join (or make it timeout-able) and just let that thread leak in production builds and let the system terminate it, I believe any IO blocking will just shift somewhere else, to some other code like cookies, storage.
I'm author of the cache thread code and the io interruption code, but can't attend this bug sooner than next week. Michal knows it too, he may think of some urgent type of a fix for this.
Comment 13•6 years ago
|
||
I've noticed that in some of the reports cache thread is doing IO which comes from WriteEvent. It's quite strange because we ignore any write operation 2 seconds after shutdown was requested https://searchfox.org/mozilla-central/rev/ec489aa170b6486891cf3625717d6fa12bcd11c1/netwerk/cache2/CacheFileIOManager.cpp#1969
If the call stacks are correct, this means that either we have very short time to finish IO operations when shutting down, or cache thread is blocked on the write event for some reason.
Comment 14•6 years ago
|
||
The write event may be hanging there for quite a long time (a minute or more). And blockade of the cache io thread may cause pages/resources to stop loading (channels will indefinitely wait for cache entries) that may contribute to the reason why users try to close firefox and start it again to fix the problem.
Comment 15•6 years ago
|
||
the recent crash spike from german users is gone again since the beginning of the week, so we can probably downgrade the priority of this bug again.
Updated•6 years ago
|
Updated•6 years ago
|
Updated•4 years ago
|
Updated•3 years ago
|
| Assignee | ||
Comment 17•3 years ago
|
||
I'm removing BlockingIOWatcher::WatchAndCancel in bug 1794376. Should fix these crashes.
| Assignee | ||
Updated•3 years ago
|
| Assignee | ||
Comment 18•3 years ago
|
||
This should have been fixed in bug 1794376.
Setting a reminder to check crash reports.
Comment 19•3 years ago
|
||
2 months ago, Valentin Gosu [:valentin] (he/him) placed a reminder on the bug using the whiteboard tag [reminder-deprecation 2023-01-15] .
kershaw, please refer to the original comment to better understand the reason for the reminder.
Comment 20•3 years ago
|
||
Looks like this is really fixed. I didn't see any crash after 108.
Updated•3 years ago
|
Updated•3 years ago
|
Description
•