Closed
Bug 1242802
Opened 8 years ago
Closed 8 years ago
NSS crash in _PR_CleanupThread on shutdown
Categories
(Firefox Build System :: General, defect)
Tracking
(firefox45 unaffected, firefox46 unaffected, firefox47+ verified)
VERIFIED
FIXED
Tracking | Status | |
---|---|---|
firefox45 | --- | unaffected |
firefox46 | --- | unaffected |
firefox47 | + | verified |
People
(Reporter: streetwolf52, Assigned: ted)
References
Details
(4 keywords)
Crash Data
Attachments
(1 file)
198.53 KB,
text/plain
|
Details |
User Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:46.0) Gecko/20100101 Firefox/46.0 Build ID: 20160125073110 Steps to reproduce: Go to some web sites then exit Fx46 Actual results: Fx46 crashed Expected results: Fx46 should have crashed.
Reporter | ||
Comment 1•8 years ago
|
||
Unfortunately my crash dumps have no debugging info as I'm running from the inbounds. I do have a regression range: Bad - https://hg.mozilla.org/integration/mozilla-inbound/rev/1b7625c90538a95413f7ca1910f4ec791eff82b5 Good - https://hg.mozilla.org/integration/mozilla-inbound/rev/5edf01b87f78b580d01dcd2aa1756d46dcede672 **** ER should be "shouldn't have crashed."
OS: Unspecified → Windows 10
Hardware: Unspecified → x86_64
Reporter | ||
Comment 2•8 years ago
|
||
Here's my crash dump: https://crash-stats.mozilla.com/report/index/2fe4c54f-0035-4635-acaf-fd7952160126 Here's the build where the problem started: http://archive.mozilla.org/pub/firefox/tinderbox-builds/mozilla-inbound-win64-pgo/1453746694/
Reporter | ||
Updated•8 years ago
|
Product: Firefox → Core
Reporter | ||
Comment 3•8 years ago
|
||
Forgot to mention that the crash happens on a new profile.
Reporter | ||
Updated•8 years ago
|
Summary: Fx46 using inbound crashes when I exit. → Fx46 from inbound crashes when I exit.
Reporter | ||
Updated•8 years ago
|
Keywords: regressionwindow-wanted
Comment 4•8 years ago
|
||
Reproduced the crash on Windows7 (m-i tinderbox build) as well: https://hg.mozilla.org/integration/mozilla-inbound/rev/99bdd3287bcf9ecf974c6f68ba3ba15e6fc17937 Mozilla/5.0 (Windows NT 6.1; WOW64; rv:46.0) Gecko/20100101 Firefox/46.0 ID:20160125083827 Pushlog: https://hg.mozilla.org/integration/mozilla-inbound/pushloghtml?fromchange=fd1b82f7fbeeb92f6dddcccdb378126973b06a38&tochange=99bdd3287bcf9ecf974c6f68ba3ba15e6fc17937
Severity: normal → critical
Flags: needinfo?(ted)
Flags: needinfo?(mh+mozilla)
Flags: needinfo?(gps)
Keywords: reproducible
Comment 5•8 years ago
|
||
Wait. This landed before the merge to aurora? huh, I'd rather have had that bake on nightly for a while.
Flags: needinfo?(mh+mozilla)
Comment 6•8 years ago
|
||
It didn't, I was confused by the version number.
Comment 7•8 years ago
|
||
And we're lacking symbols on nss3.dll :-/ This is definitely something for ted.
Flags: needinfo?(gps)
Assignee | ||
Comment 8•8 years ago
|
||
(In reply to Mike Hommey [:glandium] from comment #7) > And we're lacking symbols on nss3.dll :-/ > > This is definitely something for ted. These are inbound builds, we don't upload symbols for them.
Assignee | ||
Comment 9•8 years ago
|
||
(In reply to Gary [:streetwolf] from comment #2) > Here's my crash dump: > https://crash-stats.mozilla.com/report/index/2fe4c54f-0035-4635-acaf- > fd7952160126 > > Here's the build where the problem started: > http://archive.mozilla.org/pub/firefox/tinderbox-builds/mozilla-inbound- > win64-pgo/1453746694/ I can't reproduce a crash on shutdown running this build on my local Win10 machine. I'm using a fresh testing profile. Are you doing anything specific that causes a crash?
Flags: needinfo?(ted)
Assignee | ||
Comment 10•8 years ago
|
||
Here's what the stack for this crash looks like: https://crash-stats.mozilla.com/report/index/2fe4c54f-0035-4635-acaf-fd7952160126
Assignee | ||
Comment 11•8 years ago
|
||
It's crashing here: https://dxr.mozilla.org/mozilla-central/rev/aa90f482e16db77cdb7dea84564ea1cbd8f7f6b3/nsprpub/pr/src/threads/prtpd.c#237 trying to call a thread private destructor, it looks like there's a bad entry in that list?
Assignee | ||
Comment 12•8 years ago
|
||
Thanks so much for finding the regression window on this! If one of you could narrow down some reliable STR that would be super helpful.
Keywords: regressionwindow-wanted → steps-wanted
Reporter | ||
Comment 13•8 years ago
|
||
I seem to crash very often by going to this site: http://www.softexia.com/ and then exiting Fx. You might have to click on a few links to get it to crash. If you still can't crash try creating some bookmarks and go to the site by clicking on them. I mostly use the bookmarks toolbar but have crashed using other methods. I also crash in safe-mode.
Reporter | ||
Comment 14•8 years ago
|
||
Keep in mind Comment 4. Crash was under Windows 7 x86.
Reporter | ||
Comment 15•8 years ago
|
||
Might be Windows x64
Reporter | ||
Comment 16•8 years ago
|
||
Ted... I'll test a try build if you come up with one.
Assignee | ||
Comment 17•8 years ago
|
||
Thanks! I was able to reproduce it by loading that site.
Keywords: steps-wanted
Assignee | ||
Comment 18•8 years ago
|
||
I reproduced it in a local debug build. Still looking.
Reporter | ||
Comment 19•8 years ago
|
||
If it helps this site doesn't produce a crash on exiting: http://www.onlinetextmessage.com/verizon-text-message.php
Reporter | ||
Comment 20•8 years ago
|
||
(In reply to Gary [:streetwolf] from comment #19) > If it helps this site doesn't produce a crash on exiting: > http://www.onlinetextmessage.com/verizon-text-message.php It appears that when I use my bookmark to get to this site on my bookmarks toolbar I don't crash. However when I click on the link I gave you above it does.
Assignee | ||
Comment 21•8 years ago
|
||
I have enough info to reproduce the crash in a debugger now, I just need to figure out what's going on. Thanks again for all your help!
Reporter | ||
Comment 22•8 years ago
|
||
Just thought things out and the reason I crash on the link above is that I had to get to this site first before I clicked on the link above.
Assignee | ||
Comment 23•8 years ago
|
||
Okay, I don't 100% know what's happening, but it's something like: * Some NSS code from nssckbi.dll calls PR_NewThreadPrivateIndex, registers a destructor * During shutdown we unload nssckbi.dll * We shut down a thread which winds up calling the destructor from a DLL we already unloaded
Assignee | ||
Comment 24•8 years ago
|
||
This is the destructor registration: https://dxr.mozilla.org/mozilla-central/rev/aa90f482e16db77cdb7dea84564ea1cbd8f7f6b3/security/nss/lib/base/error.c#67 I'm guessing the bug here is either that we're unloading NSS earlier than we should, or that PR_Free is getting inlined in error.c, so it's trying to call a destructor from a library we already unloaded.
Comment 25•8 years ago
|
||
bp-03a7f24d-2cb4-4ae2-8cfe-8b31b2160126
Crash Signature: [@ _PR_CleanupThread | _PR_NativeRunThread | pr_root]
Comment 26•8 years ago
|
||
Same crash report as Alice https://crash-stats.mozilla.com/report/index/b83af0f0-0dba-4195-9b48-b14372160126 Nightly win32 m-c build Crashing on close of browser. Setting to NEW
Status: UNCONFIRMED → NEW
Ever confirmed: true
Comment 27•8 years ago
|
||
Nightly 47 is also crashing for me with a slightly different signature (it happens when closing Firefox using the Close button). [@ PR_DestroyThreadPrivate | PR_CleanupThread | PR_NativeRunThread | pr_root ] Here are some more reports: https://crash-stats.mozilla.com/report/index/31f167bf-bcc7-4b4e-a10e-c00082160126 https://crash-stats.mozilla.com/report/index/7222143b-1998-4752-b0d3-fb0c12160126
Comment 28•8 years ago
|
||
https://crash-stats.mozilla.com/report/index/dba7947b-b0c4-4228-a743-549492160126
status-firefox46:
--- → affected
status-firefox47:
--- → affected
tracking-firefox46:
--- → ?
tracking-firefox47:
--- → ?
Keywords: regression
Assignee | ||
Comment 29•8 years ago
|
||
I'm 99% confident this is fallout from bug 1237863. That patch changed the linkage on some NSPR functions, and so nssckbi.dll winds up inlining PR_Free, but we unload that DLL during shutdown before we terminate all our threads and so the destructor it registers gets called after it has been unloaded. In local testing backing this patch out seems to fix the crash, I'll push to try just for sanity and then likely land the backout soon. Try push: https://treeherder.mozilla.org/#/jobs?repo=try&revision=6380ab360cfa
Blocks: 1237863
Assignee | ||
Updated•8 years ago
|
Assignee: nobody → ted
Updated•8 years ago
|
Summary: Fx46 from inbound crashes when I exit. → NSS crash in _PR_CleanupThread on shutdown
Comment 30•8 years ago
|
||
The crash is somehow related to the "cache2". Removing permissions to write into "cache2/entries" folder or creating that folder "fixes" the issue. Win7x64, Fx47x64 Nightly.
Comment 31•8 years ago
|
||
(In reply to Ted Mielczarek [:ted.mielczarek] from comment #29) > I'm 99% confident this is fallout from bug 1237863. That patch changed the > linkage on some NSPR functions, and so nssckbi.dll winds up inlining > PR_Free, but we unload that DLL during shutdown before we terminate all our > threads and so the destructor it registers gets called after it has been > unloaded. > > In local testing backing this patch out seems to fix the crash, I'll push to > try just for sanity and then likely land the backout soon. > > Try push: > https://treeherder.mozilla.org/#/jobs?repo=try&revision=6380ab360cfa Just tested this and the shut-down crash is gone. win7 x64
Reporter | ||
Comment 32•8 years ago
|
||
(In reply to Aris from comment #30) > The crash is somehow related to the "cache2". Removing permissions to write > into "cache2/entries" folder or creating that folder "fixes" the issue. > > Win7x64, Fx47x64 Nightly. While deleting cache2 works the first time I exit, any subsequent exits produce the crash.
Reporter | ||
Comment 33•8 years ago
|
||
I also confirm that https://treeherder.mozilla.org/#/jobs?repo=try&revision=6380ab360cfa fixes the problem.
Assignee | ||
Comment 34•8 years ago
|
||
Thanks for testing, I'll back that patch out ASAP.
Assignee | ||
Comment 35•8 years ago
|
||
Backed out: https://hg.mozilla.org/integration/mozilla-inbound/rev/e265e7992928c9ca7bacfd8bfca1929e974b2467 Should be fixed whenever that merges.
Crash Signature: [@ _PR_CleanupThread | _PR_NativeRunThread | pr_root] → [@ _PR_CleanupThread | _PR_NativeRunThread | pr_root]
[@ PR_DestroyThreadPrivate | PR_CleanupThread | PR_NativeRunThread | pr_root ]
Comment 38•8 years ago
|
||
I'm having this too: nightly 47 windows 7 sp1 x64: https://crash-stats.mozilla.com/report/index/c1be7036-7fb8-49bc-b040-dd6af2160126
Updated•8 years ago
|
status-firefox45:
--- → unaffected
Comment 42•8 years ago
|
||
Just downloaded latest nightly and not seeing any crashes yet.
Comment 43•8 years ago
|
||
I've also confirmed that the latest Nightly does *not* crash on its shutdown. Nightly(2016-01-27) on x86_32 Windows 7
Assignee | ||
Comment 44•8 years ago
|
||
This got merged to central: https://hg.mozilla.org/mozilla-central/rev/e265e7992928c9ca7bacfd8bfca1929e974b2467 Thanks for all the testing and verification, folks! Sorry for the inconvenience.
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → FIXED
Comment 46•8 years ago
|
||
Nightly 47.0a1 updated it and crash with signature _PR_CleanupThread | _PR_NativeRunThread | pr_root https://crash-stats.mozilla.com/report/index/abc72302-b05a-4b01-9fc6-d00492160127
Updated•8 years ago
|
Keywords: crashreportid
Updated•8 years ago
|
Component: Untriaged → Build Config
Comment 47•8 years ago
|
||
Looks like this affect 47 but maybe not 46. Tracking for 47 since this is a regression.
Comment 48•8 years ago
|
||
So this is fixed for 47, right? If yes, can we please set the right status flags? Ted?
Flags: needinfo?(ted)
Updated•8 years ago
|
Status: RESOLVED → VERIFIED
Comment 49•8 years ago
|
||
Yes Nightly 47.0a1
Assignee | ||
Comment 51•8 years ago
|
||
I don't believe bug 1237863 made it to 46 (the target milestone indicates it didn't), so this shouldn't be an issue there.
Comment 52•8 years ago
|
||
I’m still seeing this issue on Firefox 47.0a1. Shouldn’t be fixed and verified? I’ve encountered this crash under Windows 10 64-bit under a Dell Xps 12. This is my crash report: bp-3453c143-8011-4861-90d9-482652160203 And please take a look also on reports list https://crash-stats.mozilla.com/report/list?product=Firefox&signature=_PR_CleanupThread+|+_PR_NativeRunThread+|+pr_root#tab-reports
Flags: needinfo?(bernesb)
Comment 53•8 years ago
|
||
Sorry, just noticed that I had a build from 2016-01-26. I confirm that this bug is fixed on latest Firefox 47.0a1 (2016-02-03).
Flags: needinfo?(bernesb)
Updated•6 years ago
|
Product: Core → Firefox Build System
You need to log in
before you can comment on or make changes to this bug.
Description
•