Closed Bug 855330 Opened 11 years ago Closed 10 years ago

Intermittent 468263-2-ref.html,468263-2.html,468263-2-alternate-ref.html,723484-1.html | application timed out after 330 seconds with no output (Assertion failure: _mOwningThread.GetThread() == PR_GetCurrentThread() (nsStandardURL not thread-safe))

Product/Component: Core :: DOM: Core & HTML
Type: defect
Platform: x86, Windows XP
Priority: Not set
Severity: critical
Status: RESOLVED WORKSFORME
Reporter: RyanVM
Assignee: Unassigned
Keywords: intermittent-failure

More possible fallout from bug 716140?

https://tbpl.mozilla.org/php/getParsedLog.php?id=21161842&tree=Mozilla-Inbound

08:33:13     INFO -  REFTEST TEST-START | file:///c:/talos-slave/test/build/tests/reftest/tests/content/html/content/reftests/468263-2.html | 9075 / 9235 (98%)
08:33:13     INFO -  ++DOMWINDOW == 146 (22323DF8) [serial = 25447] [outer = 09255670]
08:33:13     INFO -  ++DOCSHELL 11BD51E0 == 5 [id = 1177]
08:33:13     INFO -  ++DOMWINDOW == 147 (11BD59E8) [serial = 25448] [outer = 00000000]
08:33:13     INFO -  WARNING: Subdocument container has non-subdocument frame: file e:/builds/moz2_slave/m-in-w32-d-0000000000000000000/build/layout/base/nsDocumentViewer.cpp, line 2385
08:33:13     INFO -  ++DOMWINDOW == 148 (2209E640) [serial = 25449] [outer = 11BD59E8]
08:33:13     INFO -  WARNING: NS_ENSURE_TRUE(txToRemove) failed: file e:/builds/moz2_slave/m-in-w32-d-0000000000000000000/build/docshell/shistory/src/nsSHistory.cpp, line 1320
08:33:13     INFO -  --DOCSHELL 11BD51E0 == 4 [id = 1177]
08:33:13     INFO -  Assertion failure: _mOwningThread.GetThread() == PR_GetCurrentThread() (nsStandardURL not thread-safe), at e:/builds/moz2_slave/m-in-w32-d-0000000000000000000/build/netwerk/base/src/nsStandardURL.cpp:915
08:33:13     INFO -  REFTEST TEST-START | file:///c:/talos-slave/test/build/tests/reftest/tests/content/html/content/reftests/468263-2-ref.html | 9075 / 9235 (98%)
08:33:13     INFO -  ++DOMWINDOW == 149 (222F3ED0) [serial = 25450] [outer = 09255670]
08:33:15     INFO -  ###!!! ASSERTION: Failed to make GL context current!: 'succeeded', file e:/builds/moz2_slave/m-in-w32-d-0000000000000000000/build/gfx/gl/GLContextProviderWGL.cpp, line 335
08:38:45  WARNING -  TEST-UNEXPECTED-FAIL | file:///c:/talos-slave/test/build/tests/reftest/tests/content/html/content/reftests/468263-2-ref.html | application timed out after 330 seconds with no output
08:38:45     INFO -  args: ['c:\\talos-slave\\test\\build\\tests\\bin\\screenshot.exe', 'c:\\docume~1\\cltbld\\locals~1\\temp\\mozilla-test-fail_4qsanr']
08:38:47     INFO -  SCREENSHOT: <see log>

command timed out: 1200 seconds without output, attempting to kill
program finished with exit code 1
This sounds pretty bad.
Severity: normal → critical
This is absurd. =(
Fairly certain this is fixed by the first patch (part one) in bug 856486. Hard to be perfectly sure without a stack trace, but I'm speculatively adding a dependency. Certainly that fixes one possible way this could happen.
Depends on: 856486
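For readers unfamiliar with the assertion in the log: it fires when an object that is not thread-safe is touched from a thread other than the one that created it. The sketch below illustrates that kind of owning-thread check with standard C++ only. It is not the Gecko implementation; `OwningThreadChecker`, `AssertOwningThread` and `CheckFromOtherThread` are hypothetical names chosen for illustration.

```cpp
#include <cassert>
#include <thread>

// Hypothetical stand-in for an owning-thread check; not Mozilla code.
// The object remembers the thread that constructed it.
class OwningThreadChecker {
public:
    OwningThreadChecker() : owner_(std::this_thread::get_id()) {}

    // True only when called on the thread that constructed the object.
    bool IsOnOwningThread() const {
        return std::this_thread::get_id() == owner_;
    }

    // Debug-build style check: aborts when called from the wrong thread.
    void AssertOwningThread() const {
        assert(IsOnOwningThread() && "object is not thread-safe");
    }

private:
    std::thread::id owner_;
};

// Hypothetical helper: evaluates the check from a freshly spawned thread
// and reports the result back to the caller.
bool CheckFromOtherThread(const OwningThreadChecker& checker) {
    bool result = true;
    std::thread worker([&] { result = checker.IsOnOwningThread(); });
    worker.join();
    return result;
}
```

Calling `AssertOwningThread()` on the creating thread passes silently; calling it from the worker thread would trip the assertion, which is the situation the log line reports for nsStandardURL.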
Waldo, do you have any idea why we don't get a stack trace from these? This is supposed to be a fatal assertion, but instead of printing a stack trace and dying immediately it _doesn't_ print a stack trace and lingers on until the harness has to kill it. Kyle suggested I ping you about this.
Flags: needinfo?(jwalden+bmo)
Hmm.  The way we crash is by writing to a super-low address (everywhere), then calling TerminateProcess (Windows) or abort (non-Windows).  Maybe if you do this stuff not from the main thread, some OSes don't immediately kill you?  I don't know how OSes handle this.  But if that were the case, I'd expect to have seen this error much earlier.  Maybe TerminateProcess can deadlock with the main thread, and some recent change makes us contend a whole lot more here?  I'm grasping at straws a bit here.
Flags: needinfo?(jwalden+bmo)
I saw this on MSDN:

> When the system is very busy, TerminateProcess may fail due to timeout. In this case, ERROR_BUSY is set for the last error.

So it looks like TerminateProcess can fail. Are we handling the case where it fails?
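To make the question concrete, here is a minimal, portable sketch of what handling a failed termination call could look like: retry a bounded number of times, then fall through to a last-resort path. This is an assumption about one possible approach, not what the crash code in the tree does. `TerminateWithFallback`, `KillResult` and `maxAttempts` are hypothetical; `primary` stands in for a wrapper that returns true when a call such as TerminateProcess reports success, and `fallback` for something like abort().

```cpp
#include <cassert>
#include <functional>

// Outcome of the attempt; only observable when the process survives
// the primary call (for example in a test with stubbed callbacks).
enum class KillResult { PrimarySucceeded, FellBack };

// Hypothetical helper, not Mozilla code.
// primary:  returns true if the termination request was accepted.
//           In real use a successful call would not return at all.
// fallback: last-resort action taken once every attempt has failed.
KillResult TerminateWithFallback(const std::function<bool()>& primary,
                                 const std::function<void()>& fallback,
                                 int maxAttempts = 3) {
    assert(maxAttempts > 0);
    for (int attempt = 0; attempt < maxAttempts; ++attempt) {
        if (primary()) {
            return KillResult::PrimarySucceeded;
        }
        // The primary call reported failure (e.g. a busy system); retry.
    }
    fallback();
    return KillResult::FellBack;
}
```

The callbacks are injected so the retry logic can be exercised without actually killing the process. Whether retrying is appropriate at all in a crash path is a separate design question that this sketch does not settle.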
https://tbpl.mozilla.org/php/getParsedLog.php?id=22299081&tree=Mozilla-Central
Summary: Intermittent reftests/468263-2-ref.html | application timed out after 330 seconds with no output (Assertion failure: _mOwningThread.GetThread() == PR_GetCurrentThread() (nsStandardURL not thread-safe)) → Intermittent reftests/468263-2-ref.html,468263-2.html | application timed out after 330 seconds with no output (Assertion failure: _mOwningThread.GetThread() == PR_GetCurrentThread() (nsStandardURL not thread-safe))
https://tbpl.mozilla.org/php/getParsedLog.php?id=22677843&tree=Mozilla-Central
Summary: Intermittent reftests/468263-2-ref.html,468263-2.html | application timed out after 330 seconds with no output (Assertion failure: _mOwningThread.GetThread() == PR_GetCurrentThread() (nsStandardURL not thread-safe)) → Intermittent 468263-2-ref.html,468263-2.html,468263-2-alternate-ref.html | application timed out after 330 seconds with no output (Assertion failure: _mOwningThread.GetThread() == PR_GetCurrentThread() (nsStandardURL not thread-safe))
https://tbpl.mozilla.org/php/getParsedLog.php?id=24607792&tree=Fx-Team
Summary: Intermittent 468263-2-ref.html,468263-2.html,468263-2-alternate-ref.html | application timed out after 330 seconds with no output (Assertion failure: _mOwningThread.GetThread() == PR_GetCurrentThread() (nsStandardURL not thread-safe)) → Intermittent 468263-2-ref.html,468263-2.html,468263-2-alternate-ref.html,723484-1.html | application timed out after 330 seconds with no output (Assertion failure: _mOwningThread.GetThread() == PR_GetCurrentThread() (nsStandardURL not thread-safe))
Depends on: 912514
Closing inactive keywords:intermittent-failure bugs where the TBPLbot has previously commented and the test isn't marked as disabled; filter on orange-cleanup-201401.
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → WORKSFORME