Closed Bug 1621195 Opened 4 years ago Closed 4 years ago

Intermittent GECKO(1115) | SUMMARY: ThreadSanitizer: SEGV /builds/worker/checkouts/gecko/toolkit/components/terminator/nsTerminator.cpp:203:7 in mozilla::(anonymous namespace)::RunWatchdog(void*)

Categories

(Core :: DOM: Navigation, defect, P5)

RESOLVED DUPLICATE of bug 1358898

People

(Reporter: intermittent-bug-filer, Unassigned)

References

(Blocks 1 open bug)

Details

(Keywords: intermittent-failure, Whiteboard: [retriggered])

Filed by: nbeleuzu [at] mozilla.com
Parsed log: https://treeherder.mozilla.org/logviewer.html#?job_id=292387812&repo=autoland
Full log: https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/QiZuFjs7SnOSjZTYPDmwQA/runs/0/artifacts/public/logs/live_backing.log


[task 2020-03-10T03:13:12.017Z] 03:13:12 INFO - TEST-OK | docshell/test/mochitest/test_windowedhistoryframes.html | took 3552ms
[task 2020-03-10T03:13:12.225Z] 03:13:12 INFO - TEST-START | Shutdown
[task 2020-03-10T03:13:12.226Z] 03:13:12 INFO - Passed: 294
[task 2020-03-10T03:13:12.227Z] 03:13:12 INFO - Failed: 0
[task 2020-03-10T03:13:12.228Z] 03:13:12 INFO - Todo: 8
[task 2020-03-10T03:13:12.229Z] 03:13:12 INFO - Mode: e10s
[task 2020-03-10T03:13:12.233Z] 03:13:12 INFO - Slowest: 30664ms - /tests/docshell/test/mochitest/test_bug570341.html
[task 2020-03-10T03:13:12.236Z] 03:13:12 INFO - SimpleTest FINISHED
[task 2020-03-10T03:13:12.238Z] 03:13:12 INFO - TEST-INFO | Ran 1 Loops
[task 2020-03-10T03:13:12.240Z] 03:13:12 INFO - SimpleTest FINISHED
[task 2020-03-10T03:13:14.933Z] 03:13:14 INFO - GECKO(1115) | ###!!! [Child][MessageChannel] Error: (msgtype=0x37010E,name=PContent::Msg_AccumulateChildKeyedHistograms) Channel closing: too late to send/recv, messages will be lost
[task 2020-03-10T03:13:14.935Z] 03:13:14 INFO - GECKO(1115) | ###!!! [Child][MessageChannel] Error: (msgtype=0x370112,name=PContent::Msg_RecordDiscardedData) Channel closing: too late to send/recv, messages will be lost
[task 2020-03-10T03:13:15.794Z] 03:13:15 INFO - GECKO(1115) | ###!!! [Child][MessageChannel] Error: (msgtype=0x37010E,name=PContent::Msg_AccumulateChildKeyedHistograms) Channel closing: too late to send/recv, messages will be lost
[task 2020-03-10T03:13:15.794Z] 03:13:15 INFO - GECKO(1115) | ###!!! [Child][MessageChannel] Error: (msgtype=0x370112,name=PContent::Msg_RecordDiscardedData) Channel closing: too late to send/recv, messages will be lost
[task 2020-03-10T03:13:28.140Z] 03:13:28 INFO - GECKO(1115) | console.error: (new Error("SessionFile is closed", "resource:///modules/sessionstore/SessionFile.jsm", 433))
[task 2020-03-10T03:19:17.173Z] 03:19:17 INFO - GECKO(1115) | ThreadSanitizer:DEADLYSIGNAL
[task 2020-03-10T03:19:17.173Z] 03:19:17 INFO - GECKO(1115) | ==1115==ERROR: ThreadSanitizer: SEGV on unknown address 0x000000000000 (pc 0x7fd0b5cda2d2 bp 0x00000000016b sp 0x7fcff04fb270 T1350)
[task 2020-03-10T03:19:17.173Z] 03:19:17 INFO - GECKO(1115) | ==1115==The signal is caused by a WRITE memory access.
[task 2020-03-10T03:19:17.174Z] 03:19:17 INFO - GECKO(1115) | ==1115==Hint: address points to the zero page.
[task 2020-03-10T03:19:17.210Z] 03:19:17 INFO - GECKO(1115) | #0 mozilla::(anonymous namespace)::RunWatchdog(void*) /builds/worker/checkouts/gecko/toolkit/components/terminator/nsTerminator.cpp:203:7 (libxul.so+0x63df2d1)
[task 2020-03-10T03:19:17.211Z] 03:19:17 INFO - GECKO(1115) | #1 _pt_root /builds/worker/checkouts/gecko/nsprpub/pr/src/pthreads/ptthread.c:201:5 (libnspr4.so+0x46a4f)
[task 2020-03-10T03:19:17.212Z] 03:19:17 INFO - GECKO(1115) | #2 __tsan_thread_start_func /builds/worker/fetches/llvm-project/llvm/projects/compiler-rt/lib/tsan/rtl/tsan_interceptors.cc:957:15 (firefox+0x5710c)
[task 2020-03-10T03:19:17.213Z] 03:19:17 INFO - GECKO(1115) | #3 start_thread <null> (libpthread.so.0+0x76b9)
[task 2020-03-10T03:19:17.214Z] 03:19:17 INFO - GECKO(1115) | #4 clone /build/glibc-LK5gWL/glibc-2.23/misc/../sysdeps/unix/sysv/linux/x86_64/clone.S:109 (libc.so.6+0x10741c)
[task 2020-03-10T03:19:17.214Z] 03:19:17 INFO - GECKO(1115) | ThreadSanitizer can not provide additional info.
[task 2020-03-10T03:19:17.215Z] 03:19:17 INFO - GECKO(1115) | SUMMARY: ThreadSanitizer: SEGV /builds/worker/checkouts/gecko/toolkit/components/terminator/nsTerminator.cpp:203:7 in mozilla::(anonymous namespace)::RunWatchdog(void*)
[task 2020-03-10T03:19:17.215Z] 03:19:17 INFO - GECKO(1115) | ==1115==ABORTING
[task 2020-03-10T03:25:27.212Z] 03:25:27 INFO - Buffered messages finished
[task 2020-03-10T03:25:27.213Z] 03:25:27 ERROR - TEST-UNEXPECTED-TIMEOUT | Last test finished | application timed out after 370 seconds with no output
[task 2020-03-10T03:25:27.214Z] 03:25:27 ERROR - Force-terminating active process(es).

Blocks: tsan
Whiteboard: [retriggered]
See Also: → 1614294

Christian, can you please take a look? This seems to have shown up when you landed the fix in bug 1614294.

Flags: needinfo?(choller)

(In reply to Andreea Pavel [:apavel] from comment #4)

Christian, can you please take a look? This seems to have shown up when you landed the fix in bug 1614294.

As far as I can tell, this is the same bug. This is not caused by 1614294 at all.

If the new shutdown timeout doesn't help, then I would assume that the browser does not shut down for some reason and really hangs. This seems to happen on all platforms, not just TSan (it is just far more frequent on TSan, which could be due to the build slowness/timing).
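For readers unfamiliar with the pattern, here is a minimal, generic sketch of a shutdown watchdog: a thread that waits for shutdown to complete and fires a callback if the timeout expires first. It is not the nsTerminator.cpp implementation; `start_watchdog`, `shutdown_done` and `on_hang` are hypothetical names chosen for illustration. It assumes the SEGV reported in `RunWatchdog` is the watchdog's reaction to an expired shutdown timeout, which the log suggests but this comment does not state.

```python
import threading


def start_watchdog(shutdown_done, timeout_s, on_hang):
    """Start a thread that calls on_hang() if shutdown_done is not set in time.

    Hypothetical helper for illustration only, not the Gecko implementation.
    """

    def run():
        # Event.wait() returns True if the event was set before the timeout
        # expired, and False if the timeout expired first.
        if not shutdown_done.wait(timeout_s):
            on_hang()

    thread = threading.Thread(target=run, name="shutdown-watchdog", daemon=True)
    thread.start()
    return thread


if __name__ == "__main__":
    done = threading.Event()
    # Shutdown never signals completion here, so the watchdog fires.
    watchdog = start_watchdog(done, 0.1, lambda: print("shutdown hang detected"))
    watchdog.join()
```

In a real browser the callback would deliberately crash the process so that a report is produced, rather than print a message.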

Flags: needinfo?(choller)

Christian, are you still working on this, or do you know who can take a look at this?

Flags: needinfo?(choller)

(In reply to Andreea Pavel [:apavel] from comment #6)

Christian, are you still working on this, or do you know who can take a look at this?

I don't know who could take a look, but it would help to consolidate this bug with the other shutdown hangs. The fact that TSan has a different message here does not mean that this issue is TSan-specific at all. As mentioned in the other bug, this is likely a shutdown hang. The fact that it shows up more often on TSan might be due to timing, but we've seen this on ASan too, and even on regular builds. Treeherder is unlikely to link all of these bugs together though, because they show up slightly differently on each platform. That includes the timeout, which is now 360 seconds for TSan, should be 180 seconds for ASan, and is 60 seconds for all other platforms (all +10 secs, I believe, for the watchdog message).
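The timeout arithmetic in this comment can be written out as a small sketch. The values are the ones quoted in the comment; the 10-second margin is the commenter's recollection, not a verified constant, and the names below (`SHUTDOWN_TIMEOUT_SECONDS`, `expected_no_output_timeout`) are hypothetical and do not come from the test harness.

```python
# Shutdown timeouts in seconds, as quoted in the comment (not read from the harness).
SHUTDOWN_TIMEOUT_SECONDS = {"tsan": 360, "asan": 180}
DEFAULT_SHUTDOWN_TIMEOUT_SECONDS = 60  # "all other platforms"
WATCHDOG_MARGIN_SECONDS = 10  # assumption: the "+10 secs" the commenter recalls


def expected_no_output_timeout(build_type):
    """Return the timeout, in seconds, expected in the harness message."""
    base = SHUTDOWN_TIMEOUT_SECONDS.get(
        build_type.lower(), DEFAULT_SHUTDOWN_TIMEOUT_SECONDS
    )
    return base + WATCHDOG_MARGIN_SECONDS


if __name__ == "__main__":
    for build in ("tsan", "asan", "opt"):
        print(build, expected_no_output_timeout(build))
```

For TSan this gives 370 seconds, which matches the "application timed out after 370 seconds with no output" line in the log above.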

Flags: needinfo?(choller)

Just to be clear, by consolidating with other shutdown bugs, you mean with bug 1358898?

(In reply to Andreea Pavel [:apavel] from comment #8)

Just to be clear, by consolidating with other shutdown bugs, you mean with bug 1358898?

Yes, that sounds like the right meta bug.

Status: NEW → RESOLVED
Closed: 4 years ago
Resolution: --- → DUPLICATE
Depends on: 1624267