1387827 - Permaorange devtools timed out after 1000 seconds of no output on Linux x64 JSDCov

Reporter

Description

•

7 years ago

treeherder

Filed by: archaeopteryx [at] coole-files.de

https://treeherder.mozilla.org/logviewer.html#?job_id=121258817&repo=mozilla-central

https://queue.taskcluster.net/v1/task/dwLQksgyQlaQZxnFfHGm8g/runs/0/artifacts/public/logs/live_backing.log

Comment hidden (Intermittent Failures Robot)

3 failures in 888 pushes (0.003 failures/push) were associated with this bug in the last 7 days.   

Repository breakdown:
* mozilla-central: 3

Platform breakdown:
* linux64-jsdcov: 3

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-07-31&endday=2017-08-06&tree=all

Comment hidden (Intermittent Failures Robot)

13 failures in 901 pushes (0.014 failures/push) were associated with this bug in the last 7 days.   

Repository breakdown:
* mozilla-central: 9
* try: 4

Platform breakdown:
* linux64-jsdcov: 13

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-08-07&endday=2017-08-13&tree=all

Joel Maher ( :jmaher ) (UTC -8)

Comment 3

•

7 years ago

this failure is quite frequent and I we should look into it.  This week it is trending on 30+ failures.  Possibly 1 test needs to be disabled, or maybe we need more chunks or a longer timeout.

:gmierz, can you look into this?

Flags: needinfo?(gmierz2)

Whiteboard: [stockwell needswork]

Greg Mierzwinski [:sparky]

Comment 4

•

7 years ago

I'm on it. Also, I'm leaving the ni? open to save this bug.

Comment hidden (Intermittent Failures Robot)

26 failures in 908 pushes (0.029 failures/push) were associated with this bug in the last 7 days.   

Repository breakdown:
* mozilla-central: 20
* try: 6

Platform breakdown:
* linux64-jsdcov: 26

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-08-21&endday=2017-08-27&tree=all

Comment hidden (Intermittent Failures Robot)

21 failures in 175 pushes (0.12 failures/push) were associated with this bug yesterday.   

Repository breakdown:
* try: 17
* mozilla-central: 4

Platform breakdown:
* linux64-jsdcov: 21

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-08-29&endday=2017-08-29&tree=all

Greg Mierzwinski [:sparky]

Comment 7

•

7 years ago

I've managed to get rid of most of the failures by increasing the number of chunks up to 16: https://treeherder.mozilla.org/#/jobs?repo=try&revision=0b9b1e7fa2e07343af1ff2fab697ad4f5d8bf537

Right now, I'm looking into if just skipping that one test 'browser_dbg_stack-03.js' will get rid of the last error. I'm still not sure why it's perma-failing but I think it's because of the use of the debugger in that test: https://treeherder.mozilla.org/#/jobs?repo=try&revision=c25bc14ca0feea6e8887b0d4be74bedab9920eaf

Greg Mierzwinski [:sparky]

Comment 8

•

7 years ago

The last push that I did didn't work [1]. But looking at the logs of each of the failing tests we have that 'this.content' is null in 'test-actors.js': https://dxr.mozilla.org/mozilla-central/source/devtools/client/shared/test/test-actor.js#681,702

So, there is something consistent to go from. I've also noticed that there are an incredible amount of connection closed errors (not just one) [2]. Also, there is another error at the start, [3].

It's also possible that either addons or marionette is broken as I found a warning in the log here: https://treeherder.mozilla.org/logviewer.html#?job_id=127425665&repo=try&lineNumber=2004

[1]: https://treeherder.mozilla.org/#/jobs?repo=try&revision=c25bc14ca0feea6e8887b0d4be74bedab9920eaf
[2]: https://treeherder.mozilla.org/logviewer.html#?job_id=127425665&repo=try&lineNumber=4067
[3]: https://treeherder.mozilla.org/logviewer.html#?job_id=127425665&repo=try&lineNumber=2337

Comment hidden (Intermittent Failures Robot)

90 failures in 939 pushes (0.096 failures/push) were associated with this bug in the last 7 days. 

This is the #21 most frequent failure this week. 

** This failure happened more than 75 times this week! Resolving this bug is a very high priority. **

** Try to resolve this bug as soon as possible. If unresolved for 1 week, the affected test(s) may be disabled. **  

Repository breakdown:
* mozilla-central: 53
* try: 37

Platform breakdown:
* linux64-jsdcov: 89
* osx-10-10: 1

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-08-28&endday=2017-09-03&tree=all

Comment hidden (Intermittent Failures Robot)

15 failures in 155 pushes (0.097 failures/push) were associated with this bug yesterday.   

Repository breakdown:
* mozilla-central: 15

Platform breakdown:
* linux64-jsdcov: 15

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-09-05&endday=2017-09-05&tree=all

Geoff Brown [:gbrown]

Comment 11

•

7 years ago

I reviewed several logs from yesterday and noticed several ended in something like:

[task 2017-09-05T23:05:41.633056Z] 23:05:41     INFO - GECKO(1942) | console.log: [DISPATCH] {type:..,highlighted:..,nodeFront:.., }
[task 2017-09-05T23:05:42.058503Z] 23:05:42     INFO - GECKO(1942) | console.error:
[task 2017-09-05T23:05:42.059913Z] 23:05:42     INFO - GECKO(1942) |   Message: TypeError: content is null
[task 2017-09-05T23:05:42.060046Z] 23:05:42     INFO - GECKO(1942) |   Stack:
[task 2017-09-05T23:05:42.060112Z] 23:05:42     INFO - GECKO(1942) |     @http://example.com/browser/devtools/client/shared/test/test-actor.js:683:5
[task 2017-09-05T23:05:42.061357Z] 23:05:42     INFO - GECKO(1942) | @http://example.com/browser/devtools/client/shared/test/test-actor.js:683:5
[task 2017-09-05T23:22:22.089404Z] 23:22:22     INFO - Automation Error: mozprocess timed out after 1000 seconds running ...

even though they are running various tests:

https://treeherder.mozilla.org/logviewer.html#?repo=mozilla-central&job_id=128710350&lineNumber=5255
https://treeherder.mozilla.org/logviewer.html#?repo=mozilla-central&job_id=128712747&lineNumber=5254
https://treeherder.mozilla.org/logviewer.html#?repo=mozilla-central&job_id=128712824&lineNumber=5006

:gmierz - Is that something you have noticed before?

Greg Mierzwinski [:sparky]

Comment 13

•

7 years ago

Yes, nearly all the failures in devtools are for that reason. In comment 6, I've detailed what I've found so far in each of the failures. Most of them start with the error in [3] and then make multiple connection closed errors displayed in [2] and then finally fail with the content is null error. I haven't had much time to look further although I did try increasing the number of chunks and increasing the timeouts which didn't help.

Content is definitely null but I haven't found why it becomes null. The last thing I was looking at was trying to find something that sets content to null and see if it's being run, but I haven't had a chance to test this yet.

Would you have any thoughts about why this failure is happening?

Flags: needinfo?(gmierz2)

Geoff Brown [:gbrown]

Comment 14

•

7 years ago

(In reply to Greg Mierzwinski [:gmierz] from comment #13)
> The last
> thing I was looking at was trying to find something that sets content to
> null and see if it's being run, but I haven't had a chance to test this yet.

That sounds like a good approach.
 
> Would you have any thoughts about why this failure is happening?

Sorry, no.

Comment hidden (Intermittent Failures Robot)

20 failures in 64 pushes (0.313 failures/push) were associated with this bug yesterday.   

Repository breakdown:
* mozilla-central: 20

Platform breakdown:
* linux64-jsdcov: 20

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-09-09&endday=2017-09-09&tree=all

Comment hidden (Intermittent Failures Robot)

127 failures in 924 pushes (0.137 failures/push) were associated with this bug in the last 7 days. 

This is the #16 most frequent failure this week. 

** This failure happened more than 75 times this week! Resolving this bug is a very high priority. **

** Try to resolve this bug as soon as possible. If unresolved for 1 week, the affected test(s) may be disabled. **  

Repository breakdown:
* mozilla-central: 127

Platform breakdown:
* linux64-jsdcov: 127

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-09-04&endday=2017-09-10&tree=all

Greg Mierzwinski [:sparky]

Comment 17

•

7 years ago

I spent some time looking into this and I checked if any place that sets content (or a content variable anyway) to null is being run and found that none of them are being run. So, either that is really the case or I somehow missed one of them. 'this.content' is (from what I understand) a content window and it looks to me like it's a devtools window but I haven't tested that yet.

In a previous comment I mentioned the connection closed errors which I've found to mean nothing for this error since we hit the same 'content is null' errors regardless of whether or not that failure is there. For some reason though, in some cases, the test fails and continues to fail on another test, then in others, the mozprocess times out and there are no other errors except for the first one. In my opinion, this means that there could be two different errors occurring - and one isn't being caught- or that it's the same error but a different "manifestation" of it is not being caught. This has nothing to do with the error itself, but it does help with categorizing the error(s) a little.

As I look through the logs though, I see a few errors that are either purposeful or are not being caught, so I plan on looking into that now.

Joel, would you have any thoughts about this or another idea of what I could try?

Flags: needinfo?(jmaher)

Joel Maher ( :jmaher ) (UTC -8)

Comment 18

•

7 years ago

we fail on the same devtools chunks, but the last day we have greatly reduced the failures and now only 1 chunk is failing:
https://treeherder.mozilla.org/#/jobs?repo=mozilla-central&filter-searchStr=jsdcov%20devtools&selectedJob=130324062

it seems to be failing right after:
TEST-START | devtools/client/debugger/test/mochitest/browser_dbg_stack-03.js

so I think possibly we can just skip that test?

Flags: needinfo?(jmaher)

Greg Mierzwinski [:sparky]

Comment 19

•

7 years ago

Ah, that is great! :)

Do you know, off hand, what patch fixed this? Otherwise, I'll look around.

Yes, let's skip it. That test you mention has been failing for a long time now, and I have a feeling that the js debugger may be interfering with it because it uses the debugger also. I'll have a patch up soon to skip this one.

Joel Maher ( :jmaher ) (UTC -8)

Comment 20

•

7 years ago

lets go the path of least resistance.

Comment hidden (Intermittent Failures Robot)

1 failures in 191 pushes (0.005 failures/push) were associated with this bug yesterday.    

** This test has failed more than 200 times in the last 30 days. It should be disabled until it can be fixed. ** 

Repository breakdown:
* mozilla-central: 1

Platform breakdown:
* linux64-jsdcov: 1

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-09-14&endday=2017-09-14&tree=all

Greg Mierzwinski [:sparky]

Comment 22

•

7 years ago

There seems to be two new errors now on jsdcov with the following error: https://treeherder.mozilla.org/logviewer.html#?job_id=131593077&repo=mozilla-central&lineNumber=1979

Not sure why it's happening but I'm going to open a new bug to disable 'browser_dbg_stack-03.js' since it's being a problem regardless of this error.

Comment hidden (Intermittent Failures Robot)

41 failures in 1032 pushes (0.04 failures/push) were associated with this bug in the last 7 days.   

** This failure happened more than 30 times this week! Resolving this bug is a high priority. **

** Try to resolve this bug as soon as possible. If unresolved for 2 weeks, the affected test(s) may be disabled. **  

Repository breakdown:
* mozilla-central: 41

Platform breakdown:
* linux64-jsdcov: 39
* macosx64-nightly: 2

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-09-11&endday=2017-09-17&tree=all

Joel Maher ( :jmaher ) (UTC -8)

Comment 24

•

7 years ago

thanks :gmierz!

Joel Maher ( :jmaher ) (UTC -8)

Comment 25

•

7 years ago

disabled the one test in bug 1400683, I assume this will be reduced greatly or completely in frequency.

Joel Maher ( :jmaher ) (UTC -8)

Updated

•

7 years ago

Depends on: 1401215

Comment hidden (Intermittent Failures Robot)

16 failures in 199 pushes (0.08 failures/push) were associated with this bug yesterday.    

Repository breakdown:
* mozilla-central: 13
* try: 3

Platform breakdown:
* linux64-jsdcov: 16

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-09-21&endday=2017-09-21&tree=all

Comment hidden (Intermittent Failures Robot)

42 failures in 943 pushes (0.045 failures/push) were associated with this bug in the last 7 days. 

This is the #40 most frequent failure this week.  

** This failure happened more than 30 times this week! Resolving this bug is a high priority. **

** Try to resolve this bug as soon as possible. If unresolved for 2 weeks, the affected test(s) may be disabled. **  

Repository breakdown:
* mozilla-central: 40
* try: 2

Platform breakdown:
* linux64-jsdcov: 42

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-09-18&endday=2017-09-24&tree=all

Joel Maher ( :jmaher ) (UTC -8)

Comment 28

•

7 years ago

unfortunately we still see a high failure rate, everything looks to be related to bug 1401215.

Comment hidden (Intermittent Failures Robot)

62 failures in 885 pushes (0.07 failures/push) were associated with this bug in the last 7 days. 

This is the #30 most frequent failure this week.  

** This failure happened more than 30 times this week! Resolving this bug is a high priority. **

** Try to resolve this bug as soon as possible. If unresolved for 2 weeks, the affected test(s) may be disabled. **  

Repository breakdown:
* mozilla-central: 60
* autoland: 2

Platform breakdown:
* linux64-jsdcov: 60
* macosx64-stylo-disabled: 2

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-09-25&endday=2017-10-01&tree=all

Comment hidden (Intermittent Failures Robot)

38 failures in 824 pushes (0.046 failures/push) were associated with this bug in the last 7 days.   

** This failure happened more than 30 times this week! Resolving this bug is a high priority. **

** Try to resolve this bug as soon as possible. If unresolved for 2 weeks, the affected test(s) may be disabled. **  

Repository breakdown:
* mozilla-central: 38

Platform breakdown:
* linux64-jsdcov: 37
* windows7-32-nightly: 1

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-10-02&endday=2017-10-08&tree=all

Phil Ringnalda (:philor)

Comment 31

•

7 years ago

Even though it's only two chunks of permaorange, it's still two chunks of permaorange, not intermittent.

Summary: Intermittent devtools timed out after 1000 seconds of no output on Linux x64 JSDCov → Permaorange devtools timed out after 1000 seconds of no output on Linux x64 JSDCov

Joel Maher ( :jmaher ) (UTC -8)

Comment 32

•

7 years ago

after bug 1393788 is completed we will dive into this bug and see what remains.

Depends on: 1393788

Comment hidden (Intermittent Failures Robot)

76 failures in 947 pushes (0.08 failures/push) were associated with this bug in the last 7 days. 

This is the #20 most frequent failure this week. 

** This failure happened more than 75 times this week! Resolving this bug is a very high priority. **

** Try to resolve this bug as soon as possible. If unresolved for 1 week, the affected test(s) may be disabled. **   

Repository breakdown:
* mozilla-central: 76

Platform breakdown:
* linux64-jsdcov: 74
* linux64-ccov: 2

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-10-09&endday=2017-10-15&tree=all

Joel Maher ( :jmaher ) (UTC -8)

Comment 34

•

7 years ago

Attached patch disable 3 tests on coverage to get this green again — Details — Splinter Review

https://treeherder.mozilla.org/#/jobs?repo=try&revision=fb3584aca76450aa61dc8451cbba17aebd9b2d5a

Attachment #8918902 - Flags: review?(gbrown)

Geoff Brown [:gbrown]

Updated

•

7 years ago

Attachment #8918902 - Flags: review?(gbrown) → review+

Pulsebot

Comment 35

•

7 years ago

Pushed by jmaher@mozilla.com:
https://hg.mozilla.org/integration/mozilla-inbound/rev/f7bf0e655457
Disable 2 devtools tests on coverage builds for frequent timeouts. r=gbrown, a=test-only

Sebastian Hengst [:aryx] (needinfo me if it's about an intermittent or backout)

Comment 36

•

7 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/f7bf0e655457

Status: NEW → RESOLVED

Closed: 7 years ago

status-firefox58: --- → fixed

Resolution: --- → FIXED

Comment hidden (Intermittent Failures Robot)

4 failures in 864 pushes (0.005 failures/push) were associated with this bug in the last 7 days.    

Repository breakdown:
* mozilla-central: 4

Platform breakdown:
* linux64-jsdcov: 4

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2017-10-16&endday=2017-10-22&tree=all

Joel Maher ( :jmaher ) (UTC -8)

Updated

•

7 years ago

Whiteboard: [stockwell disable-recommended] → [stockwell disabled]

Comment hidden (Intermittent Failures Robot)

2 failures in 462 pushes (0.004 failures/push) were associated with this bug in the last 7 days.    

Repository breakdown:
* mozilla-central: 2

Platform breakdown:
* linux64-jsdcov: 2

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2018-01-01&endday=2018-01-07&tree=all

Comment hidden (Intermittent Failures Robot)

1 failures in 702 pushes (0.001 failures/push) were associated with this bug in the last 7 days.    

Repository breakdown:
* try: 1

Platform breakdown:
* osx-10-10: 1

For more details, see:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1387827&startday=2018-02-05&endday=2018-02-11&tree=all

Henrik Skupin [:whimboo][⌚️UTC+1]

Updated

•

6 years ago

Status: RESOLVED → REOPENED

Component: General → Developer Tools

Keywords: test-disabled

Product: Release Engineering → Firefox

Resolution: FIXED → ---

Henrik Skupin [:whimboo][⌚️UTC+1]

Comment 40

•

6 years ago

Tests have been disabled here, so the bug shouldn't have been marked as fixed.

Henrik Skupin [:whimboo][⌚️UTC+1]

Updated

•

6 years ago

status-firefox58: fixed → disabled

Comment hidden (Intermittent Failures Robot)

1 failures in 765 pushes (0.001 failures/push) were associated with this bug in the last 7 days.    

Repository breakdown:
* mozilla-central: 1

Platform breakdown:
* linux64-jsdcov: 1

For more details, see:
https://treeherder.mozilla.org/intermittent-failures.html#/bugdetails?bug=1387827&startday=2018-04-02&endday=2018-04-08&tree=trunk

Comment hidden (Intermittent Failures Robot)

1 failures in 782 pushes (0.001 failures/push) were associated with this bug in the last 7 days.    

Repository breakdown:
* mozilla-central: 1

Platform breakdown:
* linux64-jsdcov: 1

For more details, see:
https://treeherder.mozilla.org/intermittent-failures.html#/bugdetails?bug=1387827&startday=2018-04-16&endday=2018-04-22&tree=trunk

Firefox Bug Husbandry Bot

Comment 43

•

6 years ago

https://wiki.mozilla.org/Bug_Triage#Intermittent_Test_Failure_Cleanup

Status: REOPENED → RESOLVED

Closed: 7 years ago → 6 years ago

Resolution: --- → INCOMPLETE

BMO Automation

Updated

•

6 years ago

Product: Firefox → DevTools

Greg Mierzwinski [:sparky]

Comment 44

•

6 years ago

The linux64-jsdcov build has been disabled, and no longer runs in taskcluster, see bug 1496791.

Comment 45

•

4 years ago

Attached file Bug 1387827 - Delete skip line for browser_browser_toolbox.js and browser_browser_toolbox_fission_inspector.js as they are green on ccov. r=jmaher — Details

Phabricator Automation

Updated

•

4 years ago

Assignee: nobody → csabou

Cosmin Sabou [:CosminS]

Assignee

Comment 46

•

4 years ago

Stumbled upon these two lines (https://searchfox.org/mozilla-central/source/devtools/client/framework/browser-toolbox/test/browser.ini#27,32) as I was working on another bug, searched them with .mach test-info and found that both have:
windows10-64/ccov-opt-e10s: 0 failures ( 0 skipped) in 25 runs
linux1804-64/ccov-opt-e10s: 0 failures ( 0 skipped) in 12 runs
so that's the reason for this patch.

Assignee: csabou → nobody

Phabricator Automation

Updated

•

4 years ago

Assignee: nobody → csabou

Pulsebot

Comment 47

•

4 years ago

Pushed by shindli@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/64b1ca50cf4f
Delete skip line for browser_browser_toolbox.js and browser_browser_toolbox_fission_inspector.js as they are green on ccov. r=jmaher

Stefan Hindli [:stefan_hindli]

Comment 48

•

4 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/64b1ca50cf4f

Comment hidden (Intermittent Failures Robot)

disable 3 tests on coverage to get this green again 7 years ago Joel Maher ( :jmaher ) (UTC -8) 2.03 KB, patch	gbrown : review+	Details \| Diff \| Splinter Review
Bug 1387827 - Delete skip line for browser_browser_toolbox.js and browser_browser_toolbox_fission_inspector.js as they are green on ccov. r=jmaher 4 years ago Cosmin Sabou [:CosminS] 47 bytes, text/x-phabricator-request		Details \| Review