Closed
Bug 1397612
Opened 7 years ago
Closed 6 years ago
Intermittent test_quit_restart.py TestQuitRestart.test_in_app_restart_safe_mode | IOError: Process has been unexpectedly closed (Exit code: -15) (Reason: [Errno 111] Connection refused)
Categories
(Testing :: Marionette Client and Harness, defect, P3)
Tracking
(firefox57 fixed, firefox58 disabled, firefox59 disabled, firefox62 disabled, firefox63 disabled, firefox64 disabled, firefox65 fixed)
RESOLVED
FIXED
mozilla65
People
(Reporter: intermittent-bug-filer, Assigned: whimboo)
References
Details
(Keywords: intermittent-failure)
Attachments
(4 files, 1 obsolete file)
1.11 KB,
patch
|
automatedtester
:
review+
whimboo
:
checked-in+
|
Details | Diff | Splinter Review |
59 bytes,
text/x-review-board-request
|
jmaher
:
review+
|
Details |
912 bytes,
patch
|
whimboo
:
review+
|
Details | Diff | Splinter Review |
46 bytes,
text/x-phabricator-request
|
Details | Review |
Filed by: philringnalda [at] gmail.com https://treeherder.mozilla.org/logviewer.html#?job_id=129142170&repo=autoland https://queue.taskcluster.net/v1/task/OthL_bIoQJi13RiICXQA7w/runs/0/artifacts/public/logs/live_backing.log
Assignee | ||
Comment 1•7 years ago
|
||
Marionette client doesn't connect and so it times out. A fix for bug 1362293 will also solve this.
Depends on: 1362293
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Assignee | ||
Comment 4•7 years ago
|
||
Assignee: nobody → hskupin
Attachment #8909472 -
Flags: review?(dburns)
Updated•7 years ago
|
Attachment #8909472 -
Flags: review?(dburns) → review+
Comment 6•7 years ago
|
||
bugherder |
https://hg.mozilla.org/mozilla-central/rev/63d97c8b46b2
Status: NEW → RESOLVED
Closed: 7 years ago
status-firefox57:
--- → fixed
Resolution: --- → FIXED
Target Milestone: --- → mozilla57
Assignee | ||
Updated•7 years ago
|
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Assignee | ||
Comment 12•7 years ago
|
||
The failure here happens due to an in_app restart and should be gone once my patch on bug 1410366 landed.
Depends on: 1410366
Assignee | ||
Comment 13•7 years ago
|
||
Actually the upcoming patch on bug 1410366 should fix it.
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Assignee | ||
Updated•7 years ago
|
Assignee | ||
Updated•7 years ago
|
Attachment #8909472 -
Flags: checked-in+
Comment hidden (mozreview-request) |
Assignee | ||
Updated•7 years ago
|
Attachment #8925493 -
Flags: review?(jmaher)
Comment 17•7 years ago
|
||
mozreview-review |
Comment on attachment 8925493 [details] Bug 1397612 - Backed out changeset 63d97c8b46b2 https://reviewboard.mozilla.org/r/196632/#review201890
Attachment #8925493 -
Flags: review?(jmaher) → review+
Comment 18•7 years ago
|
||
Pushed by hskupin@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/1a83be186f45 Backed out changeset 63d97c8b46b2 r=jmaher
Comment 19•7 years ago
|
||
bugherder |
https://hg.mozilla.org/mozilla-central/rev/1a83be186f45
Status: ASSIGNED → RESOLVED
Closed: 7 years ago → 7 years ago
Resolution: --- → FIXED
Assignee | ||
Comment 20•7 years ago
|
||
If still possible please uplift this test-only patch to beta/release (57). Thanks
Keywords: test-disabled → checkin-needed
Comment 21•7 years ago
|
||
bugherder uplift |
https://hg.mozilla.org/releases/mozilla-release/rev/33538b41a352
Keywords: checkin-needed
Assignee | ||
Updated•7 years ago
|
Assignee | ||
Comment 23•7 years ago
|
||
The failure is still present, but might have another underlying issue now. Lets keep the remaining problem tracked on bug 1415404. I will undo the duplication.
No longer depends on: 1400819
Comment hidden (Intermittent Failures Robot) |
Assignee | ||
Comment 26•7 years ago
|
||
(In reply to Cristina Coroiu [:ccoroiu] from comment #19) > https://hg.mozilla.org/mozilla-central/rev/1a83be186f45 We have to get this commit backed-out on central and beta, because the re-enabled test is causing bug 1391545 which is a high intermittent. I will have to continue to investigate what's wrong with safe mode.
Keywords: checkin-needed
Whiteboard: [backout on central, beta]
Comment 27•7 years ago
|
||
Backout by archaeopteryx@coole-files.de: https://hg.mozilla.org/mozilla-central/rev/f607af87cc3c Backed out changeset 1a83be186f45 on request from whimboo for causing bug 1391545. r=backout a=backout on a CLOSED TREE
Keywords: checkin-needed
Updated•7 years ago
|
Keywords: checkin-needed
Updated•7 years ago
|
Whiteboard: [backout on central, beta] → [backout on beta]
Assignee | ||
Updated•7 years ago
|
Updated•7 years ago
|
status-firefox58:
fixed → ---
Target Milestone: mozilla57 → ---
Updated•7 years ago
|
Target Milestone: --- → mozilla57
Updated•7 years ago
|
status-firefox58:
--- → fixed
Target Milestone: mozilla57 → ---
Updated•7 years ago
|
Keywords: checkin-needed
Whiteboard: [backout on beta]
Assignee | ||
Comment 28•7 years ago
|
||
Btw the backout on mozilla-beta is: https://hg.mozilla.org/releases/mozilla-beta/rev/540e78757c473280d0136536f6b283831400f6da
Comment hidden (Intermittent Failures Robot) |
Assignee | ||
Updated•7 years ago
|
Status: REOPENED → ASSIGNED
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Comment 33•7 years ago
|
||
In the last 7 days there are 37 failures. They occur only on Linux. A recent log example: https://treeherder.mozilla.org/logviewer.html#?repo=autoland&job_id=151473919&lineNumber=32392 [task 2017-12-13T16:53:35.230Z] 16:53:35 INFO - JavaScript error: jar:file:///builds/worker/workspace/build/application/firefox/omni.ja!/components/nsUrlClassifierListManager.js, line 69: NS_ERROR_XPC_GS_RETURNED_FAILURE: Component returned failure code: 0x80570016 (NS_ERROR_XPC_GS_RETURNED_FAILURE) [nsIJSCID.getService] [task 2017-12-13T16:53:35.231Z] 16:53:35 INFO - JavaScript error: jar:file:///builds/worker/workspace/build/application/firefox/omni.ja!/components/nsUrlClassifierListManager.js, line 69: NS_ERROR_XPC_GS_RETURNED_FAILURE: Component returned failure code: 0x80570016 (NS_ERROR_XPC_GS_RETURNED_FAILURE) [nsIJSCID.getService] [task 2017-12-13T16:55:33.880Z] 16:55:33 INFO - TEST-UNEXPECTED-ERROR | testing/marionette/harness/marionette_harness/tests/unit/test_quit_restart.py TestQuitRestart.test_in_app_restart_safe_mode | IOError: Process has been unexpectedly closed (Exit code: -15) (Reason: [Errno 111] Connection refused) ----------- ----------- ----------- [task 2017-12-13T16:57:38.312Z] 16:57:38 INFO - FAILED TESTS [task 2017-12-13T16:57:38.313Z] 16:57:38 INFO - ------- [task 2017-12-13T16:57:38.314Z] 16:57:38 INFO - test_quit_restart.py test_quit_restart.TestQuitRestart.test_in_app_restart_safe_mode [task 2017-12-13T16:57:38.315Z] 16:57:38 INFO - SUITE-END | took 535s [task 2017-12-13T16:57:38.317Z] 16:57:38 INFO - 1513184258295 Marionette DEBUG Closed connection 1 [task 2017-12-13T16:57:39.799Z] 16:57:39 ERROR - Return code: 10 :whimboo , can you please take a look?
Flags: needinfo?(hskupin)
Whiteboard: [stockwell needswork]
Assignee | ||
Comment 34•7 years ago
|
||
We should just skip the patch on all platforms for now. Can someone please land it? Thanks.
Flags: needinfo?(hskupin)
Attachment #8936662 -
Flags: review+
Assignee | ||
Updated•7 years ago
|
Keywords: checkin-needed,
test-disabled
Comment 35•7 years ago
|
||
Pushed by ryanvm@gmail.com: https://hg.mozilla.org/integration/mozilla-inbound/rev/b949944f17b0 Skip TestQuitRestart.test_in_app_restart_safe_mode across all platforms. r=whimboo
Keywords: checkin-needed
Comment 36•7 years ago
|
||
bugherder |
https://hg.mozilla.org/mozilla-central/rev/b949944f17b0
Status: ASSIGNED → RESOLVED
Closed: 7 years ago → 7 years ago
Resolution: --- → FIXED
Target Milestone: --- → mozilla59
Assignee | ||
Updated•7 years ago
|
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment hidden (Intermittent Failures Robot) |
Updated•7 years ago
|
Whiteboard: [stockwell unknown] → [stockwell disabled]
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Comment 42•6 years ago
|
||
Bug https://bugzilla.mozilla.org/show_bug.cgi?id=1391545 which is recommended for disable is depending on this bug. :whimboo do you have any updates regarding dis bug?
Flags: needinfo?(hskupin)
Assignee | ||
Comment 43•6 years ago
|
||
(In reply to Arthur Iakab [arthur_iakab] from comment #42) > Bug https://bugzilla.mozilla.org/show_bug.cgi?id=1391545 which is > recommended for disable is depending on this bug. > > :whimboo do you have any updates regarding dis bug? As you can see this test is disabled and shouldn't cause any harm for 59. If wanted we could still uplift the skip patch to 58 to stop the failures on the release branch.
Flags: needinfo?(hskupin) → needinfo?(aryx.bugmail)
Updated•6 years ago
|
Flags: needinfo?(aryx.bugmail) → needinfo?(jmaher)
Comment 44•6 years ago
|
||
RyanVM could you uplift the patch from comment 35 to beta- then we should have no failures by next week.
Flags: needinfo?(jmaher)
Assignee | ||
Comment 45•6 years ago
|
||
(In reply to Joel Maher ( :jmaher) (UTC-5) from comment #44) > RyanVM could you uplift the patch from comment 35 to beta- then we should > have no failures by next week. CC'ing Ryan, and putting ni? on him. Thanks.
Flags: needinfo?(ryanvm)
Assignee | ||
Comment 46•6 years ago
|
||
I had the chance to take a look at this failure, and simply tried to reproduce it on MacOS by letting it run for a while in background via --headless. Interestingly it failed for me immediately after a couple of loops, and it simply hang. After a bit of investigation it turned out that the underlying problem is the `using_context` decorator in combination with the quit/restart callback. It is trying to switch back to the former content, even with the connection already shutdown in some cases. As result an IOError is thrown, and the @process_check decorator kicks in because it is used for `_send_message`. That by itself causes to run the code in `handle_socket_failure`. Here we wait for the application to shutdown, but for a restart this will not happen. As such Marionette is trying to kill the application. Interestingly I see a hang in `self.process_handler.kill()` at this stage. Maybe this is related to bug 1421289. I will dig further before proposing a solution here.
Assignee | ||
Comment 47•6 years ago
|
||
I also have to add that the Firefox process got a new parent pid after the restart, which is 1. So maybe this is causing problems because it's not in the process group anymore.
Comment 48•6 years ago
|
||
Comment 35 is already on Beta. We're in that point of the cycle where both m-c and m-b are tracking 59. I can skip it on m-r for Fx58 if you feel strongly about it, but I'm leaning towards no.
Flags: needinfo?(ryanvm)
Comment hidden (Intermittent Failures Robot) |
Assignee | ||
Comment 50•6 years ago
|
||
Marking as P1 due to the inappropriate use of the process_check decorator on `_send_message()`. We have to make it more stable. I will still have to look at bug 1421289 first.
Status: REOPENED → ASSIGNED
Priority: P5 → P1
Comment hidden (Intermittent Failures Robot) |
Assignee | ||
Comment 52•6 years ago
|
||
While I was checking that bug again for a possible fix I noticed that this actually is a very busted behavior in quit and restart! It's not only related to a user callback for quit and shutdown, but could happen at any time for in_app restarts. I will file a new bug which will get a fix today and will fix this bug and maybe all the other restart tests.
Assignee | ||
Comment 53•6 years ago
|
||
To ensure my patch on bug 1433873 works, I will already try to re-enable all of those restart tests.
Comment hidden (Intermittent Failures Robot) |
Assignee | ||
Comment 55•6 years ago
|
||
Not an actionable bug for me until bug 1433873 got fixed.
Assignee: hskupin → nobody
Status: ASSIGNED → NEW
Priority: P1 → P3
Assignee | ||
Comment 56•6 years ago
|
||
All dependencies have been fixed. As such I pushed a try build to check if the test works now as expected: https://treeherder.mozilla.org/#/jobs?repo=try&revision=3771ebd758948c5e8f1e7761e95a1d4f78dfced4
Assignee | ||
Comment 57•6 years ago
|
||
Henrik Skupin <mail@hskupin.info> HG: branch 'default' HG: bookmark 'marionette_enable_safe_mode' HG: changed testing/marionette/harness/marionette_harness/tests/unit/test_quit_restart.py
Comment 58•6 years ago
|
||
Pushed by hskupin@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/463d82852c31 [marionette] Re-enable test TestQuitRestart.test_in_app_restart_safe_mode. r=ato
Comment 59•6 years ago
|
||
bugherder |
https://hg.mozilla.org/mozilla-central/rev/463d82852c31
Status: NEW → RESOLVED
Closed: 7 years ago → 6 years ago
Resolution: --- → FIXED
Assignee | ||
Updated•6 years ago
|
Assignee: nobody → hskupin
Whiteboard: [stockwell disabled]
Target Milestone: mozilla59 → mozilla64
Comment 60•6 years ago
|
||
Backout by aciure@mozilla.com: https://hg.mozilla.org/mozilla-central/rev/bd60f5f2f402 Backed out changeset 463d82852c31 for accidentally disabling nearly all the tests in test_quit_restart.py a=backout
Updated•6 years ago
|
Status: RESOLVED → REOPENED
Flags: needinfo?(hskupin)
Resolution: FIXED → ---
Target Milestone: mozilla64 → ---
Updated•6 years ago
|
Assignee | ||
Comment 61•6 years ago
|
||
I will try to land this again correctly by next week once the merge to beta are no longer done and central is on 65.
status-firefox64:
--- → disabled
Updated•6 years ago
|
Attachment #9017470 -
Attachment is obsolete: true
Assignee | ||
Comment 62•6 years ago
|
||
Comment 63•6 years ago
|
||
Pushed by hskupin@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/9234d32ec23c [marionette] Re-enable test TestQuitRestart.test_in_app_restart_safe_mode. r=ato
Comment 64•6 years ago
|
||
bugherder |
https://hg.mozilla.org/mozilla-central/rev/9234d32ec23c
Status: REOPENED → RESOLVED
Closed: 6 years ago → 6 years ago
status-firefox65:
--- → fixed
Resolution: --- → FIXED
Target Milestone: --- → mozilla65
Assignee | ||
Updated•6 years ago
|
Flags: needinfo?(hskupin)
Keywords: test-disabled
Updated•1 year ago
|
Product: Testing → Remote Protocol
Comment 65•1 year ago
|
||
Moving bug to Testing::Marionette Client and Harness component per bug 1815831.
Component: Marionette → Marionette Client and Harness
Product: Remote Protocol → Testing
You need to log in
before you can comment on or make changes to this bug.
Description
•