Closed Bug 1673786 Opened 5 years ago Closed 5 years ago

Intermittent testing/firefox-ui/tests/functional/safebrowsing/test_notification.py TestSafeBrowsingNotificationBar.test_notification_bar | IOError: Process killed because the connection to Marionette server is lost. Check gecko.log for errors

Categories

(Testing :: Firefox UI Tests, defect, P5)

defect

Tracking

(firefox-esr78 unaffected, firefox82 unaffected, firefox83 unaffected, firefox84 fixed)

RESOLVED FIXED
84 Branch
Tracking Status
firefox-esr78 --- unaffected
firefox82 --- unaffected
firefox83 --- unaffected
firefox84 --- fixed

People

(Reporter: nataliaCs, Assigned: whimboo)

References

(Regression)

Details

(Keywords: intermittent-failure, regression, Whiteboard: [stockwell disable-recommended])

Attachments

(1 obsolete file)

Failure log: https://treeherder.mozilla.org/logviewer.html#/jobs?job_id=319962774&repo=autoland&lineNumber=1681

[task 2020-10-28T02:43:51.413Z] 02:43:51 INFO - TEST-UNEXPECTED-ERROR | testing/firefox-ui/tests/functional/safebrowsing/test_notification.py TestSafeBrowsingNotificationBar.test_notification_bar | IOError: Process killed because the connection to Marionette server is lost. Check gecko.log for errors (Reason: Connection timed out after 360.0s)
[task 2020-10-28T02:43:51.413Z] 02:43:51 INFO - Traceback (most recent call last):
[task 2020-10-28T02:43:51.414Z] 02:43:51 INFO - File "/builds/worker/workspace/build/venv/lib/python2.7/site-packages/marionette_harness/marionette_test/testcases.py", line 214, in run
[task 2020-10-28T02:43:51.414Z] 02:43:51 INFO - testMethod()
[task 2020-10-28T02:43:51.414Z] 02:43:51 INFO - File "/builds/worker/workspace/build/tests/firefox-ui/tests/testing/firefox-ui/tests/functional/safebrowsing/test_notification.py", line 60, in test_notification_bar
[task 2020-10-28T02:43:51.414Z] 02:43:51 INFO - self.check_get_me_out_of_here_button()
[task 2020-10-28T02:43:51.414Z] 02:43:51 INFO - File "/builds/worker/workspace/build/tests/firefox-ui/tests/testing/firefox-ui/tests/functional/safebrowsing/test_notification.py", line 111, in check_get_me_out_of_here_button
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - message="The default home page has not been loaded",
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - File "/builds/worker/workspace/build/venv/lib/python2.7/site-packages/marionette_driver/wait.py", line 134, in until
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - rv = condition(self.marionette)
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - File "/builds/worker/workspace/build/tests/firefox-ui/tests/testing/firefox-ui/tests/functional/safebrowsing/test_notification.py", line 110, in <lambda>
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - lambda mn: self.default_homepage in mn.get_url(),
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - File "/builds/worker/workspace/build/venv/lib/python2.7/site-packages/marionette_driver/marionette.py", line 1464, in get_url
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - return self._send_message("WebDriver:GetCurrentURL", key="value")
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - File "/builds/worker/workspace/build/venv/lib/python2.7/site-packages/marionette_driver/decorators.py", line 37, in _
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - m._handle_socket_failure()
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - File "/builds/worker/workspace/build/venv/lib/python2.7/site-packages/marionette_driver/marionette.py", line 712, in _handle_socket_failure
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - IOError, IOError(message.format(returncode=returncode, reason=exc)), tb
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - File "/builds/worker/workspace/build/venv/lib/python2.7/site-packages/marionette_driver/decorators.py", line 27, in _
[task 2020-10-28T02:43:51.415Z] 02:43:51 INFO - return func(*args, **kwargs)
[task 2020-10-28T02:43:51.416Z] 02:43:51 INFO - File "/builds/worker/workspace/build/venv/lib/python2.7/site-packages/marionette_driver/marionette.py", line 617, in _send_message
[task 2020-10-28T02:43:51.416Z] 02:43:51 INFO - msg = self.client.request(name, params)
[task 2020-10-28T02:43:51.416Z] 02:43:51 INFO - File "/builds/worker/workspace/build/venv/lib/python2.7/site-packages/marionette_driver/transport.py", line 288, in request
[task 2020-10-28T02:43:51.417Z] 02:43:51 INFO - return self.receive()
[task 2020-10-28T02:43:51.417Z] 02:43:51 INFO - File "/builds/worker/workspace/build/venv/lib/python2.7/site-packages/marionette_driver/transport.py", line 192, in receive
[task 2020-10-28T02:43:51.417Z] 02:43:51 INFO - "Connection timed out after {}s".format(self.socket_timeout)
[task 2020-10-28T02:43:51.417Z] 02:43:51 INFO - TEST-INFO took 445423ms

Depends on: 1519354

It's failing kinda lot over the last days, and it looks to be a regression. Can we please run a back-fill so we can see which changeset was causing it?

To fix it we will need bug 1669169 landed, which will at least fix it for the actor usage.

Depends on: 1669169
Flags: needinfo?(sheriffs)

Ok, so bug 1665404 made about:home a privileged site. The test that is failing here does the following:

It navigates to https://www.itisatrap.org/firefox/its-a-trap.html, and then clicks the get me out of here button of the navigation bar. That forces the page behind the pref browser.startup.homepage to be loaded. And that is about:home.

There is not that much we can do right now, except waiting for bug 1669169 to be landed. Hopefully this should happen in the next couple of days.

Dorel, how often does it fail? Would it be ok to wait those couple days, or should we mark the test as expected fail for now?

Flags: needinfo?(hskupin) → needinfo?(dluca)
Regressed by: 1665404

~50% of the Linux asan, Linux x64 debug and Windows 7 debug runs fail. Comment 4 sounds like there won't be a fix landed before the weekend. Please disable the tests for now.

Flags: needinfo?(dluca) → needinfo?(hskupin)
Assignee: nobody → hskupin
Status: NEW → ASSIGNED
Flags: needinfo?(hskupin)
Pushed by hskupin@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/8050b4336680 [firefox-ui] Temporarily skip test_notification.py for debug and asan builds on Linux and Windows. r=marionette-reviewers,maja_zf
Keywords: leave-open
Whiteboard: [stockwell disabled]
Assignee: hskupin → nobody
Status: ASSIGNED → NEW

As an update, we had 58 occurrences of this failure in the past week. It is now failing on Win7 and Win10 opt, MacOS opt and debug, Linux64 opt.
Should we move forward and disable this on all platforms until Bug 1669169 will be landed?

Flags: needinfo?(aryx.bugmail)
Flags: needinfo?(aryx.bugmail) → needinfo?(hskupin)

We are most likely going to enable actors by default for all builds tomorrow. So I would say - if it's ok with you - to not specifically disable it.

Flags: needinfo?(hskupin) → needinfo?(aryx.bugmail)

Given that we no longer have failures since Marionette switched to JSWindowActor by default, I pushed a try to get the skip patch backed out. Lets see what the results are:

https://treeherder.mozilla.org/jobs?repo=try&revision=5ad4f8bd933a204408d5303ae928e73de5343fcb

The try build is all fine. Sheriffs please backout the following changeset:
https://hg.mozilla.org/mozilla-central/rev/8050b4336680

Flags: needinfo?(sheriffs)
Assignee: nobody → hskupin
Status: NEW → ASSIGNED
Attachment #9184600 - Attachment is obsolete: true
Status: ASSIGNED → RESOLVED
Closed: 5 years ago
Resolution: --- → FIXED
Whiteboard: [stockwell disabled]
Target Milestone: --- → 84 Branch
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: