Closed Bug 1642032 Opened 4 years ago Closed 4 years ago

Perma Windows 10 MinGW TEST-UNEXPECTED-TIMEOUT | automation.py | application timed out after 370 seconds with no output

Categories

(Firefox Build System :: General, defect, P5)

defect

Tracking

(firefox-esr68 unaffected, firefox77 unaffected, firefox78 fixed, firefox79 fixed)

RESOLVED FIXED
mozilla79
Tracking Status
firefox-esr68 --- unaffected
firefox77 --- unaffected
firefox78 --- fixed
firefox79 --- fixed

People

(Reporter: intermittent-bug-filer, Assigned: froydnj)

References

(Blocks 1 open bug, Regression)

Details

(Keywords: intermittent-failure, regression, Whiteboard: [retriggered][stockwell disable-recommended])

Attachments

(3 files)

Filed by: ccoroiu [at] mozilla.com
Parsed log: https://treeherder.mozilla.org/logviewer.html#?job_id=304330247&repo=mozilla-central
Full log: https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/fta-7imvQa6ZnUaTWNsN7g/runs/0/artifacts/public/logs/live_backing.log


task 2020-05-29T22:23:41.957Z] 22:23:41 INFO - runtests.py | Waiting for browser...
[task 2020-05-29T22:26:41.309Z] 22:26:41 INFO - [6484, Main Thread] WARNING: NS_ENSURE_SUCCESS(rv, rv) failed with result 0x80004005 (NS_ERROR_FAILURE): file /builds/worker/checkouts/gecko/toolkit/components/places/Database.cpp, line 544
[task 2020-05-29T22:26:41.310Z] 22:26:41 INFO - [6484, Main Thread] WARNING: Unable to get a connection to vacuum database: file /builds/worker/checkouts/gecko/storage/VacuumManager.cpp, line 128
[task 2020-05-29T22:26:41.389Z] 22:26:41 INFO - [6484, IPDL Background] WARNING: '!gBasePath', file /builds/worker/checkouts/gecko/dom/quota/ActorsParent.cpp, line 3625
[task 2020-05-29T22:26:41.389Z] 22:26:41 INFO - [6484, IPDL Background] WARNING: profile-do-change must precede QuotaManager::GetOrCreate(): file /builds/worker/checkouts/gecko/dom/quota/ActorsParent.cpp, line 3626
[task 2020-05-29T22:26:41.389Z] 22:26:41 INFO - [6484, IPDL Background] WARNING: '!quotaManager', file /builds/worker/checkouts/gecko/dom/quota/ActorsParent.cpp, line 8592
[task 2020-05-29T22:26:51.612Z] 22:26:51 INFO - Buffered messages finished
[task 2020-05-29T22:26:51.612Z] 22:26:51 ERROR - TEST-UNEXPECTED-TIMEOUT | automation.py | application timed out after 370 seconds with no output
[task 2020-05-29T22:26:51.612Z] 22:26:51 ERROR - Force-terminating active process(es).
[task 2020-05-29T22:26:51.612Z] 22:26:51 INFO - Determining child pids from psutil...
[task 2020-05-29T22:26:51.622Z] 22:26:51 INFO - [10744]
[task 2020-05-29T22:26:51.622Z] 22:26:51 INFO - Found child pids: set([10744])
[task 2020-05-29T22:26:51.622Z] 22:26:51 INFO - Killing process: 10744
[task 2020-05-29T22:26:51.623Z] 22:26:51 INFO - TEST-INFO | started process screenshot
[task 2020-05-29T22:26:51.729Z] 22:26:51 INFO - TEST-INFO | screenshot: exit 0
[task 2020-05-29T22:26:51.729Z] 22:26:51 INFO - mozcrash Writing a dump to c:\users\task_1590789669\appdata\local\temp\tmpuadnsm.mozrunner\minidumps\2c4c79cb-4b23-4cf8-b6c9-f0fd5149097a.dmp for [10744]
[task 2020-05-29T22:26:51.730Z] 22:26:51 WARNING - mozcrash unable to create minidump file for pid 10744: 3
[task 2020-05-29T22:26:51.757Z] 22:26:51 INFO - psutil found pid 10744 dead
[task 2020-05-29T22:26:51.757Z] 22:26:51 INFO - Killing process: 4832
[task 2020-05-29T22:26:51.758Z] 22:26:51 INFO - Not taking screenshot here: see the one that was previously logged
[task 2020-05-29T22:26:51.758Z] 22:26:51 INFO - mozcrash Writing a dump to c:\users\task_1590789669\appdata\local\temp\tmpuadnsm.mozrunner\minidumps\9c9d2b37-9955-47e2-b379-771a9b9ce2f9.dmp for [4832]
[task 2020-05-29T22:26:51.758Z] 22:26:51 WARNING - mozcrash unable to create minidump file for pid 4832: 3
[task 2020-05-29T22:26:51.759Z] 22:26:51 WARNING - mozcrash kill_pid(): unable to terminate pid 4832: 5
[task 2020-05-29T22:26:51.764Z] 22:26:51 INFO - psutil found pid 4832 dead
[task 2020-05-29T22:26:51.783Z] 22:26:51 INFO - TEST-INFO | Main app process: exit 1
[task 2020-05-29T22:26:51.783Z] 22:26:51 INFO - Buffered messages finished
[task 2020-05-29T22:26:51.784Z] 22:26:51 ERROR - TEST-UNEXPECTED-FAIL | automation.py | application terminated with exit code 1
[task 2020-05-29T22:26:51.785Z] 22:26:51 INFO - runtests.py | Application ran for: 0:06:10.182000
[task 2020-05-29T22:26:51.785Z] 22:26:51 INFO - zombiecheck | Reading PID log: c:\users\task_1590789669\appdata\local\temp\tmpgu65t9pidlog
[task 2020-05-29T22:26:51.785Z] 22:26:51 INFO - Traceback (most recent call last):
[task 2020-05-29T22:26:51.786Z] 22:26:51 INFO - File "Z:\task_1590789669\build\tests\mochitest\runtests.py", line 2927, in doTests
[task 2020-05-29T22:26:51.786Z] 22:26:51 INFO - e10s=options.e10s
[task 2020-05-29T22:26:51.786Z] 22:26:51 INFO - File "Z:\task_1590789669\build\tests\mochitest\runtests.py", line 2428, in runApp
[task 2020-05-29T22:26:51.786Z] 22:26:51 INFO - six.reraise(exc, value, tb)
[task 2020-05-29T22:26:51.787Z] 22:26:51 INFO - File "Z:\task_1590789669\build\tests\mochitest\runtests.py", line 2345, in runApp
[task 2020-05-29T22:26:51.787Z] 22:26:51 INFO - self.marionette.start_session()
[task 2020-05-29T22:26:51.787Z] 22:26:51 INFO - File "Z:\task_1590789669\build\venv\lib\site-packages\marionette_driver\decorators.py", line 36, in _
[task 2020-05-29T22:26:51.788Z] 22:26:51 INFO - m._handle_socket_failure()
[task 2020-05-29T22:26:51.789Z] 22:26:51 INFO - File "Z:\task_1590789669\build\venv\lib\site-packages\marionette_driver\marionette.py", line 654, in _handle_socket_failure
[task 2020-05-29T22:26:51.789Z] 22:26:51 INFO - reraise(exc_cls, exc, tb)
[task 2020-05-29T22:26:51.789Z] 22:26:51 INFO - File "Z:\task_1590789669\build\venv\lib\site-packages\marionette_driver\decorators.py", line 26, in _
[task 2020-05-29T22:26:51.789Z] 22:26:51 INFO - return func(*args, **kwargs)
[task 2020-05-29T22:26:51.790Z] 22:26:51 INFO - File "Z:\task_1590789669\build\venv\lib\site-packages\marionette_driver\marionette.py", line 1112, in start_session
[task 2020-05-29T22:26:51.790Z] 22:26:51 INFO - self.raise_for_port(timeout=timeout)
[task 2020-05-29T22:26:51.790Z] 22:26:51 INFO - File "Z:\task_1590789669\build\venv\lib\site-packages\marionette_driver\marionette.py", line 573, in raise_for_port
[task 2020-05-29T22:26:51.790Z] 22:26:51 INFO - self.host, self.port))
[task 2020-05-29T22:26:51.791Z] 22:26:51 INFO - timeout: Timed out waiting for connection on 127.0.0.1:2828!
[task 2020-05-29T22:26:51.791Z] 22:26:51 ERROR - Automation Error: Received unexpected exception while running application

Has Regression Range: --- → yes
Keywords: regression

Sigh, this is so lame. Bug 1641640 apparently changed this so that the zip file no longer sets the executable mode bit on the file, so something winds up complaining. I guess this is a problem that we've had with other files, like the asan dlls we distribute, and we just didn't dig hard enough to figure this out?

I will look into this very early next week; somewhere in the install manifest code we should be copying mode bits and we are dropping them on the floor instead.

Assignee: nobody → nfroyd
Status: NEW → ASSIGNED
Flags: needinfo?(nfroyd)
Product: Testing → Firefox Build System

Actually, I take that back: the zip entries are exactly the same for the relevant file before and after the patch, modulo where they live in the archive. So we're still packaging the file and we didn't change anything about the file; why is the test falling over?

tjr, do you know if there is something peculiar with how the d3d compiler dll is handled in mingw?

Flags: needinfo?(tom)

Set release status flags based on info from the regressing bug 1641640

Blocks: 1633761
Flags: needinfo?(tom)

The problem is that d3dcompiler_47.dll is now added to dependentlibs.list. It wasn't before. Once removed from that file, the browser starts again.

Whiteboard: [retriggered][stockwell disable-recommended] → [retriggered][stockwell needswork:owner]
Whiteboard: [retriggered][stockwell disable-recommended] → [retriggered][stockwell needswork:owner]

We're going to reuse this machinery in a subsequent patch. I think this
change, by virtue of moving all the output to a single place, also makes
things clearer.

Such files are, at least as of this writing, only libraries, and we can
get by with installing them later. This change will also avoid having
them added to the dependent libraries list.

Depends on D78617

Pushed by nfroyd@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/a9239d026627
part 1 - factor out install targets for final target files; r=nalexander
https://hg.mozilla.org/integration/autoland/rev/c1628d8da86f
part 2 - install non-wildcard absolute FINAL_TARGET_FILES during misc; r=nalexander
Status: ASSIGNED → RESOLVED
Closed: 4 years ago
Resolution: --- → FIXED
Target Milestone: --- → mozilla79
Flags: needinfo?(nfroyd)

We moved the installation of shared libraries, such as
d3dcompiler_47.dll to the misc tier with the idea of making sure
they weren't taken into account during the computation of the
dependentlibs.list. Unfortunately, dependentlibs.list is itself
computed during misc, so we have an unfortunate race condition where
sometimes libraries might wind up in the list and sometimes they won't.

For this particular case, we can follow what we already do for ICU and
ignore the library, especially since we're dynamically loading the
library anyway.

(In reply to Narcis Beleuzu [:NarcisB] from comment #24)

:froydnj , it`s still failing. Can you please take a look?
https://treeherder.mozilla.org/logviewer.html#/jobs?job_id=305806116&repo=mozilla-central&lineNumber=1149

Sorry, apparently my try push won some races during the build, but our normal builds lost the race. I've posted a new patch.

Flags: needinfo?(nfroyd)
Pushed by nfroyd@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/7bc89fd27221
ignore d3dcompiler_47.dll in dependentlibs determination; r=firefox-build-system-reviewers,rstewart

== Change summary for alert #26222 (as of Mon, 15 Jun 2020 10:43:24 GMT) ==

Improvements:

3% tp5n main_startup_fileio windows7-32-shippable opt e10s stylo 115,299,781.67 -> 111,603,684.58
3% tp5n main_startup_fileio windows7-32-shippable opt e10s stylo 115,011,309.83 -> 111,537,300.67

For up to date results, see: https://treeherder.mozilla.org/perf.html#/alerts?id=26222

Do we need to uplift this to Beta for 78 given that it's the next ESR release?

Flags: needinfo?(nfroyd)

Comment on attachment 9154802 [details]
Bug 1642032 - part 1 - factor out install targets for final target files; r=#build

ESR Uplift Approval Request

  • If this is not a sec:{high,crit} bug, please state case for ESR consideration: Fixes some perma-failures on mingw, which seems good for Tor, and also claws back some startup regressions.
  • User impact if declined: Slower startup.
  • Fix Landed on Version: 79
  • Risk to taking this patch: Low
  • Why is the change risky/not risky? (and alternatives if risky): These patches are not risky; they tweak only how files are installed and do not generally affect functionality of the browser. We clearly have tests that exercise the functionality of the files installed. =D
  • String or UUID changes made by this patch: None
Flags: needinfo?(nfroyd)
Attachment #9154802 - Flags: approval-mozilla-esr78?
Attachment #9154803 - Flags: approval-mozilla-esr78?
Attachment #9155693 - Flags: approval-mozilla-esr78?

Comment on attachment 9154802 [details]
Bug 1642032 - part 1 - factor out install targets for final target files; r=#build

Fixes mingw permafails that will affect ESR78. Approved for 78.0b9.

Attachment #9154802 - Flags: approval-mozilla-esr78? → approval-mozilla-esr78+
Attachment #9154802 - Flags: approval-mozilla-esr78+ → approval-mozilla-beta+
Attachment #9154803 - Flags: approval-mozilla-beta+
Attachment #9155693 - Flags: approval-mozilla-esr78? → approval-mozilla-beta+
Attachment #9154803 - Flags: approval-mozilla-esr78?
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: