Closed Bug 1860468 Opened 1 year ago Closed 1 year ago

Perma tier2 macosx1100-64-aarch64-qr TEST-UNEXPECTED-TIMEOUT | automation.py | application timed out after 370 seconds with no output

Categories

(Release Engineering :: Applications: MozharnessCore, defect, P5)

Tracking

(firefox120 fixed, firefox121 fixed)

RESOLVED FIXED

People

(Reporter: intermittent-bug-filer, Assigned: hneiva)

References

(Regression)

Details

(Keywords: intermittent-failure, regression)

Attachments

(2 files)

Filed by: nerli [at] mozilla.com
Parsed log: https://treeherder.mozilla.org/logviewer?job_id=433371853&repo=mozilla-central
Full log: https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/SMsbEeJzRz2n0J8duHKERg/runs/0/artifacts/public/logs/live_backing.log


[task 2023-10-22T23:52:28.753Z] 23:52:28     INFO - runtests.py | Waiting for browser...
[task 2023-10-22T23:52:28.754Z] 23:52:28     INFO - GECKO(1476) | 1698018748754	Marionette	DEBUG	Closed connection 1
[task 2023-10-22T23:52:30.726Z] 23:52:30     INFO - GECKO(1476) | [Parent 1476, IndexedDB #8] WARNING: QM_TRY failure (WARNING): 'ToResult(!madeProgress) failed with resultCode 0x80004005, resultName NS_ERROR_FAILURE', file dom/indexedDB/ActorsParent.cpp:7020
[task 2023-10-22T23:52:30.727Z] 23:52:30     INFO - GECKO(1476) | [Parent 1476, IndexedDB #8] WARNING: QM_TRY failure (WARNING): 'ToResult(!madeProgress) failed with resultCode 0x80004005, resultName NS_ERROR_FAILURE', file dom/indexedDB/ActorsParent.cpp:7020
[task 2023-10-22T23:52:30.727Z] 23:52:30     INFO - GECKO(1476) | [Parent 1476, IndexedDB #8] WARNING: QM_TRY failure (WARNING): 'ToResult(!madeProgress) failed with resultCode 0x80004005, resultName NS_ERROR_FAILURE', file dom/indexedDB/ActorsParent.cpp:7020
[task 2023-10-22T23:52:30.728Z] 23:52:30     INFO - GECKO(1476) | [Parent 1476, IndexedDB #8] WARNING: QM_TRY failure (WARNING): 'ToResult(!madeProgress) failed with resultCode 0x80004005, resultName NS_ERROR_FAILURE', file dom/indexedDB/ActorsParent.cpp:7020
[task 2023-10-22T23:52:30.729Z] 23:52:30     INFO - GECKO(1476) | [Parent 1476, IndexedDB #8] WARNING: QM_TRY failure (WARNING): 'ToResult(!madeProgress) failed with resultCode 0x80004005, resultName NS_ERROR_FAILURE', file dom/indexedDB/ActorsParent.cpp:7020
[task 2023-10-22T23:58:40.791Z] 23:58:40     INFO - TEST-UNEXPECTED-TIMEOUT | automation.py | application timed out after 370 seconds with no output
[task 2023-10-22T23:58:40.791Z] 23:58:40     INFO - TEST-INFO 
[task 2023-10-22T23:58:40.792Z] 23:58:40     INFO - Buffered messages finished
[task 2023-10-22T23:58:40.792Z] 23:58:40  WARNING - Force-terminating active process(es).
[task 2023-10-22T23:58:40.793Z] 23:58:40     INFO - Determining child pids from psutil...
Flags: needinfo?(jcristau)
Summary: Perma macosx1100-64-aarch64-qr TEST-UNEXPECTED-TIMEOUT | automation.py | application timed out after 370 seconds with no output → Perma tier2 macosx1100-64-aarch64-qr TEST-UNEXPECTED-TIMEOUT | automation.py | application timed out after 370 seconds with no output

[task 2023-10-17T22:18:05.702Z] 22:18:05 INFO - GECKO(4006) | dyld: Library not loaded: @rpath/libnss3.dylib
[task 2023-10-17T22:18:05.702Z] 22:18:05 INFO - GECKO(4006) | Referenced from: /opt/worker/tasks/task_169757806727819/build/application/Firefox NightlyDebug.app/Contents/MacOS/plugin-container.app/Contents/MacOS/plugin-container
[task 2023-10-17T22:18:05.702Z] 22:18:05 INFO - GECKO(4006) | Reason: no suitable image found. Did find:
[task 2023-10-17T22:18:05.702Z] 22:18:05 INFO - GECKO(4006) | /opt/worker/tasks/task_169757806727819/build/application/Firefox NightlyDebug.app/Contents/MacOS/plugin-container.app/Contents/MacOS/../../../libnss3.dylib: code signature in (/opt/worker/tasks/task_169757806727819/build/application/Firefox NightlyDebug.app/Contents/MacOS/plugin-container.app/Contents/MacOS/../../../libnss3.dylib) not valid for use in process using Library Validation: mapped file has no Team ID and is not a platform binary (signed with custom identity or adhoc?)
[task 2023-10-17T22:18:05.702Z] 22:18:05 INFO - GECKO(4006) | /opt/worker/tasks/task_169757806727819/build/application/Firefox NightlyDebug.app/Contents/MacOS/plugin-container.app/Contents/MacOS/../../../libnss3.dylib: stat() failed with errno=1
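
For triage, a log can be scanned for this signature. The following is a minimal sketch, not part of mozharness or Treeherder: the function name is hypothetical, and the two patterns are simply copied from the excerpt above.

```python
import re

# Patterns copied from the log excerpt above. The function name below is
# hypothetical and is not an existing mozharness/Treeherder API.
DYLD_LOAD_FAILURE = re.compile(r"dyld: Library not loaded: (?P<lib>\S+)")
LIBRARY_VALIDATION = "not valid for use in process using Library Validation"


def find_library_validation_failures(log_lines):
    """Return libraries that dyld failed to load and that were then
    rejected with a Library Validation message."""
    failed = []
    pending = None  # library named by the most recent "Library not loaded" line
    for line in log_lines:
        match = DYLD_LOAD_FAILURE.search(line)
        if match:
            pending = match.group("lib")
        elif pending and LIBRARY_VALIDATION in line:
            failed.append(pending)
            pending = None
    return failed
```

Passing the lines of live_backing.log to this function would list libnss3.dylib for the run quoted above, assuming the log keeps the same wording.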

Heitor, Haik, any idea what's going wrong with the NSS signature here? I'm surprised that this is intermittent. Could it be differences in test worker configs? (I can see that some pushes have a mix of green and busted tests.)

Flags: needinfo?(jcristau)
Flags: needinfo?(hneiva)
Flags: needinfo?(haftandilian)

More debugging is needed to confirm, but the root cause of the problem appears to be that we are signing debug builds with our self-signed "Mozilla Fake DMG" cert while also using the production entitlement configuration. With the fix for bug 1593072, our production entitlement configuration is no longer compatible with self-signing. Until we have improved our codesigning so that it does not use self-signed certs, we'll have to either 1) sign our Debug central builds with the official cert OR 2) change our Debug central builds to use the development entitlement configuration.

If this evaluation is correct, this problem should not apply to try builds.

To explain this failure: plugin-container is configured to not allow loading of third-party dylibs in production builds, but that setting is not compatible with self-signed certs. For our developer builds (which use self-signing), we opt out of that restriction. This incompatibility (where self-signed builds require the entitlement com.apple.security.cs.disable-library-validation in order to load a dylib that is also self-signed) apparently does not apply on our macOS 10.15 x64 runners: the test is passing on macOS 10.15 on x64 and the failures are on macOS 11 on Apple Silicon. One explanation could be that Apple Silicon or macOS 11 is stricter in this regard. Apple Silicon does have stricter codesigning requirements per documentation from Apple, so it wouldn't be surprising if that was the difference.
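
To check whether a given entitlements file opts out of library validation, something like the following could be used. This is a minimal sketch using Python's standard plistlib. The function name and the sample plist are illustrative assumptions, not our actual entitlement files. Only the entitlement key itself comes from the explanation above.

```python
import plistlib

# Entitlement key named in the comment above.
DISABLE_LIBRARY_VALIDATION = "com.apple.security.cs.disable-library-validation"

# Hypothetical sample, NOT a copy of any real Firefox entitlements file.
SAMPLE_ENTITLEMENTS = b"""<?xml version="1.0" encoding="UTF-8"?>
<plist version="1.0">
<dict>
  <key>com.apple.security.cs.disable-library-validation</key>
  <true/>
</dict>
</plist>"""


def allows_unvalidated_dylibs(entitlements_xml: bytes) -> bool:
    """Return True if the entitlements plist explicitly sets the
    disable-library-validation entitlement to true."""
    entitlements = plistlib.loads(entitlements_xml)
    return entitlements.get(DISABLE_LIBRARY_VALIDATION) is True


print(allows_unvalidated_dylibs(SAMPLE_ENTITLEMENTS))
```

If the theory above is right, a self-signed build whose plugin-container entitlements make this return False would be expected to hit the dyld failure on the Apple Silicon workers.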

macOS 11 (aka 10.16) is the first version to support Apple Silicon.

Regarding the solution, we do not sign non-official builds with production certs, to avoid any potential confusion or malicious behavior where a non-official build (for example, a try build) is passed off as official. We might be OK with doing so for debug builds of central, though.

I'll discuss more with Heitor.

Regressed by: 1593072
Assignee: nobody → hneiva
Flags: needinfo?(haftandilian)
Attachment #9359917 - Attachment description: Bug 1860468 - Use nonprod entitlements on debug build r?haik,#releng → Bug 1860468 - Use nonprod entitlements on non-shippable builds r?haik,#releng
Pushed by hneiva@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/65f4cd99ae3f Use nonprod entitlements on non-shippable builds r=haik,releng-reviewers,taskgraph-reviewers,bhearsum,jcristau
Status: NEW → RESOLVED
Closed: 1 year ago
Resolution: --- → FIXED
Pushed by mkmelin@iki.fi: https://hg.mozilla.org/comm-central/rev/cf559e2b9cc6 Adjust Thunderbird mac-signing config. rs=bustage-fix

Heitor, could this get an uplift request for beta, please?

Removes the direct relationship between hardened-signing type (production/developer) and project/release level. Keeping them separate should make it more explicit when to use production entitlements.
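
The intent can be sketched roughly as below. This is an illustration only, not the code from the patch: the function name and the boolean `shippable` parameter are assumptions, and the two return values are the production/developer types mentioned above.

```python
def hardened_sign_type(shippable: bool) -> str:
    """Pick the hardened-signing type from shippability alone.

    Hypothetical helper, not the taskgraph implementation. Project and
    release level are deliberately not inputs here, so a non-shippable
    (for example, debug) build on central gets developer entitlements.
    """
    return "production" if shippable else "developer"


# A non-shippable debug build is signed with developer entitlements.
print(hardened_sign_type(shippable=False))
```

Keeping the decision to a single explicit input is the point of the sketch: the entitlement choice no longer changes as a side effect of which branch the build runs on.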

Original Revision: https://phabricator.services.mozilla.com/D191705

Attachment #9362382 - Flags: approval-mozilla-beta?

Uplift Approval Request

  • String changes made/needed: None
  • Steps to reproduce for manual QE testing: Tests failing in CI without this patch
  • Needs manual QE test: no
  • Explanation of risk level: Only non-shippable builds were changed
  • User impact if declined: None
  • Fix verified in Nightly: yes
  • Risk associated with taking this patch: No foreseeable risk
  • Is Android affected?: no
  • Code covered by automated testing: no

Comment on attachment 9362382 [details]
Bug 1860468 - Use nonprod entitlements on non-shippable builds

Approved for 120.0b8

Attachment #9362382 - Flags: approval-mozilla-beta? → approval-mozilla-beta+
Flags: needinfo?(rob)
Flags: needinfo?(rob)
Flags: needinfo?(hneiva)