Closed Bug 1984622 Opened 2 months ago Closed 2 months ago

Perma damp | netmonitor/custom.js: Test timed out | single tracking bug

Categories

(Testing :: Talos, defect, P5)

Tracking

(firefox-esr128 unaffected, firefox-esr140 unaffected, firefox142 unaffected, firefox143 unaffected, firefox144 fixed)

RESOLVED FIXED
144 Branch

People

(Reporter: intermittent-bug-filer, Unassigned)

References

(Regression)

Details

(Keywords: intermittent-failure, regression)

Filed by: chorotan [at] mozilla.com
Parsed log: https://treeherder.mozilla.org/logviewer?job_id=523573281&repo=mozilla-central&task=cb3IQHaTRzaL6shobpIMdw.0
Full log: https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/cb3IQHaTRzaL6shobpIMdw/runs/0/artifacts/public/logs/live_backing.log


[task 2025-08-22T08:56:49.335+00:00] 08:56:49     INFO -  PID 2494 | [waitForAllRequestsFinished] Waiting for -1399 / 45 requests
[task 2025-08-22T08:56:50.064+00:00] 08:56:50     INFO -  PID 2494 | [waitForAllRequestsFinished] Waiting for -1402 / 45 requests
[task 2025-08-22T08:56:50.079+00:00] 08:56:50     INFO -  PID 2494 | Received pageshow event for 77
[task 2025-08-22T08:56:50.079+00:00] 08:56:50     INFO -  PID 2494 | Wait for pending paints on 'custom.netmonitor.bigdatarequests.navigate'
[task 2025-08-22T08:56:50.079+00:00] 08:56:50     INFO -  PID 2494 | 'custom.netmonitor.bigdatarequests.navigate.settle.DAMP' took 0.06278399997972883ms.
[task 2025-08-22T08:56:50.445+00:00] 08:56:50     INFO -  PID 2494 | [waitForAllRequestsFinished] Waiting for -1405 / 45 requests
[task 2025-08-22T08:56:50.446+00:00] 08:56:50     INFO -  PID 2494 | [waitForAllRequestsFinished] Waiting for -1408 / 45 requests
[task 2025-08-22T08:56:50.446+00:00] 08:56:50     INFO -  PID 2494 | [waitForAllRequestsFinished] Waiting for -1411 / 45 requests
[task 2025-08-22T08:56:50.446+00:00] 08:56:50     INFO -  PID 2494 | [waitForAllRequestsFinished] Waiting for -1414 / 45 requests
[task 2025-08-22T08:56:50.446+00:00] 08:56:50     INFO -  PID 2494 | [waitForAllRequestsFinished] Waiting for -1417 / 45 requests
[task 2025-08-22T08:56:50.446+00:00] 08:56:50     INFO -  PID 2494 | [waitForAllRequestsFinished] Waiting for -1420 / 45 requests
[task 2025-08-22T09:01:44.489+00:00] 09:01:44     INFO -  PID 2494 | TEST-UNEXPECTED-FAIL | damp | netmonitor/custom.js: Test timed out
[task 2025-08-22T09:01:44.497+00:00] 09:01:44     INFO -  PID 2494 | [DampLoad helper] Unregister DampLoad actors
[task 2025-08-22T10:01:44.554+00:00] 10:01:44     INFO - Automation Error: mozharness timed out after 3600 seconds running ['/opt/worker/tasks/task_175585157930752/build/venv/bin/python', '/opt/worker/tasks/task_175585157930752/build/tests/talos/talos/run_tests.py', '--executablePath', '/opt/worker/tasks/task_175585157930752/build/application/Firefox Nightly.app/Contents/MacOS/firefox', '--suite', 'damp-other', '--title', 'macmini-r8-5', '--symbolsPath', 'https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/JzUn1UQjT0qNLZQmHBLQCQ/artifacts/public/build/target.crashreporter-symbols.zip', '--project', 'mozilla-central', '--screenshot-on-failure', '--setpref=talos.damp.suite=other', '--setpref=media.peerconnection.mtransport_process=false', '--setpref=network.process.enabled=false', '--setpref=layers.d3d11.enable-blacklist=false', '--log-tbpl-level=info', '--log-errorsummary=/opt/worker/tasks/task_175585157930752/build/blobber_upload_dir/damp-other_errorsummary.log']
[task 2025-08-22T10:01:44.558+00:00] 10:01:44     INFO - Return code: -9
[task 2025-08-22T10:01:44.558+00:00] 10:01:44  WARNING - setting return code to -9
[task 2025-08-22T10:01:44.559+00:00] 10:01:44  WARNING - setting return code to 2
[task 2025-08-22T10:01:44.559+00:00] 10:01:44     INFO - Running post-action listener: _package_coverage_data
[task 2025-08-22T10:01:44.559+00:00] 10:01:44     INFO - Running post-action listener: _resource_record_post_action
[task 2025-08-22T10:01:44.559+00:00] 10:01:44     INFO - Running post-action listener: process_java_coverage_data
[task 2025-08-22T10:01:44.559+00:00] 10:01:44     INFO - [mozharness: 2025-08-22 10:01:44.559316Z] Finished run-tests step (success)
[task 2025-08-22T10:01:44.559+00:00] 10:01:44     INFO - Running post-run listener: _resource_record_post_run
[task 2025-08-22T10:01:47.423+00:00] 10:01:47     INFO - Total resource usage - Wall time: 5260s; CPU: 6%; Read bytes: 3850698752; Write bytes: 3710353408; Read time: 103739; Write time: 8742
[task 2025-08-22T10:01:47.423+00:00] 10:01:47     INFO - TinderboxPrint: CPU usage<br/>6.3%
[task 2025-08-22T10:01:47.423+00:00] 10:01:47     INFO - TinderboxPrint: I/O read bytes / time<br/>3,850,698,752 / 103,739
[task 2025-08-22T10:01:47.423+00:00] 10:01:47     INFO - TinderboxPrint: I/O write bytes / time<br/>3,710,353,408 / 8,742
[task 2025-08-22T10:01:47.423+00:00] 10:01:47     INFO - TinderboxPrint: CPU idle<br/>59,060.0 (93.7%)
[task 2025-08-22T10:01:47.423+00:00] 10:01:47     INFO - TinderboxPrint: CPU system<br/>1,667.6 (2.6%)
[task 2025-08-22T10:01:47.423+00:00] 10:01:47     INFO - TinderboxPrint: CPU user<br/>2,311.8 (3.7%)
[task 2025-08-22T10:01:47.423+00:00] 10:01:47     INFO - TinderboxPrint: Swap in / out<br/>806,707,200 / 0

Julian, can you please take a look?

Flags: needinfo?(jdescottes)
Keywords: regression
Regressed by: 1983496

Looking, it seems we are getting too many requests and never resolve in a specific test. I am surprised this would only regress on macOS after Bug 1983496, so I am not ruling out a regression from something else at this point.

It also seems to fail quite late in the test, and knowing that we run the same tests many times, this seems to indicate we hit an intermittent issue consistently enough to perma-fail the job.

We can also see ~1500 requests finishing in this test while we expect ~50 (the log reports 45 expected). It doesn't really match the previous page (which has 2000 requests), but it surely doesn't match the page we are supposed to be on.
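
To make the negative counts in the log concrete, here is a minimal sketch of the arithmetic only. It is not the actual DAMP helper: `describeWait` is a hypothetical function, and the "done only at exactly zero" rule is an assumption used to show how an overshooting counter could keep a wait from ever completing.

```javascript
// Hypothetical model of the counter behind the log lines; not the real DAMP code.
function describeWait(expected, finished) {
  // Remaining goes negative as soon as more requests finish than were expected.
  const remaining = expected - finished;
  return {
    remaining,
    logLine: `[waitForAllRequestsFinished] Waiting for ${remaining} / ${expected} requests`,
    // Assumption for illustration: the wait completes only at exactly zero remaining.
    done: remaining === 0,
  };
}

// 1444 finished requests against 45 expected gives the -1399 seen in the log excerpt.
const overshoot = describeWait(45, 1444);
console.log(overshoot.logLine);
console.log(overshoot.done);
```

Under that assumed rule, a counter that starts past the expected total, or jumps over it, never reports done, which would be consistent with the test running until the harness timeout.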

Set release status flags based on info from the regressing bug 1983496

Summary: Perma damp | netmonitor/custom.js: Test timed out → Perma damp | netmonitor/custom.js: Test timed out | single tracking bug

I have a few try pushes in progress, but since it's on mac, they might take a lot of time to get scheduled: https://treeherder.mozilla.org/jobs?repo=try&author=jdescottes%40mozilla.com&fromchange=0433dbbdc5670053a45f7434b0cd60b900ec4854&tochange=4c82e1a7cbc1cd977affdd9e863ccf3cd7b8af56

In the meantime I can't reproduce this locally.

Flags: needinfo?(jdescottes)

:bomsy, since you are the author of the regressor, bug 1983496, could you take a look?


Flags: needinfo?(hmanilla)
Status: NEW → RESOLVED
Closed: 2 months ago
Resolution: --- → FIXED
Regressed by: 1830230
No longer regressed by: 1983496
Flags: needinfo?(hmanilla)
Target Milestone: --- → 144 Branch