Closed Bug 1724992 Opened 3 years ago Closed 3 years ago

Permanent macOS T-swr(s) Talos svgr software render PROCESS-CRASH | tscrollx | application crashed [unknown top frame] after Timeout waiting for test completion; killing browser... when Gecko 93 merges to Beta on 2021-09-06

Categories

(Core :: Graphics: WebRender, defect)

defect

Tracking

()

RESOLVED FIXED
Tracking Status
firefox-esr78 --- unaffected
firefox-esr91 --- unaffected
firefox91 --- unaffected
firefox92 --- unaffected
firefox93 + fixed

People

(Reporter: aryx, Unassigned)

References

(Regression)

Details

(Keywords: regression)

Tentatively from bug 1723665, will verify with a backout on Try but that takes 3h.

[task 2021-08-10T13:03:16.841Z] 13:03:16     INFO -  PID 1075 | Cycle 1(21): loaded http://127.0.0.1:49221/tests/scroll/reader.htm (next: http://127.0.0.1:49221/tests/scroll/tiled.html)
[task 2021-08-10T13:03:21.313Z] 13:03:21     INFO -  PID 1075 | Cycle 1(22): loaded http://127.0.0.1:49221/tests/scroll/reader.htm (next: http://127.0.0.1:49221/tests/scroll/tiled.html)
[task 2021-08-10T13:03:21.838Z] 13:03:21     INFO -  Timeout waiting for test completion; killing browser...
[task 2021-08-10T13:03:21.839Z] 13:03:21     INFO -  Killing psutil.Process(pid=1075, name='firefox', status='running', started='12:53:21') and writing a minidump file
Flags: needinfo?(gwatson)

I've asked for this to be backed out for now, as it caused another regression too. I'll investigate today.

Flags: needinfo?(gwatson)

Try push confirmed it's from bug 1723665. Thank you for investigating.

I found what's causing the failure, but still need to investigate further the underlying cause.

The issue is that the talos test is timing out (600 seconds) under this configuration and being killed by the test runner.

There's a few possible causes:

  • The patch severely regresses performance on this test. In theory it should have zero effect on performance, but it's possible something weird is going on. I'm running all the talos tests on other platforms with sw-wr / hw-wr and this patch to see if they have regressed at all.
  • Might be hitting something related to the issues in this test (https://bugzilla.mozilla.org/show_bug.cgi?id=1645275)
  • Might have been right on the edge of timing out before and this change subtly changes timing in a small way to go over the timeout.

The regression was reopened (patch backed out), so closing this regression.

Status: NEW → RESOLVED
Closed: 3 years ago
Resolution: --- → FIXED
Has Regression Range: --- → yes
You need to log in before you can comment on or make changes to this bug.