Open Bug 1945658 Opened 1 year ago Updated 11 hours ago

[meta] Artifact builds produce too many perma-failures to be useful right now

Categories

(Testing :: General, task, P3)

task

Tracking

(Not tracked)

ASSIGNED

People

(Reporter: Gijs, Assigned: florian, NeedInfo)

References

(Depends on 1 open bug)

Details

(Keywords: meta)

Attachments

(5 files)

This type of thing is not atypical.

Push health says 145 failures. Only a handful of those are intermittents. 4 tests perma-fail cross-platform, accounting for about 40 failures total, and another 16 Windows a11y tests fail on Windows (which is the only place they are being run), for another 64 failures, tracked in bug 1885239. I think some of the remainder is even duplicate errors from the same failures (e.g. due to crashes via asserts or aborted runs due to too many failures).

Seeing the wood for the trees becomes very hard in this situation, especially if you don't push regularly or don't always use artifact builds.

The deps (I'll file a few more) will hopefully be sufficient to deal with the current set of issues. However, I'd like to avoid coming back to this place. But I'm not 100% sure how we'd do that.

Aryx, do you have ideas? I know we run artifact builds on central, but I don't think we run tests against them - is doing a tier-2 type test run and then noticing this type of thing when it lands a possibility? Or do you think we'd still miss that a given test has gone perma-fail in artifact mode?

Flags: needinfo?(aryx.bugmail)
Depends on: 1945662
Depends on: 1945663
Depends on: 1948777
Depends on: 1948780
Severity: -- → N/A
Depends on: 1953425

Gijs, is this meta and needinfo still relevant after 3 months and the many dependencies landed, or could the remaining issue be moved to its own bug?

Flags: needinfo?(gijskruitbosch+bugs)

All the deps are closed so from that PoV we could close. I would really like to find a more durable solution that avoids the problem recurring, though...

Flags: needinfo?(gijskruitbosch+bugs)

Lets keep this open just in case to find a better solution

Flags: needinfo?(aryx.bugmail)
Priority: -- → P3

(In reply to Johannes from comment #3)

Lets keep this open just in case to find a better solution

Let's keep the needinfo for Aryx, then...

Flags: needinfo?(aryx.bugmail)

This came up in General triage with 23 days after needinfo. Do we need to do anything with this bug at this point?

(In reply to Stephen Thompson [:sthompson] from comment #5)

This came up in General triage with 23 days after needinfo. Do we need to do anything with this bug at this point?

I'd like a more permanent way of making sure that artifact builds don't descend into an orange-fest, thence the needinfo for Aryx.

I've pushed a blank trypush to see how bad the sitch is at the moment: https://treeherder.mozilla.org/jobs?repo=try&landoCommitID=137313 .

Depends on: 1756409
Depends on: 1974095

The Bugbug bot thinks this bug should belong to the 'Core::Disability Access APIs' component, and is moving the bug to that component. Please correct in case you think the bot is wrong.

Component: General → Disability Access APIs
Product: Firefox → Core
Component: Disability Access APIs → General
Product: Core → Firefox
Depends on: 2012613

Moving to Testing::General since it sounds like we're considering to add artifact build test failures as tier 2 failures.

Product: Firefox → Testing

Run mochitest-browser-chrome, mochitest-chrome, mochitest-devtools-chrome,
mochitest-plain, and xpcshell tests against artifact builds to catch
perma-failures before they accumulate.

On Linux and Windows, these jobs run on mozilla-central and backstop pushes
on autoland. On macOS, they are try-only to preserve limited machine
capacity.

Assignee: nobody → florian
Status: NEW → ASSIGNED

These debug artifact builds are needed so that we can run tier 2 tests
against them, matching what we already do for opt artifact builds.

The artifact-build kind had keep-artifacts: false, which prevented it from
uploading the build outputs that test jobs need to download.

The attached patch stack adds tier2 test jobs on both opt and debug artifact builds on mozilla-central. That should be enough for sheriffs to notice new perma-fails and file bugs on them. I didn't schedule the jobs on autoland to limit the extra cost.

Attachment #9566998 - Attachment description: Bug 1945658 - Add tier 2 test jobs for artifact builds on Linux, Windows, and macOS. r=ahal → Bug 1945658 - Add tier 2 test jobs for artifact builds on Linux, Windows. r=ahal
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: