Closed Bug 1581194 Opened 6 years ago Closed 6 years ago

[scheduling] Consider changing SETA to run the full set of tasks every 10th push or 2 hours

Categories

(Firefox Build System :: Task Configuration, task)

task
Not set
normal

Tracking

(firefox73 fixed)

RESOLVED FIXED
mozilla73
Tracking Status
firefox73 --- fixed

People

(Reporter: ahal, Assigned: ahal)

References

(Blocks 1 open bug)

Details

Attachments

(2 files)

We currently do this every 5th push or 1 hour.

I have a recipe that analyzes the effects of this change. Results below:

Scheduler Total Tasks Primary Backouts Secondary Backouts Secondary Backout Rate Scheduler Efficiency
seta_10_120 447160 68 42 0.38 0.59
baseline 677090 77 43 0.36 0.41

This data still has a few bugs in it I need to iron out*. But essentially it means that since Aug 29th, this change would reduce the total tasks we run on autoland/inbound by around 34% while only very slightly increasing the secondary backout rate.

The secondary backout rate is the percentage of backouts where none of the failing tasks ran on the offending push (i.e a sheriff needed to backfill them to find the regression).

In other words, this change could provide a substantial decrease in the number of tasks we run, at the cost of slightly more work to the sheriffs.

* The bugs are that sometimes downloading the seta_10_120 artifact is failing for some reason, so the number of tasks is artificially low. So the actual percentage decrease is lower than 34% (but still substantial). I'll update the numbers when I have this fixed. It only happens on a small number of pushes.

** There may also be bugs in the methodology here (e.g how a secondary backout is calculated). This is all very new and unproven. Though I do think there is at least a signal here that making this change might be a good idea.

Hi Sebastian, this change isn't imminent or anything but wanted to get your opinion on it. Aside from fixing the data, do you have any reservations about this change? Anything you'd like to see before it lands?

Flags: needinfo?(aryx.bugmail)

Sorry for the delayed reply. It looks good to ship.

Concerns:

  • Less likely to have a merge candidate because there are fewer => sometimes delays in shipping changes with Nightly, they will catch the next one instead.
  • Sometimes the frequency with which a failure can be observed contributes to its detection (e.g. intermittent or existed as an intermittent before).

Actions:

  • Treeherder's Backfill and Custom Action > Backfill commands need the default values changed to run on the 10 (better: 9) previous jobs

Questions:

  • Is there an estimation available how much the total testing costs will decrease due to that?

Thank you for working on the load reduction.

Flags: needinfo?(aryx.bugmail)
Assignee: nobody → ahal

We'll need to fix the treeherder backfill button before landing this, but just wanted to get a patch ready in the meantime.

:armenzg, can you coordinate getting the treeherder backfill button to do 9 jobs (instead of 4 or 5 now)?

Flags: needinfo?(armenzg)
Depends on: 1599898

This should cover all the pushes between the ones that scheduled all tasks.

Depends on D55020

Depends on: 1600135

This is an in-tree request. The patch has been reviewed and will be landed next week.

Flags: needinfo?(armenzg)
Pushed by ahalberstadt@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/ad90e3772a31 Change SETA to run all tasks every 10th push or two hours, r=jmaher https://hg.mozilla.org/integration/autoland/rev/b487505e9ebe [ci.backfill] Change default backfill depth from 5 to 9, r=armenzg
Status: NEW → RESOLVED
Closed: 6 years ago
Resolution: --- → FIXED
Target Milestone: --- → mozilla73
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: