Closed Bug 1838648 Opened 2 years ago Closed 1 year ago

Reduce scratch disks for GCP pools

Categories

(Release Engineering :: Firefox-CI Administration, task)

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: ahal, Assigned: jcristau)

References

(Blocks 1 open bug)

Details

Attachments

(7 files, 1 obsolete file)

We realized that most of our GCP pools have way too many scratch disks. We likely only need a single disk (or the minimum for the given instance type) per pool.

More information on valid # of SSDs by instance type here:
https://cloud.google.com/compute/docs/disks/local-ssd

Attachment #9339337 - Attachment is obsolete: true

The extra scratch disks make it more likely we run out of quota and can't spin up as many workers as we'd like, so they might be hurting our end-to-end times. We should double check whether they really help performance.

Blocks: 1797804

Stealing this.

Assignee: ahal → jcristau
Keywords: leave-open
QA Contact: michelle

This was copied over from the corresponding aws pool, but is not true in
GCP, since each scratch disk is 375GB.

These only run short/simple tasks, they don't need 4x375GB scratch
space, even with capacityPerInstance: 8.

The linux-gcp workers for the ci-t, code-analysis-{1,3}, nss-{1,3},
releng-{t,1,3} and taskgraph-t groups are likely not IO bound and can do
with a single scratch disk.

Pushed by jcristau@mozilla.com: https://hg.mozilla.org/ci/ci-configuration/rev/78b4facf0ee2 remove obsolete comment about misc-gcp pool's disk space. r=releng-reviewers,ahal https://hg.mozilla.org/ci/ci-configuration/rev/a478824bf8da reduce scratch disks for misc-gcp pools. r=releng-reviewers,ahal https://hg.mozilla.org/ci/ci-configuration/rev/93607eccd3ae reduce scratch disks for */linux-gcp pools. r=releng-reviewers,ahal https://hg.mozilla.org/ci/ci-configuration/rev/8d5943ec810e reduce scratch disks in gecko-1/b-linux-gcp-bug1797804-n2 pool. r=releng-reviewers,ahal https://hg.mozilla.org/ci/ci-configuration/rev/76a8ee6ac1c7 reduce scratch disks in mobile-{1,3}/b-linux-large-gcp pools. r=ahal https://hg.mozilla.org/ci/ci-configuration/rev/dfd80dcda744 Add a check to avoid unnecessary GCP SSDs, r=releng-reviewers,gabriel

Error out if we define a pool with too few scratch disks for the machine type.

Pushed by jcristau@mozilla.com: https://hg.mozilla.org/ci/ci-configuration/rev/1dee85457c9c check against number of scratch disks smaller than the minimum. r=ahal

Let's call this one done and we can have further improvements in separate bugs.

Status: ASSIGNED → RESOLVED
Closed: 1 year ago
Resolution: --- → FIXED
See Also: → 1887975
Blocks: 1904900
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: