Closed Bug 644402 Opened 14 years ago Closed 12 years ago

Try harder to get a fast slave

Categories

(Release Engineering :: General, defect, P5)

x86
All
defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: nthomas, Unassigned)

Details

(Whiteboard: [automation])

When there are lots of checkins we are often getting VMs for win32 compiles, which are much slower to complete the job. We need to try harder to find a fast slave before falling back to a VM.

I think what is happening is:
* one of the masters polls the db for pending work (every minute)
* finds some
* checks slave availability
* doesn't have any fast builders, and so gives work to a VM

That makes sense if we lose a colo or something similarly disastrous, but not for normal operation. I think we need to add some sort of delay, so that work is only assigned to VMs after all the masters have had a chance to poll and assign it to an available fast slave.
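To make the failure mode concrete, here is a toy illustration of that polling behaviour (the class, method, and slave names are invented for this sketch and do not correspond to real buildbot code): each master only sees its own idle slaves, so the first master to poll may hand the request to a VM even though another master has a fast slave free.

    # Toy illustration of the behaviour described above; none of these names
    # correspond to real buildbot code.
    class Master:
        def __init__(self, name, fast_slaves, vm_slaves):
            self.name = name
            self.fast_slaves = list(fast_slaves)  # idle fast hardware slaves
            self.vm_slaves = list(vm_slaves)      # idle win32 VMs

        def poll(self, pending):
            # Runs roughly once a minute on each master.
            for request in list(pending):
                if self.fast_slaves:
                    slave = self.fast_slaves.pop()
                elif self.vm_slaves:
                    # No fast slave attached to *this* master, so the request
                    # goes to a VM right away, even though another master could
                    # have served it from a fast slave on its next poll.
                    slave = self.vm_slaves.pop()
                else:
                    continue
                pending.remove(request)
                print("%s: assigned %r to %s" % (self.name, request, slave))

    # Hypothetical example: the VM-only master polls first, so the build lands on a VM.
    master1 = Master("bm01", fast_slaves=[], vm_slaves=["win32-vm-42"])
    master2 = Master("bm02", fast_slaves=["w32-ix-slave03"], vm_slaves=[])
    pending = ["win32 build of rev abc123"]
    master1.poll(pending)   # assigned to win32-vm-42
    master2.poll(pending)   # nothing left to assign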
This is a dupe of another bug that catlee's working on, but I can't find it right now.
Perhaps you're thinking of bug 636101, but that's a bit different.
Priority: -- → P5
Whiteboard: [automation][scheduler]
Whiteboard: [automation][scheduler] → [automation]
From discussion with catlee, here's a sketch of how this might work. In _nextFastSlave (http://mxr.mozilla.org/build/source/buildbotcustom/misc.py#286) we can get the list of build requests for the builder (but not the actual request(s) looking for a slave). If we have a fast slave there's no change. If we only have slow slaves, then we only return one if the oldest request is older than some threshold (say 5 minutes). Otherwise we claim there are no slaves available.
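A minimal sketch of that idea in Python (not the actual buildbotcustom code; the function name, the slave classification, and the way the oldest request's submit time is obtained are all stand-ins for illustration):

    import random
    import time

    # Seconds a request may sit pending before a slow slave (VM) is acceptable;
    # the 5-minute figure is the threshold suggested above.
    SLOW_SLAVE_THRESHOLD = 5 * 60

    def _is_fast(slave):
        # Stand-in classification; the real code keys off lists of slave names
        # in buildbotcustom/misc.py. Here we assume the objects expose a
        # .slavename attribute and that VMs follow a naming convention.
        return not slave.slavename.startswith("win32-vm")

    def _nextFastSlaveWithDelay(builder, available_slaves, oldest_request_time=None):
        """Prefer a fast slave; fall back to a slow slave only once the oldest
        pending request for this builder has waited past SLOW_SLAVE_THRESHOLD.

        oldest_request_time is the submit time (epoch seconds) of the oldest
        pending request, looked up however the caller prefers (e.g. from the
        scheduler database); it is a hypothetical parameter for this sketch.
        """
        fast = [s for s in available_slaves if _is_fast(s)]
        if fast:
            # No behaviour change when a fast slave is free.
            return random.choice(fast)

        slow = [s for s in available_slaves if not _is_fast(s)]
        if not slow:
            return None

        if oldest_request_time is not None and \
                time.time() - oldest_request_time > SLOW_SLAVE_THRESHOLD:
            # The work has waited long enough; a VM is better than nothing.
            return random.choice(slow)

        # Claim nothing is available, so another master (or a fast slave
        # freeing up here) gets a chance to take the request on a later poll.
        return None

The key design point is the last branch: by returning None instead of a slow slave, the request stays pending and every master gets at least one more polling cycle to offer it a fast slave before a VM is used.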
Is this still an issue?
Product: mozilla.org → Release Engineering
We don't use Windows VMs any more; we use AWS instances for Linux instead. Bug 936222 is the most recent bug to handle this issue.
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED