Closed Bug 634432 Opened 13 years ago Closed 13 years ago

bm-xserve21 has a bad drive?

Categories

(Infrastructure & Operations :: RelOps: General, task)

x86
macOS
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: zandr, Assigned: zandr)

References

Details

Machine has been extremely flakey, and per phong needs a new drive. Looking for the spec now.
FWIW, there is history of these x-serves getting corrupted file systems, which causes them to hang whenever you try to do serious I/O (ie a build). If the hardware diagnostics say the disk is sick then fair enough, but a reimage might be sufficient.
Group: core-security
Group: core-security
What is the status of this machine?  I can now ssh into it, but not sure if it has had the needed work done.
(In reply to comment #2)
> What is the status of this machine?  I can now ssh into it, but not sure if it
> has had the needed work done.

Phong kicked it yesterday in https://bugzilla.mozilla.org/show_bug.cgi?id=634368#c1

No further action beyond the filing of this bug has been taken.
cool.  I have taken this slave out of production to be cautious even though it has done a couple green builds.  If we want to put it back in the pool to see if it is fine after the reboot, it can be done by moving /builds/slave/buildbot.tac.off to /builds/slave/buildbot.tac and rebooting.
Assignee: server-ops-releng → zandr
I'm going to toss this into the preproduction pool while rolling out slavealloc.  We'll see how it does.
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
whoops, didn't mean to close this.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Seems to be running fine.  I'll move it to production.
Moved to the build-sjc1 pool.
Status: REOPENED → RESOLVED
Closed: 13 years ago13 years ago
Resolution: --- → FIXED
Component: Server Operations: RelEng → RelOps
Product: mozilla.org → Infrastructure & Operations
You need to log in before you can comment on or make changes to this bug.