Closed Bug 593889 Opened 15 years ago Closed 15 years ago

Network problem in Castro - all linux and windows minis down

Categories

(mozilla.org Graveyard :: Server Operations, task)

x86
All
task
Not set
major

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: nthomas, Assigned: jabba)

Details

Nagios reports that t-r3-w764-*, talos-r3-fed-*, talos-r3-fed64-*, talos-r3-w7-*, talos-r3-xp-* are failing PING checks. No reports of problems with talos-r3-snow-* or talos-r3-leopard-*.
Assignee: server-ops → shyam
Assignee: shyam → dmoore
The Castro office experienced a power failure at 16:00 on Monday. All Mac minis in the office would have been impacted, and we have yet to address the lack of restore-on-power-failure for the Linux and Windows platforms. I'm heading to the office now to manually power on the minis that I can find. ETA one hour.
Just finished the first pass, and it looks like about 90% are back online. I'll try to pick up a few more stragglers, but we probably have a few casualties that jabba/jlazaro can address on Tuesday.
Bouncing over to jabba, please update this bug with a list of minis which need further attention.
Assignee: dmoore → jdow
Severity: blocker → major
Thanks Derek. Nagios says the missing slaves are: t-r3-w764-001 talos-r3-w7-008
Those two hosts are back online now.
Status: NEW → RESOLVED
Closed: 15 years ago
Resolution: --- → FIXED
Product: mozilla.org → mozilla.org Graveyard
You need to log in before you can comment on or make changes to this bug.