Closed Bug 593574 Opened 14 years ago Closed 14 years ago

dp-dxr01 blade failure

Categories

(mozilla.org Graveyard :: Server Operations, task)

All
Other
task
Not set
critical

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: justdave, Assigned: phong)

Details

(Whiteboard: HP: 4620192021)

dp-dxr01 suddenly dropped off the net this morning.
04:12:03 <@nagios-phx> [141] dp-dxr01.phx is DOWN: PING CRITICAL - Packet loss = 100%

The HPOA is showing the entire blade as failed, and no amount of E-Fuse resets seems to bring it back.

If it's not something catesrophic, a physical remove and re-insert may do the trick.
Flags: colo-trip+
Open a case with Core TSI to get the blade reset:

http://www.coretsi.com/CORE_Assist_Service_Desk.html

(there is not quick colo-trip)
I don't appear to have an account on there and I don't see an obvious link to sigh up.  How do I get access?
coretsi has been emailed.
Assignee: server-ops → justdave
The blade is DOA, tech can't get it to power up.  We swapped drives with a spare to get DXR back online.

https://inventory.mozilla.org/systems/edit/1587/ is the broken one, now in slot 10 on the chassis.

https://inventory.mozilla.org/systems/show/1508/ is the formerly-unused one we swapped the drives into, now in slot 11.
Assignee: justdave → phong
Whiteboard: HP: 4620192021
system board replace and it's back to being a spare machine.
Status: NEW → RESOLVED
Closed: 14 years ago
Resolution: --- → FIXED
Product: mozilla.org → mozilla.org Graveyard
You need to log in before you can comment on or make changes to this bug.