Closed Bug 616407 Opened 14 years ago Closed 14 years ago

Can't access cb-xserve01

Categories

(Infrastructure & Operations :: RelOps: General, task)

x86
macOS
task
Not set
critical

Tracking

(Not tracked)

VERIFIED FIXED

People

(Reporter: alqahira, Assigned: phong)

Details

This evening cb-xserve01 went perma-yellow on our tinderbox page, and I can't SSH to it (I get a "ssh_exchange_identification: Connection closed by remote host" response).  I also can't ping it from the jumphost.

Can someone please investigate (and restart, if necessary) cb-xserve01?  Thanks!
Flags: colo-trip+
Assignee: server-ops → phong
Any ETA on the next colo trip (it's been almost a week now since this bug was filed)?
Phong, Matthew? Don't make me come upstairs tomorrow!
Severity: normal → critical
rebooted and back online.
Status: NEW → RESOLVED
Closed: 14 years ago
Resolution: --- → FIXED
I started Tinderbox on the machine and it seems to have immediately frozen up (I can't type anything over SSH, and I'm not getting any UI reaction via VNC).
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
I suspect, based on the symptoms, that we've probably experienced bug 513718 comment 4 again and the box will need re-imaging again :( (using the 10.4 Xserve image).
Do you want to upgrade to 10.5.2 like the rest of the releng machines?  If it needs to be 10.4, then I can find the older image.
It needs to be 10.4 for our purposes.
I know the all-hands is this week, but what's the ETA on this? Early next week?
Can you send me the logon credentials for this server?  I'll try to get to it today.
Any ETA on this yet?  I know Sam sent the credentials via email the week before Christmas.
heading there this afternoon.
re-imaged with 10.4.8 image.
Status: REOPENED → RESOLVED
Closed: 14 years ago14 years ago
Resolution: --- → FIXED
Thanks, Phong!  We're building again :)

There's a firmware update that wants to be applied; I opened bug 623798 about it, so that when someone is in the colo again for whatever reason, the update can be done as a ride-along.
Status: RESOLVED → VERIFIED
Component: Server Operations: RelEng → RelOps
Product: mozilla.org → Infrastructure & Operations
You need to log in before you can comment on or make changes to this bug.