Closed Bug 723299 Opened 13 years ago Closed 13 years ago

hg outage - Feb 1 2012

Categories

(mozilla.org Graveyard :: Server Operations, task)

x86
macOS
task
Not set
blocker

Tracking

(Not tracked)

RESOLVED INCOMPLETE

People

(Reporter: jhford, Assigned: bkero)

Details

[46] dm-hg02:http - hg.mozilla.org is CRITICAL: CRITICAL - Socket timeout after 10 seconds
nagios-sjc1
[48] dm-hg02:https_cert - hg.mozilla.org is CRITICAL: CRITICAL - Socket timeout after 10 seconds
nagios-sjc1
[50] dm-svn02:health is CRITICAL: CHECK_NRPE: Socket timeout after 30 seconds.
nagios-sjc1
[52] dm-hg02:https - hg.mozilla.org is CRITICAL: CRITICAL - Socket timeout after 10 seconds
nagios-sjc1
[54] dm-svn02:root partition is CRITICAL: CHECK_NRPE: Socket timeout after 60 seconds.

Looks like HG is down.
Summary: HG outtage Feb 1 2012 → hg outage - Feb 1 2012
This was a general host failure of dm-(svn|hg)02.  The machine has been rebooted and services are coming back online.
Assignee: server-ops → bkero
I've checked the log files, and cannot find anything indicating a failure, or even a graceful shutdown.  This along with the following description lead me to believe it was disk read failure.

14:51 <@justdave> I was using the java console
14:51 <@justdave> there was a banner and a login prompt, and it echoed back when I typed in "root<enter>"
14:51 <@justdave> and then never gave me a password prompt, and never re-issued the login  prompt
I'm afraid we weren't able to receive any information as to why this happened.  All ILO and system logs told us no clues about why this happened.  If this problem occurs again we can investigate replacing the hardware.
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → INCOMPLETE
Product: mozilla.org → mozilla.org Graveyard
You need to log in before you can comment on or make changes to this bug.