Closed Bug 1092603 Opened 10 years ago Closed 10 years ago

IPMI Log on admin1a.private.tpe1.mozilla.com is CRITICAL: CRITICAL - 11 -- 11/01/2014 -- 20:12:57 -- Temperature #0x12 -- Upper Non-critical going high -- Asserted

Categories

(Infrastructure & Operations :: MOC: Problems, task)

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: nagiosapi, Unassigned)

Details

(Whiteboard: [id=nagios1.private.scl3.mozilla.com:458865])

Automated alert report from nagios1.private.scl3.mozilla.com:

Hostname: admin1a.private.tpe1.mozilla.com
Service:  IPMI Log
State:    CRITICAL
Output:   CRITICAL -   11 -- 11/01/2014 -- 20:12:57 -- Temperature #0x12 -- Upper Non-critical going high -- Asserted

Runbook:  http://m.allizom.org/IPMI+Log
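
For anyone triaging these alerts by script, the Output line above can be split into its fields. The following is a minimal sketch, not the check's actual implementation: the names `SelAlert` and `parse_ipmi_log_output` are hypothetical, and it assumes the fields are always separated by ` -- `, that there are exactly six of them, and that the date is in MM/DD/YYYY form.

```python
from datetime import datetime
from typing import NamedTuple, Optional


class SelAlert(NamedTuple):
    """One IPMI Log alert line, split into fields (hypothetical type)."""
    state: str          # Nagios state prefix, e.g. "CRITICAL"
    record_id: str      # SEL record ID, kept as text (number base is not assumed)
    timestamp: datetime
    sensor: str         # e.g. "Temperature #0x12"
    event: str          # e.g. "Upper Non-critical going high"
    direction: str      # e.g. "Asserted"


def parse_ipmi_log_output(output: str) -> Optional[SelAlert]:
    """Parse an alert Output line; return None if it does not match the layout."""
    # The state prefix is separated from the entry by the first " - ".
    state, sep, rest = output.partition(" - ")
    if not sep:
        return None
    fields = [field.strip() for field in rest.split(" -- ")]
    if len(fields) != 6:
        return None
    record_id, date, time, sensor, event, direction = fields
    try:
        # Assumption: dates are MM/DD/YYYY, as in the alert above.
        timestamp = datetime.strptime(f"{date} {time}", "%m/%d/%Y %H:%M:%S")
    except ValueError:
        return None
    return SelAlert(state.strip(), record_id, timestamp, sensor, event, direction)


if __name__ == "__main__":
    line = ("CRITICAL -   11 -- 11/01/2014 -- 20:12:57 -- Temperature #0x12 "
            "-- Upper Non-critical going high -- Asserted")
    print(parse_ipmi_log_output(line))
```

Recovery lines such as `OK: IPMI Log OK` do not follow this layout, so the function returns `None` for them.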
ipmitool reports the system temperature at 69 Celsius. I'm going to fail the admin host over to admin1b.
keepalived failed over to admin1b.private.tpe1.

admin1a has been shut down to avoid damaging the hardware. No other temperature sensor alarms are present at the moment.
Automated alert recovery:

Hostname: admin1a.private.tpe1.mozilla.com
Service:  IPMI Log
State:    OK
Output:   OK: IPMI Log OK
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Component: MOC: Incidents → MOC: Problems