Closed Bug 1092603 Opened 11 years ago Closed 11 years ago

IPMI Log on admin1a.private.tpe1.mozilla.com is CRITICAL: CRITICAL - 11 -- 11/01/2014 -- 20:12:57 -- Temperature #0x12 -- Upper Non-critical going high -- Asserted

Categories

(Infrastructure & Operations :: MOC: Problems, task)


Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: nagiosapi, Unassigned)

Details

(Whiteboard: [id=nagios1.private.scl3.mozilla.com:458865])

Automated alert report from nagios1.private.scl3.mozilla.com:
Hostname: admin1a.private.tpe1.mozilla.com
Service: IPMI Log
State: CRITICAL
Output: CRITICAL - 11 -- 11/01/2014 -- 20:12:57 -- Temperature #0x12 -- Upper Non-critical going high -- Asserted
Runbook: http://m.allizom.org/IPMI+Log
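
For anyone triaging similar alerts, the Output string above can be split into its parts. The following is a minimal sketch, not the actual check plugin's code: the names `SelAlert` and `parse_ipmi_log_output` are hypothetical, the field meanings (record ID, date, time, sensor, event, direction) are inferred from this one alert, and the date is assumed to be MM/DD/YYYY.

```python
from datetime import datetime
from typing import NamedTuple


class SelAlert(NamedTuple):
    """Fields of one IPMI Log alert line (names are illustrative, not official)."""
    state: str
    record_id: str      # kept as a string; the alert does not say if it is hex or decimal
    timestamp: datetime
    sensor: str
    event: str
    direction: str


def parse_ipmi_log_output(output: str) -> SelAlert:
    """Split 'STATE - id -- date -- time -- sensor -- event -- direction'."""
    state, sep, rest = output.partition(" - ")
    if not sep:
        raise ValueError(f"no state prefix in: {output!r}")
    fields = [field.strip() for field in rest.split(" -- ")]
    if len(fields) != 6:
        raise ValueError(f"expected 6 fields, got {len(fields)}: {output!r}")
    record_id, date, time, sensor, event, direction = fields
    # Assumption: the date is MM/DD/YYYY, which matches when this bug was filed.
    timestamp = datetime.strptime(f"{date} {time}", "%m/%d/%Y %H:%M:%S")
    return SelAlert(state.strip(), record_id, timestamp, sensor, event, direction)


alert = parse_ipmi_log_output(
    "CRITICAL - 11 -- 11/01/2014 -- 20:12:57 -- Temperature #0x12 -- "
    "Upper Non-critical going high -- Asserted"
)
print(alert.sensor, "|", alert.event, "|", alert.direction)
```

Here the parsed event shows that sensor `Temperature #0x12` crossed its upper non-critical threshold, which is a warning-level threshold rather than a critical one.
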
ipmitool lists the system temp at 69 Celsius. I'm going to fail the admin host over to admin1b.
keepalived failed over to admin1b.private.tpe1. admin1a was shut down to hopefully avoid damaging the hardware. No other temperature sensor alarms are present at the moment.
Automated alert recovery:
Hostname: admin1a.private.tpe1.mozilla.com
Service: IPMI Log
State: OK
Output: OK: IPMI Log OK
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Component: MOC: Incidents → MOC: Problems