Automated alert report from nagios1.private.scl3.mozilla.com: Hostname: admin1a.private.tpe1.mozilla.com Service: IPMI Log State: CRITICAL Output: CRITICAL - 11 -- 11/01/2014 -- 20:12:57 -- Temperature #0x12 -- Upper Non-critical going high -- Asserted Runbook: http://m.allizom.org/IPMI+Log
ipmitool lists the system temp @ 69 Celcius. Im going to fail the admin host over to admin1b.
keepalived failed over to admin1b.private.tpe1. admin1a shutdown to hopefully not damage the hardware. No other temprature sensor alarms present at the moment.
Automated alert recovery: Hostname: admin1a.private.tpe1.mozilla.com Service: IPMI Log State: OK Output: OK: IPMI Log OK
Status: NEW → RESOLVED
Last Resolved: 4 years ago
Resolution: --- → FIXED
Component: MOC: Incidents → MOC: Problems
Product: Infrastructure & Operations → Infrastructure & Operations
You need to log in before you can comment on or make changes to this bug.