IPMI Log on admin1a.private.tpe1.mozilla.com is CRITICAL: CRITICAL - 11 -- 11/01/2014 -- 20:12:57 -- Temperature #0x12 -- Upper Non-critical going high -- Asserted

RESOLVED FIXED

Status

RESOLVED FIXED
4 years ago
2 years ago

People

(Reporter: nagiosapi, Unassigned)

Details

(Whiteboard: [id=nagios1.private.scl3.mozilla.com:458865])

(Reporter)

Description

4 years ago
Automated alert report from nagios1.private.scl3.mozilla.com:

Hostname: admin1a.private.tpe1.mozilla.com
Service:  IPMI Log
State:    CRITICAL
Output:   CRITICAL -   11 -- 11/01/2014 -- 20:12:57 -- Temperature #0x12 -- Upper Non-critical going high -- Asserted

Runbook:  http://m.allizom.org/IPMI+Log
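
For anyone triaging similar alerts, the Output line above appears to be a state prefix followed by one IPMI event log entry with fields separated by " -- ". Below is a minimal Python sketch that splits such a line into named fields. The field names (record_id, sensor, event, direction) and the function name parse_ipmi_log_output are assumptions made for illustration; they are not taken from the Nagios check itself, and the layout is inferred only from the single alert in this bug.

```python
import re
from typing import NamedTuple, Optional


class SelAlert(NamedTuple):
    """One event log entry as it appears in the alert output (field names are assumed)."""
    state: str      # Nagios state prefix, e.g. "CRITICAL"
    record_id: str  # kept as text; the numeric base is not stated in the alert
    date: str       # kept as text; the day/month order is not stated in the alert
    time: str
    sensor: str
    event: str
    direction: str


# State prefix, a single dash, then the " -- " separated entry.
_PREFIX = re.compile(r"^(?P<state>[A-Z]+)\s*-\s*(?P<fields>.+)$")


def parse_ipmi_log_output(output: str) -> Optional[SelAlert]:
    """Return the parsed entry, or None if the line does not have the assumed layout."""
    match = _PREFIX.match(output.strip())
    if match is None:
        return None
    fields = [part.strip() for part in match.group("fields").split(" -- ")]
    if len(fields) != 6:
        return None
    return SelAlert(match.group("state"), *fields)


if __name__ == "__main__":
    line = ("CRITICAL -   11 -- 11/01/2014 -- 20:12:57 -- Temperature #0x12 "
            "-- Upper Non-critical going high -- Asserted")
    print(parse_ipmi_log_output(line))
```

Recovery lines such as "OK: IPMI Log OK" do not match the assumed layout, so the function returns None for them rather than guessing.
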
ipmitool lists the system temperature at 69 degrees Celsius. I'm going to fail the admin host over to admin1b.
keepalived failed over to admin1b.private.tpe1.   

admin1a has been shut down to hopefully avoid damaging the hardware. No other temperature sensor alarms are present at the moment.
(Reporter)

Comment 3

4 years ago
Automated alert recovery:

Hostname: admin1a.private.tpe1.mozilla.com
Service:  IPMI Log
State:    OK
Output:   OK: IPMI Log OK
Status: NEW → RESOLVED
Last Resolved: 4 years ago
Resolution: --- → FIXED
Component: MOC: Incidents → MOC: Problems
Product: Infrastructure & Operations → Infrastructure & Operations