Closed Bug 992020 Opened 11 years ago Closed 11 years ago

LSI Raid on backup1.private.lon1.mozilla.com is CRITICAL: CHECK_NRPE: Socket timeout after 20 seconds.

Categories

(mozilla.org Graveyard :: Server Operations: MOC, task)

Tracking

(Not tracked)

RESOLVED WONTFIX

People

(Reporter: nagiosapi, Unassigned)

Details

(Whiteboard: [id=nagios1.private.scl3.mozilla.com:329160])

Automated alert report from nagios1.private.scl3.mozilla.com:
Hostname: backup1.private.lon1.mozilla.com
Service: LSI Raid
State: CRITICAL
Output: CHECK_NRPE: Socket timeout after 20 seconds.
Runbook: http://m.allizom.org/LSI+Raid
[20:17:56] nagios-scl3: Thu 20:17:56 PDT [5016] backup1.private.lon1.mozilla.com: LSI Raid is WARNING: WARNING: 0:0:RAID-6:16 drives:8.571GB:Optimal 0:1:RAID-6:16 drives:1.809TB:Optimal Drives:16 online(6378 Errors) (http://m.mozilla.org/LSI+Raid)
Automated alert acknowledgement: (ashlee)Bug 992020
Status: NEW → ASSIGNED
Automated alert acknowledgement: (ericz)Bug 1004302
Automated alert acknowledgement: (ashish)Bug 983145
Automated alert acknowledgement: (w0ts0n)looking
Automated alert acknowledgement: (dgarvey)bug 983145
Closing NRPE timeouts and other bugs from Nagios that we can't do much about anyway.
Status: ASSIGNED → RESOLVED
Closed: 11 years ago
Resolution: --- → WONTFIX
Product: mozilla.org → mozilla.org Graveyard
Thu 23:26:24 PST [5308] backup1.private.lon1.mozilla.com:LSI Raid is CRITICAL: CRITICAL: 0:0:RAID-6:16 drives:8.571GB:Partially 0:1:RAID-6:16 drives:1.809TB:Partially Drives:16 online1 Bad Drives (4 Errors)
What do we do when it alerts, ludo?
Flags: needinfo?(ludovic)
Last time I checked with AJ and RBryce, they said we couldn't do much about the error numbers. I think we should work as a group to figure out what we want to do with these kinds of alerts. Bring it up at the next meeting.
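As a possible starting point for that discussion, the error and bad-drive counts could be pulled out of the check output so they can be compared over time. The following is only a rough sketch: the function name `parse_lsi_output` and the regular expressions are hypothetical, and the field layout is inferred solely from the two alert lines quoted in this bug, not from the plugin's source or documentation.

```python
import re

# All patterns are assumptions inferred from the two sample outputs in this bug.
STATE_RE = re.compile(r"^(OK|WARNING|CRITICAL|UNKNOWN):")
ARRAY_RE = re.compile(
    r"(\d+):(\d+):(RAID-\d+):(\d+) drives:([\d.]+[KMGT]B):(\w+)"
)
ERRORS_RE = re.compile(r"\((\d+) Errors\)")
BAD_RE = re.compile(r"(\d+) Bad Drives")


def parse_lsi_output(output):
    """Split one LSI Raid check output string into its apparent fields."""
    output = output.strip()
    state = STATE_RE.match(output)
    errors = ERRORS_RE.search(output)
    bad = BAD_RE.search(output)
    arrays = [
        {
            "adapter": int(m.group(1)),
            "array": int(m.group(2)),
            "level": m.group(3),
            "drives": int(m.group(4)),
            "size": m.group(5),
            "status": m.group(6),
        }
        for m in ARRAY_RE.finditer(output)
    ]
    return {
        "state": state.group(1) if state else None,
        "arrays": arrays,
        # A missing count is treated as zero; that is a guess, not plugin behavior.
        "errors": int(errors.group(1)) if errors else 0,
        "bad_drives": int(bad.group(1)) if bad else 0,
    }


if __name__ == "__main__":
    sample = (
        "CRITICAL: 0:0:RAID-6:16 drives:8.571GB:Partially "
        "0:1:RAID-6:16 drives:1.809TB:Partially "
        "Drives:16 online1 Bad Drives (4 Errors)"
    )
    print(parse_lsi_output(sample))
```

With fields separated like this, a policy such as "page only when a drive is bad, and just log a rising error count" would at least be straightforward to express, though whether that is the right policy is still the open question here.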
Flags: needinfo?(ludovic)