Closed
Bug 992020
Opened 11 years ago
Closed 11 years ago
LSI Raid on backup1.private.lon1.mozilla.com is CRITICAL: CHECK_NRPE: Socket timeout after 20 seconds.
Categories
(mozilla.org Graveyard :: Server Operations: MOC, task)
Tracking
(Not tracked)
RESOLVED
WONTFIX
People
(Reporter: nagiosapi, Unassigned)
References
()
Details
(Whiteboard: [id=nagios1.private.scl3.mozilla.com:329160])
Automated alert report from nagios1.private.scl3.mozilla.com:
Hostname: backup1.private.lon1.mozilla.com
Service: LSI Raid
State: CRITICAL
Output: CHECK_NRPE: Socket timeout after 20 seconds.
Runbook: http://m.allizom.org/LSI+Raid
Comment 1•11 years ago
|
||
[20:17:56]
nagios-scl3: Thu 20:17:56 PDT [5016] backup1.private.lon1.mozilla.com:
LSI Raid is WARNING: WARNING: 0:0:RAID-6:16 drives:8.571GB:Optimal 0:1:RAID-6:16 drives:1.809TB:Optimal Drives:16 online(6378 Errors) (http://m.mozilla.org/LSI+Raid)
Reporter | ||
Comment 2•11 years ago
|
||
Automated alert acknowledgement: (ashlee)Bug 992020
Status: NEW → ASSIGNED
Reporter | ||
Comment 3•11 years ago
|
||
Automated alert acknowledgement: (ericz)Bug 1004302
Reporter | ||
Comment 4•11 years ago
|
||
Automated alert acknowledgement: (ashish)Bug 983145
Reporter | ||
Comment 5•11 years ago
|
||
Automated alert acknowledgement: (w0ts0n)looking
Reporter | ||
Comment 6•11 years ago
|
||
Automated alert acknowledgement: (dgarvey)bug 983145
Comment 7•11 years ago
|
||
NRPE timeouts and other bugs from nagios that we can't do much about anyway.
Status: ASSIGNED → RESOLVED
Closed: 11 years ago
Resolution: --- → WONTFIX
Assignee | ||
Updated•10 years ago
|
Product: mozilla.org → mozilla.org Graveyard
Comment 8•10 years ago
|
||
Thu 23:26:24 PST [5308] backup1.private.lon1.mozilla.com:LSI Raid is CRITICAL: CRITICAL: 0:0:RAID-6:16 drives:8.571GB:Partially 0:1:RAID-6:16 drives:1.809TB:Partially Drives:16 online1 Bad Drives (4 Errors)
What do we do when it alerts, ludo?
Flags: needinfo?(ludovic)
Comment 9•10 years ago
|
||
Last time I checked with AJ and RBryce they said we couldn't do much about the error numbers. I think we should work as a group to figure out waht we want to do with these kinds of alerts. Bring it on at the next meeting.
Flags: needinfo?(ludovic)
You need to log in
before you can comment on or make changes to this bug.
Description
•