Closed Bug 670269 Opened 13 years ago Closed 8 years ago

Socorro - log monitoring inadequate

Categories

(Socorro :: General, task)

x86
Linux
task
Not set
normal

Tracking

(Not tracked)

RESOLVED WONTFIX

People

(Reporter: lars, Unassigned)

Details

Looking back through the processor logs, I can see that the HBase outage on Friday 2011-07-08 began at 11:30am, yet we got no warnings from Nagios until 2pm. If something had been monitoring the logs for the string CRITICAL, we could have caught this trouble two and a half hours earlier.
Any updates on this? Is there a bug on the actual outage as well?
The only update I have is that the 11:30am outage was actually brief: it lasted only about three minutes. Since the later multi-hour outage that day has an identified cause, this earlier glitch was likely unrelated.

However, I still stand by my request that we monitor the logs for CRITICAL errors...
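
For reference, the check requested here could be sketched roughly as follows. This is a minimal illustration, not Socorro's or Nagios's actual tooling: the function names are hypothetical, and it assumes the processor writes the level as the literal word CRITICAL on each affected log line, since the log format is not shown in this bug. The exit codes follow the usual Nagios plugin convention (0 for OK, 2 for CRITICAL).

```python
import re
import sys
from typing import Iterable, List, Tuple

# Assumption: the log level appears as the standalone word CRITICAL.
CRITICAL_PATTERN = re.compile(r"\bCRITICAL\b")


def find_critical_lines(lines: Iterable[str]) -> List[str]:
    """Return every line that contains the word CRITICAL."""
    return [line.rstrip("\n") for line in lines if CRITICAL_PATTERN.search(line)]


def nagios_status(lines: Iterable[str]) -> Tuple[int, str]:
    """Map the scan result to a (exit code, message) pair for a Nagios-style check."""
    hits = find_critical_lines(lines)
    if hits:
        message = "CRITICAL - %d log line(s) matched; latest: %s" % (len(hits), hits[-1])
        return 2, message
    return 0, "OK - no CRITICAL log lines found"


if __name__ == "__main__":
    # Hypothetical usage: pass the path of the log file to scan.
    if len(sys.argv) != 2:
        print("UNKNOWN - usage: check_log.py LOGFILE")
        sys.exit(3)
    with open(sys.argv[1], encoding="utf-8", errors="replace") as handle:
        code, text = nagios_status(handle)
    print(text)
    sys.exit(code)
```

A real check would also need to remember how far it had already read, so that an old CRITICAL line does not keep alerting on every run.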
Component: Socorro → General
Product: Webtools → Socorro
old and outdated
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → WONTFIX