sumotools1.webapp.phx1: Add nagios check for "input spikes"

Status

RESOLVED INVALID
Opened 5 years ago
Updated 3 years ago

People

(Reporter: atoll, Unassigned)

Details

(Whiteboard: :Moc)

Per bug 933334#c7, we should add a Nagios check that checks the contents of a certain file and pages certain people in certain ways at certain times. This would be for the host sumotools1.webapp.phx1.

Based on the current process (send an email when things are bad, otherwise don't send an email), this should probably be a logfile check. I would advise that whatever process currently sends emails on CRITICAL should additionally append a single line to a logfile somewhere. That way the Nagios check can simply take that line and emit it via IRC/email/SMS.
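A minimal sketch of what such a logfile check could look like as a Nagios plugin, assuming the alerting process appends one line per CRITICAL event. The file paths and the names `check_log`, `read_offset`, and `main` are hypothetical; the bug does not name the real logfile. The check remembers how far it read on the previous run and reports only lines appended since then, using the standard Nagios plugin exit codes.

```python
#!/usr/bin/env python3
"""Sketch of a Nagios logfile check (hypothetical paths and names)."""
import os
import sys

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3

# Hypothetical locations; the bug does not say where the real files live.
LOG_PATH = "/var/log/input-spikes.log"
STATE_PATH = "/var/tmp/check_input_spikes.offset"


def read_offset(state_path):
    """Return the byte offset saved by the previous run, or 0 if none."""
    try:
        with open(state_path) as fh:
            return int(fh.read().strip() or 0)
    except (OSError, ValueError):
        return 0


def check_log(log_path, state_path):
    """Return (exit_code, message) for lines appended since the last run."""
    try:
        size = os.path.getsize(log_path)
    except OSError as exc:
        return UNKNOWN, f"UNKNOWN: cannot read {log_path}: {exc}"

    offset = read_offset(state_path)
    if offset > size:  # log was rotated or truncated, so start over
        offset = 0

    with open(log_path, "rb") as fh:
        fh.seek(offset)
        data = fh.read()

    # Consume only complete lines; a half-written line waits for the next run.
    complete = data[: data.rfind(b"\n") + 1]
    with open(state_path, "w") as fh:
        fh.write(str(offset + len(complete)))

    text = complete.decode("utf-8", "replace")
    lines = [ln for ln in text.splitlines() if ln.strip()]
    if not lines:
        return OK, "OK: no new input spike entries"
    # Nagios notifications carry the first output line, so lead with the latest entry.
    return CRITICAL, f"CRITICAL: {lines[-1]} (new entries: {len(lines)})"


def main():
    """Print the status line and return the exit code for Nagios."""
    code, message = check_log(LOG_PATH, STATE_PATH)
    print(message)
    return code
```

Run as a plugin, the script would end with `sys.exit(main())`. Whether the result goes out by IRC, email, or SMS would be decided by the Nagios contact and notification configuration, not by the check itself.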

Should Nagios deliver alerts on IRC, by email, and/or by SMS? (needinfo :cww)

Does the above logfile check sound reasonable? (needinfo :ashish)
Flags: needinfo?(cwwmozilla)
Flags: needinfo?(ashish)
See Also: → bug 933334

Comment 1

5 years ago
Alerts via email. We can have it sent to a mailing list which subscribers can be added to.
Flags: needinfo?(cwwmozilla)
Sorry for the late response, but a logfile check sounds doable. Not sure if the application already does that, but do let me know once that is set up and we can fix up the Nagios side of things.
Flags: needinfo?(ashish)
Whiteboard: :Moc

Updated

4 years ago
Group: infra
Component: Server Operations → MOC: Service Requests
Product: mozilla.org → Infrastructure & Operations
QA Contact: shyam → lypulong
Just checking in - is this request still relevant?

Updated

4 years ago
Assignee: server-ops → nobody

Updated

3 years ago
Flags: needinfo?(rsoderberg)
I'm going to say no for now, since I can't discern what log entries I cared about in 2013. Sorry for the noise!
Status: NEW → RESOLVED
Last Resolved: 3 years ago
Flags: needinfo?(rsoderberg)
Resolution: --- → INVALID