Closed Bug 1021003 Opened 10 years ago Closed 10 years ago

Enable check_stop_idle nagios check for aws-manager1.srv.releng.scl3.mozilla.com

Categories

(mozilla.org Graveyard :: Server Operations, task)

x86_64
Linux
task
Not set
major

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: rail, Assigned: ashish)

References

Details

The nrpe config is in place on that host:

$ cat  /etc/nagios/nrpe.d/check_stop_idle.cfg 
command[check_stop_idle]=/usr/lib64/nagios/plugins/check_file_age -w 600 -c 1800 -f /builds/aws_manager/aws_stop_idle.log
Assignee: relops → server-ops
Component: RelOps → Server Operations
Product: Infrastructure & Operations → mozilla.org
QA Contact: arich → shyam
pink :)
Rail,

I don't see this box in puppet(aws-manager1.srv.releng.scl3.mozilla.com)? Is it a different puppet master? Sorry new guy here...
It's managed by https://wiki.mozilla.org/ReleaseEngineering/PuppetAgain

All needed nrpe entry points are set up on that host in bug 1001416, nagios should just start querying them.

Bug 861359 looks a bit similar to this one.
Can we bump the priority of this bug? It hang yesterday and caused a tree closure. :(
Severity: normal → major
Assignee: server-ops → ashish
Status: NEW → ASSIGNED
:dgarvey - I'm gonna grab this because of severity bump.

I've added this to Nagios. IMO this check could reuse check_file_age in NRPE (see buildbot-masters) for the sake of consistency.

https://nagios.mozilla.org/releng-scl3/cgi-bin/extinfo.cgi?type=2&host=aws-manager1.srv.releng.scl3.mozilla.com&service=check_stop_idle
Status: ASSIGNED → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Thanks!
Product: mozilla.org → mozilla.org Graveyard
You need to log in before you can comment on or make changes to this bug.