After the gaia bumper runs, it touches /builds/gaia_bumper/gaia_bumper.stamp. Can we add a Nagios check on buildbot-master66.srv.releng.usw2.mozilla.com that alerts when this file gets too old? The bumper is currently set to run every 5 minutes, so Nagios should alert once the file is more than 15 minutes old.
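For illustration, the requested behaviour could be sketched roughly as follows. This is not the "file_age" NRPE plugin or the stock check_file_age; the function name `check_file_age`, the `missing_ok` flag, and the message format are hypothetical. Only the 15-minute threshold and the stamp path come from this bug, and the exit codes follow the usual Nagios plugin convention (0 OK, 2 CRITICAL, 3 UNKNOWN).

```python
import os
import time

# Conventional Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def check_file_age(path, critical_seconds=15 * 60, missing_ok=False, now=None):
    """Return (status, message) for the age of `path`.

    Hypothetical helper: `missing_ok` is an assumption about how a
    "not exists is OK" variant might behave, not the real plugin's logic.
    """
    now = time.time() if now is None else now
    try:
        mtime = os.stat(path).st_mtime
    except FileNotFoundError:
        if missing_ok:
            return OK, f"OK: {path} does not exist"
        return CRITICAL, f"CRITICAL: {path} does not exist"
    except OSError as exc:
        return UNKNOWN, f"UNKNOWN: cannot stat {path}: {exc}"

    age = now - mtime
    if age > critical_seconds:
        return CRITICAL, f"CRITICAL: {path} is {int(age)}s old"
    return OK, f"OK: {path} is {int(age)}s old"
```

A wrapper script would call `check_file_age("/builds/gaia_bumper/gaia_bumper.stamp")`, print the message, and exit with the returned status so that NRPE can report it.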
I'm ready to add this check, but it requires the "file_age" NRPE plugin to be realised on the server; the default check_file_age takes a different set of parameters. Thanks! :arr For reference, check_file_age_ok_not_exists is the check_command I'm considering (it is provided by file_age via NRPE).
I refer you back to catlee for making any changes to AWS systems.
NRPE on this machine is managed by Puppet. Do we have a recipe for deploying the "file_age" NRPE plugin with Puppet?
Co-ordinated with :catlee on IRC and worked this out. Check has been added: https://nagios.mozilla.org/releng-scl3/cgi-bin/extinfo.cgi?type=2&host=buildbot-master66.srv.releng.usw2.mozilla.com&service=File+Age+-+%2Fbuilds%2Fgaia_bumper%2Fgaia_bumper.stamp Please reopen this bug if there are any false alerts. Thanks!