Replication failed today and caused some troubles for QA and the AMO team. Can we monitor this in nagios? Thanks
Let's just do business hour monitors on this.
wfm. <scopecreep>Can nagios idle in #amo and give us failure/recovery notices? That could help QA know when things are dead or not.
Here's what I need to complete this bug: https://intranet.mozilla.org/SysAdmin/index.php/Nagios
Jabba helped me put this in. We changed Nagios configs to add this check, verified it works. Waiting for config to propogate to cluster.
Status: NEW → RESOLVED
Last Resolved: 8 years ago
Resolution: --- → FIXED
Product: mozilla.org → mozilla.org Graveyard
You need to log in before you can comment on or make changes to this bug.