Closed Bug 509700 Opened 15 years ago Closed 15 years ago

nagios checks for talos-master

Categories

(Infrastructure & Operations :: RelOps: General, task)

All
Other
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: catlee, Assigned: fox2mike)

References

Details

Please add the usual host checks on talos-master, as well as checking that two buildbot processes are running.
Blocks: 509281
oh, might as well do talos-staging-master at the same time.  same checks as talos-master.
Assignee: server-ops → dtran
Host check + buildbot checks added. talos-staging-master will only notify during business hours.
Status: NEW → RESOLVED
Closed: 15 years ago
Resolution: --- → FIXED
Sorry, should have been more explicit: added buildbot + generic service checks on these guys on bm-admin01 with build as the contact group. You'll need to install NRPE on these machines if you want Nagios to monitor disk/load.
(In reply to comment #3)
> Sorry, should have been more explicit: added buildbot + generic service checks
> on these guys on bm-admin01 with build as the contact group. You'll need to
> install NRPE on these machines if you want Nagios to monitor disk/load.

NRPE is running on them...
Reopening because the disk checks are incomplete. Both talos-master and talos-staging-master need a 'disk - /builds' and a 'root partition' check. You can use the size parameters that production-master has defined.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
I think dtran has finished his internship now, moving bug to fox2mike (our nagios hero).
Assignee: dtran → shyam
Haha, added /builds, / was already setup.
Status: REOPENED → RESOLVED
Closed: 15 years ago15 years ago
Resolution: --- → FIXED
Component: Server Operations: RelEng → RelOps
Product: mozilla.org → Infrastructure & Operations
You need to log in before you can comment on or make changes to this bug.