please add load monitoring to konigsberg (nagios?)

Status

Product: mozilla.org Graveyard
Component: Server Operations
Status: VERIFIED FIXED
Reported: 7 years ago
Modified: 3 years ago

People

(Reporter: timeless, Assigned: fox2mike)

(Reporter)

Description

7 years ago
Load was around 40 last night because of runaway genxref processes that had piled up since the 14th. As each one stalled, cron kept starting more.

I killed them about 7 hours ago and cron was nice enough to send me (timeless) email about the failures.
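
One way to keep cron from stacking new runs on top of stuck ones is a non-blocking lock around the job. This is only a sketch: the helper name `run_exclusively` and the lock path are hypothetical, and nothing in this bug says genxref is actually wrapped this way on konigsberg.

```python
import fcntl
import subprocess
import sys


def run_exclusively(lock_path, command):
    """Run command only if no other holder has the lock file locked.

    Returns the command's exit status, or None when a previous run
    still holds the lock and this run was skipped.
    """
    with open(lock_path, "w") as lock_file:
        try:
            # LOCK_NB makes flock fail immediately instead of waiting.
            fcntl.flock(lock_file, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except BlockingIOError:
            return None
        # The lock is released when lock_file is closed on leaving the block.
        return subprocess.call(command)


if __name__ == "__main__":
    # Hypothetical usage: python guard.py /tmp/genxref.lock genxref <args>
    status = run_exclusively(sys.argv[1], sys.argv[2:])
    if status is None:
        print("previous run still active, skipping", file=sys.stderr)
        sys.exit(1)
    sys.exit(status)
```

With a guard like this, a skipped run exits non-zero, so cron would still send mail about it rather than quietly adding another process.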

It'd be nice if nagios or something could have sent me email by the 17th (before ehsan filed a bug on the 18th complaining that the xref's were stale).

I think load warnings at 20 should do.
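
For illustration, the check being asked for could look roughly like the sketch below, written against the standard Nagios plugin exit codes (0 OK, 1 WARNING, 2 CRITICAL). The warning level of 20 is the one suggested above; the critical level of 40 and the function name `classify_load` are assumptions made for the example, not what IT actually deployed.

```python
import os
import sys

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL = 0, 1, 2


def classify_load(load1, warn=20.0, crit=40.0):
    """Map a 1-minute load average to a (exit_code, message) pair.

    warn=20 comes from this bug; crit=40 is an assumed value.
    """
    if load1 >= crit:
        return CRITICAL, f"LOAD CRITICAL - load average: {load1:.2f}"
    if load1 >= warn:
        return WARNING, f"LOAD WARNING - load average: {load1:.2f}"
    return OK, f"LOAD OK - load average: {load1:.2f}"


def main():
    # os.getloadavg() returns the 1, 5 and 15 minute load averages.
    load1, _, _ = os.getloadavg()
    code, message = classify_load(load1)
    print(message)
    return code


if __name__ == "__main__":
    sys.exit(main())
```

Whether a WARNING turns into email or a page is decided by the notification settings on the Nagios side, not by the check itself.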

Errors here shouldn't generate high-priority pages for normal IT. Someone working the normal day shift should be able to poke me. I'll leave contact info in this bug.
(Reporter)

Comment 2

7 years ago
Email can of course be delivered to me via gmail.com or however cron found me (which works).
(Assignee)

Comment 3

7 years ago
SMS to non-US numbers is not possible; it's too expensive. Email notifications for konigsberg.nl will go to timeless at gmail; let me know if that's not the right address.
Assignee: server-ops → shyam
Status: NEW → RESOLVED
Last Resolved: 7 years ago
Resolution: --- → FIXED
(Assignee)

Comment 4

7 years ago
The check has some issues; working on that.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
(Assignee)

Comment 5

7 years ago
And fixed up the firewall, so this should be good.
Status: REOPENED → RESOLVED
Last Resolved: 7 years ago
Resolution: --- → FIXED
(Reporter)

Comment 6

7 years ago
yep, i got the email complaining that i'm out of space. thanks
Status: RESOLVED → VERIFIED
Product: mozilla.org → mozilla.org Graveyard