Bug 1146974
Opened 9 years ago
Closed 9 years ago
Add automated alerting for runner abnormally high retries.
Categories
(Release Engineering :: General, defect)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: mrrrgn, Assigned: mrrrgn)
We can make use of the InfluxDB data (the same data that powers https://stats.taskcluster.net/grafana/#/dashboard/db/runner).
Updated•9 years ago
Summary: Add automated alerting for runner high retries. → Add automated alerting for runner abnormally high retries.
Comment 1•9 years ago
It looks like the way to go here is via integration with Nagios.
Updated•9 years ago
Assignee: nobody → winter2718
Comment 2•9 years ago
Come to think of it, it would be even simpler to do this via Papertrail.
Comment 3•9 years ago
buildduty and I are signed up for alerts. An alert is triggered if more than 100 retries (across all machines) are seen within a one-minute period.
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
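For reference, the alert rule described in comment 3 (more than 100 retries across all machines within one minute) can be sketched as a sliding-window counter. This is only an illustration of the threshold logic, not the actual Papertrail alert configuration; the class name `RetryAlert` and its parameters are hypothetical, and it assumes retry timestamps arrive in non-decreasing order.

```python
from collections import deque


class RetryAlert:
    """Hypothetical sliding-window check: fire when more than `threshold`
    retries (summed over all machines) fall inside `window_seconds`."""

    def __init__(self, threshold=100, window_seconds=60):
        self.threshold = threshold
        self.window_seconds = window_seconds
        self._events = deque()  # timestamps of recent retries, oldest first

    def record(self, timestamp):
        """Record one retry seen at `timestamp` (seconds).

        Returns True if the number of retries in the current window
        exceeds the threshold, i.e. an alert should be sent.
        """
        self._events.append(timestamp)
        # Drop retries that are a full window (or more) older than this one.
        cutoff = timestamp - self.window_seconds
        while self._events and self._events[0] <= cutoff:
            self._events.popleft()
        return len(self._events) > self.threshold


if __name__ == "__main__":
    alert = RetryAlert()
    # Simulate 101 retries arriving within the same second.
    fired = [alert.record(0.0) for _ in range(101)]
    print("alert fired:", fired[-1])
```

Exactly 100 retries in the window does not fire the alert; the 101st does, matching the "more than 100" wording.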
Updated•7 years ago
Component: Tools → General