Closed
Bug 1261710
Opened 9 years ago
Closed 9 years ago
Nagios check for Backlog Age should ignore Win10 testing
Categories
(Infrastructure & Operations Graveyard :: CIDuty, task)
Infrastructure & Operations Graveyard
CIDuty
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: nthomas, Assigned: nthomas)
References
Details
Attachments
(1 file)
2.72 KB,
patch
|
catlee
:
review+
nthomas
:
checked-in+
|
Details | Diff | Splinter Review |
Win64 testing isn't ready yet, but RyanVM is pushing to try to work on it. No problem with that, but there's only 1 slave (out of the two for testing) and it takes many hours to clear the backlog out. In the meantime nagios is bleating like this:
Sun 13:50:46 PDT [4458] nagios1.private.releng.scl3.mozilla.com:Backlog Age is CRITICAL: CRITICAL Backlog Age: 24h:09m:25s
So, can we somehow exclude jobs names that begin with 'Windows 10 64-bit try' until we have a full-size test pool ?
Comment 1•9 years ago
|
||
(In reply to Nick Thomas [:nthomas] from comment #0)
> Win64 testing isn't ready yet, but RyanVM is pushing to try to work on it.
To be clear, Windows 10 testing is the issue here, not Win64 :)
Summary: Nagios check for Backlog Age should ignore Win64 testing → Nagios check for Backlog Age should ignore Win10 testing
Updated•9 years ago
|
Assignee: relops → nobody
Component: RelOps → Buildduty
Product: Infrastructure & Operations → Release Engineering
QA Contact: arich → bugspam.Callek
Assignee | ||
Comment 2•9 years ago
|
||
Assignee: nobody → nthomas
Attachment #8737945 -
Flags: review?(catlee)
Comment 3•9 years ago
|
||
Comment on attachment 8737945 [details] [diff] [review]
Add support to ignore prefixes
Review of attachment 8737945 [details] [diff] [review]:
-----------------------------------------------------------------
just a minor nit to fix up. looks good otherwise!
::: nagios-related/check_backlog_age.py
@@ +35,2 @@
> # Get the earliest 'submitted_at' value
> +def get_min_submitted_at(ignore_prefixes):
This argument name shadows the global variable name above, which is a bit confusing.
Can you either rename the global (e.g. IGNORE_PREFIXES), or have this function simply use the global variable?
Attachment #8737945 -
Flags: review?(catlee) → review+
Assignee | ||
Comment 4•9 years ago
|
||
Comment on attachment 8737945 [details] [diff] [review]
Add support to ignore prefixes
Landed with fix to use ignore_prefixes as a global:
https://hg.mozilla.org/build/braindump/rev/0a3b9bec2572
Attachment #8737945 -
Flags: checked-in+
Assignee | ||
Comment 5•9 years ago
|
||
ashish, how do we deploy this change to nagios1.private.releng.scl3.mozilla.com ?
Flags: needinfo?(ashish)
Comment 6•9 years ago
|
||
Passing on ni? to :ryanc, who's point for Nagios now.
Flags: needinfo?(ashish) → needinfo?(rchilds)
Comment 7•9 years ago
|
||
Hey Nick,
You'd commit this in Puppet via,
modules/nrpe/files/plugins/check_backlog_age
Let me know if you need further assistance with this.
Flags: needinfo?(rchilds)
Assignee | ||
Comment 8•9 years ago
|
||
ryanc, I don't have access to that part of SVN, would you mind landing it for me ? Attachment 8737945 [details] [diff] is the diff but note that the file path and name differ.
Flags: needinfo?(rchilds)
Comment 9•9 years ago
|
||
Nick,
Committed in r116593. Still looks to function the same,
17:26:42 <nagios-releng> ryanc: nagios1.private.releng.scl3.mozilla.com:Backlog Age is WARNING - WARNING Backlog Age: 11h:22m:59s, Ignored: 6 Last Checked: 2016-04-07 17:26:11 PDT
Let me know if you should need anything else.
Flags: needinfo?(rchilds)
Assignee | ||
Comment 10•9 years ago
|
||
Thanks Ryan, the "Ignore: <N>" part is new so all is well.
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
Updated•7 years ago
|
Product: Release Engineering → Infrastructure & Operations
Updated•5 years ago
|
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in
before you can comment on or make changes to this bug.
Description
•