Closed
Bug 838925
Opened 11 years ago
Closed 11 years ago
Add monitoring for stuck timeout loops
Categories
(Testing Graveyard :: Mozpool, defect)
Testing Graveyard
Mozpool
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: dustin, Assigned: dustin)
Details
Attachments
(3 files)
1.46 KB,
patch
|
mcote
:
review+
|
Details | Diff | Splinter Review |
1.43 KB,
patch
|
dividehex
:
review+
|
Details | Diff | Splinter Review |
860 bytes,
patch
|
ashish
:
review+
|
Details | Diff | Splinter Review |
Bug 817762 seems to be back again, at least (thankfully) in staging. We should monitor for this in production, too. I think the easiest way will be for mozpool to touch a file every time it runs the timeout loop. Then, nagios can check the age of that file.
Assignee | ||
Comment 1•11 years ago
|
||
Easy. I'll add this config item via puppet, and then monitor it with nagios.
Attachment #711943 -
Flags: review?(mcote)
Assignee | ||
Comment 2•11 years ago
|
||
corresponding patch for puppet
Attachment #711949 -
Flags: review?(jwatkins)
Assignee | ||
Comment 3•11 years ago
|
||
And the change to add the monitoring in nagios. check_file_age already exists on the imaging servers and in nrpe.cfg.
Attachment #711954 -
Flags: review?(ashish)
Comment 4•11 years ago
|
||
Comment on attachment 711954 [details] [diff] [review] infrapuppet.patch Looks good!
Attachment #711954 -
Flags: review?(ashish) → review+
Updated•11 years ago
|
Attachment #711949 -
Flags: review?(jwatkins) → review+
Assignee | ||
Comment 5•11 years ago
|
||
Comment on attachment 711943 [details] [diff] [review] bug838925.patch D'oh! I just landed this one instead of the puppet patch. I will back out if it's not OK.
Comment 6•11 years ago
|
||
Comment on attachment 711943 [details] [diff] [review] bug838925.patch Looks good.
Attachment #711943 -
Flags: review?(mcote) → review+
Assignee | ||
Comment 7•11 years ago
|
||
The puppetagain and mozpool patches are landed. I'll land the infra puppet patch when the others are in production.
Assignee | ||
Comment 8•11 years ago
|
||
infra puppet patch landed, although it had at least three bugs in it (wrong nagios server, wrong hostgroup, and in the wrong file)!
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Updated•8 years ago
|
Product: Testing → Testing Graveyard
You need to log in
before you can comment on or make changes to this bug.
Description
•