Closed
Bug 1109752
Opened 10 years ago
Closed 10 years ago
All trees closed due to high AWS pending test backlog
Categories
(Infrastructure & Operations Graveyard :: CIDuty, task)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: RyanVM, Assigned: Callek)
Details
Attachments
(1 file)
1002 bytes, patch (mrrrgn: review+, mrrrgn: review+)
I'm seeing high numbers of pending AWS tests across all trees, and Nagios is alerting in #buildduty about high numbers of pending jobs. All trees are closed, including Gaia.
Comment 1•10 years ago
New instances are failing to start buildbot because runner can't find hg via hgtool.py. We're bleeding capacity as old instances terminate themselves to pick up the new image.
Comment 2•10 years ago
Clarifying: this is a configuration issue on the AWS machines (they can't find the hgtool.py script), not an issue interacting with hg.m.o.
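To illustrate the distinction, here is a minimal sketch of a preflight check an instance could run before starting buildbot. It reports which required tools cannot be resolved on the search path. The function name `find_missing_tools` and the tool names passed to it are hypothetical; nothing here is taken from runner or from the puppet patch in this bug.

```python
import os
import shutil


def find_missing_tools(tools, search_path=None):
    """Return the names in `tools` that cannot be resolved on the search path.

    `search_path` is an os.pathsep-separated string; it defaults to $PATH.
    """
    path = search_path if search_path is not None else os.environ.get("PATH", "")
    # shutil.which returns None when no executable with that name is found.
    return [name for name in tools if shutil.which(name, path=path) is None]


if __name__ == "__main__":
    # Hypothetical tool list for illustration only.
    missing = find_missing_tools(["hg", "hgtool.py"])
    if missing:
        print("missing tools: " + ", ".join(missing))
```

A check like this fails locally on the machine, which would point at image configuration rather than at the remote hg server.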
Comment 3 (Assignee)•10 years ago
This patch should fix the root of the problem. In parallel, :rail is reverting the golden AMIs to yesterday's, which will avoid this bustage altogether.
Assignee: nobody → bugspam.Callek
Status: NEW → ASSIGNED
Attachment #8534536 - Flags: review?(winter2718)
Updated•10 years ago
Attachment #8534536 - Flags: review+
Updated•10 years ago
Attachment #8534536 - Flags: review+
Comment 4 (Reporter)•10 years ago
As an update, we're just waiting for the current pending backlog to come down before reopening. Rail's revert is working for getting the line moving in the right direction :)
Comment 5 (Reporter)•10 years ago
Backlog is looking better and new Linux test jobs appear to be starting reasonably fast now. I'm reopening everything.
Updated (Assignee)•10 years ago
Attachment #8534536 - Flags: review?(winter2718)
Comment 6 (Assignee)•10 years ago
Pushed http://hg.mozilla.org/build/puppet/rev/a2e182a28c4e
Comment 7 (Assignee)•10 years ago
Cautiously optimistic here, marking as fixed. We'll know for sure after tomorrow's AMIs get generated. A link that showed the problem today: https://www.hostedgraphite.com/da5c920d/grafana/#/dashboard/temp/e5db589335c850ef95f52b85c2585442aa61c401?panelId=5&fullscreen
Updated (Assignee)•10 years ago
Status: ASSIGNED → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Comment 8 (Assignee)•10 years ago
...and I landed an unsaved version, and tested said version, thus I caused bustage. The fix:
https://hg.mozilla.org/build/puppet/rev/01a37f44eafe
https://hg.mozilla.org/build/puppet/rev/521aa8dd8a02
Comment 9•10 years ago
I don't like the conditional here: depending on install order, /usr/bin/hg may end up pointing to either the releng hg or the system hg. Maybe the two packages should explicitly conflict, so that only one can be installed?
Flags: needinfo?(bugspam.Callek)
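A minimal sketch of how one might check which installation /usr/bin/hg actually resolves to on a given machine, by following symlinks. The helper name `points_into` and any prefix passed to it are hypothetical; the real releng install location is not stated in this bug.

```python
import os


def points_into(path, expected_prefix):
    """Return True if `path`, after following symlinks, lives under `expected_prefix`."""
    real = os.path.realpath(path)
    prefix = os.path.realpath(expected_prefix)
    # commonpath equals the prefix only when `real` is inside (or is) the prefix.
    return os.path.commonpath([real, prefix]) == prefix


if __name__ == "__main__":
    # Print where /usr/bin/hg really points on this machine.
    print(os.path.realpath("/usr/bin/hg"))
```

Running this on instances built with different install orders would show whether the ambiguity described above occurs in practice.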
Updated•9 years ago
Flags: needinfo?(bugspam.Callek)
Updated•6 years ago
Product: Release Engineering → Infrastructure & Operations
Updated•4 years ago
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard