Closed
Bug 1223256
Opened 9 years ago
Closed 9 years ago
Verify decommissioned pandas have stopped taking jobs
Categories
(Infrastructure & Operations Graveyard :: CIDuty, task)
Infrastructure & Operations Graveyard
CIDuty
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: coop, Assigned: kmoir)
References
Details
philor pointed me at two pandas tonight that were still taking jobs despite having been (in theory) decommissioned: https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?name=panda-0303 https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?name=panda-0305 Looking at an older copy of devices.json, I saw that these pandas were attached to foopy56. All of the pandas on foopy56 had been decommissioned, but watch_devices.sh was still trying to run against all the pandas and falling back to on-disk tac files. These pandas were still valiantly trying to do work, and burning every second job as a result. I think our decommissioning docs must be missing a step about disabling pandas on foopies before removing them from the db, etc. Can buildduty please verify that all the pandas that were disabled are in fact stopped on the foopies? The easiest way to do this is to: * figure out which foopies are affected * create a shutdown.stamp file for each affected panda on each foopy (a la https://bugzilla.mozilla.org/show_bug.cgi?id=1001705#c1) * kill any processes related to those pandas (look for processes like retry.py, manage_buildslave.sh, watch_devices.sh) * remove the panda-0??? dir once their are no processes running. You'll likely need to be root to do this.
Assignee | ||
Comment 1•9 years ago
|
||
I've done this for the pandas referenced in bug 1193002 Also, updated the doc here, even though pandas are going away soon https://wiki.mozilla.org/ReleaseEngineering/How_To/Android_Tegras#To_disable_pandas
Assignee: nobody → kmoir
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
Updated•9 years ago
|
Blocks: panda-0303
Updated•6 years ago
|
Product: Release Engineering → Infrastructure & Operations
Updated•4 years ago
|
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in
before you can comment on or make changes to this bug.
Description
•