Closed Bug 1189476 Opened 9 years ago Closed 9 years ago

decommission foopies, mozpool, panda buildbot servers once pandas are no longer in use

Categories

(Infrastructure & Operations :: RelOps: General, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: kmoir, Assigned: arich)

References

Details

Attachments

(1 file, 1 obsolete file)

Should be able to relocate some of this hardware to the linux talos test pool
Summary: decommission foopies and mozpool servers once pandas are no longer used → decommission foopies and mozpool servers once pandas are no longer in use
Depends on: 1193002
foopies 102-104,39-56 can be decommissioned as described in bug 1193002
Assignee: nobody → relops
Component: Platform Support → RelOps
Product: Release Engineering → Infrastructure & Operations
QA Contact: coop → arich
Version: unspecified → other
It's going to take some figuring to correlate these to the panda chassis/relays and racks/mobile imaging servers since they weren't decommed according to the physical layout of the systems. I'll do that and update this bug with the things we can actually decomm.
Assignee: relops → arich
I'm going to open a dcops bug to decommission things by chassis/panda-relay instead of by specific board, since we aren't going to pull individual boards. Here's what I've come up with that most closely matches what you've specified. panda-relay-002.p10.releng.scl3.mozilla.com (22-32) panda-relay-003.p10.releng.scl3.mozilla.com (34-43) panda-relay-004.p10.releng.scl3.mozilla.com (46-55) We can't decomm the rest of p10, so that leaves a rack less than half full. panda-relay-007.p1.releng.scl3.mozilla.com (82-92, 610) panda-relay-008.p1.releng.scl3.mozilla.com (93-103, 611) panda-relay-009.p1.releng.scl3.mozilla.com (104-114, 612) panda-relay-010.p1.releng.scl3.mozilla.com (115-125) panda-relay-011.p1.releng.scl3.mozilla.com (127-136, 613-614) panda-relay-012.p1.releng.scl3.mozilla.com (137-147) panda-relay-013.p1.releng.scl3.mozilla.com (148-158, 615) panda-relay-014.p1.releng.scl3.mozilla.com (159-169) That's all of p1, but we can't decomm mobile-imaging-001 since it's the primary mozpool server that syncs with the DB. That has to go last. So we'll have a rack empty except for this one chassis (set of 4 machines, 3 foopies and the mobile imaging server). panda-relay-015.p2.releng.scl3.mozilla.com (170-180) panda-relay-016.p2.releng.scl3.mozilla.com (181-191, 616) panda-relay-017.p2.releng.scl3.mozilla.com (192-202) panda-relay-018.p2.releng.scl3.mozilla.com (203-212) panda-relay-019.p2.releng.scl3.mozilla.com (214-224,617) panda-relay-020.p2.releng.scl3.mozilla.com (225-235, 618) panda-relay-021.p2.releng.scl3.mozilla.com (236-246) panda-relay-022.p2.releng.scl3.mozilla.com (33, 247-255) That's all of p2, so we can decommission mobile-imaging-002.p2.releng.scl3.mozilla.com and everything in that rack. panda-relay-023.p3.releng.scl3.mozilla.com (258-268) panda-relay-024.p3.releng.scl3.mozilla.com (269-279) panda-relay-025.p3.releng.scl3.mozilla.com (280-290) panda-relay-026.p3.releng.scl3.mozilla.com (291-301, 620) Can we decomm panda-0620 so we can decomm all of panda-relay-026, please? This leaves us with part of p3, part of p10, and all of p4, p5 and p6. p7, p8, and p9 had already been decommed.
Flags: needinfo?(kmoir)
We could just change the primary
I've updated my patch to decomm panda-620 in bug 193002 and disabled it in slavealloc
Flags: needinfo?(kmoir)
Attached patch panda-decomm.diff (obsolete) — Splinter Review
Remove decommed pandas and infrastructure from nagios.
Attachment #8679058 - Flags: review?(kmoir)
This adds in pandas that have been retasked to replace dead pandas.
Attachment #8679058 - Attachment is obsolete: true
Attachment #8679058 - Flags: review?(kmoir)
Attachment #8679061 - Flags: review?(kmoir)
Attachment #8679061 - Flags: review?(kmoir) → review+
I had one typo and had to rmeove the mozpool relay check for mobile-imaging-001. Original checkin is revision 109696; fixes checked in in revision 109702. Kim: let me know when I can tell dcops to decomm stuff.
Flags: needinfo?(kmoir)
Depends on: 1218571
I think the mozpool server change need to land ahead first (bug 1218571) before we decomm stuff
Flags: needinfo?(kmoir)
I removed mobile-imaging-001 in revision 109808. I'll open up a bug for dcops to do the actual physical decomm now.
Depends on: 1219260
The remaining hosts in p3 and p10 removed from nagios in svn revision 109982.
I think the following foopies can be decommed too foopy102,103,53, 54, 55, 56
Please decommission all remaining panda racks, foopies 60-80 and mozpool servers. No remaining panda jobs are running.
Summary: decommission foopies and mozpool servers once pandas are no longer in use → decommission foopies, mozpool, panda buildbot servers once pandas are no longer in use
I also removed entries for bm89, and bm100-102 today in puppet etc and disabled them in slavealloc so these machines could be decommissioned
Depends on: 1259076
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: