Closed
Bug 629763
Opened 13 years ago
Closed 13 years ago
decommission DNR hosts
Categories
(Infrastructure & Operations :: RelOps: General, task)
Infrastructure & Operations
RelOps: General
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: dustin, Assigned: mlarrain)
References
()
Details
This system seems to be having disk problems? It took forever to start up, refusing NRPE and SSH connections, and once it did start accepting those, it did so very slowly. The first time running 'uptime' took a good 5-10 seconds. This will need some investigation from server ops, I think.
Reporter | ||
Comment 1•13 years ago
|
||
moz2-darwin9-slave05 seems to have similar problems.
Summary: moz2-darwin9-slave40 *very* slow → moz2-darwin9-slave40, moz2-darwin9-slave05 *very* slow
Reporter | ||
Comment 2•13 years ago
|
||
Machines are now DNR'd
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → WONTFIX
Comment 3•13 years ago
|
||
So what does DNR mean from the point of view nagios/dns/etc ? I ask because moz2-darwin9-slave40 is still chirruping away in #build: [77] moz2-darwin9-slave40.build.sjc1:PING is CRITICAL: PING CRITICAL - Packet loss = 100%
Comment 4•13 years ago
|
||
Hm, imho, DNR means we should decommission them and pull them from service. I'll remove them from nagios. These should be pulled from the racks.
Assignee: server-ops-releng → mlarrain
Status: RESOLVED → REOPENED
colo-trip: --- → sjc1
Resolution: WONTFIX → ---
Summary: moz2-darwin9-slave40, moz2-darwin9-slave05 *very* slow → decommission moz2-darwin9-slave40 and moz2-darwin9-slave05
Comment 5•13 years ago
|
||
The following are marked as DNR and should be decommissioned: moz2-darwin9-slave05 moz2-darwin9-slave20 moz2-darwin9-slave40 moz2-darwin9-slave59
Summary: decommission moz2-darwin9-slave40 and moz2-darwin9-slave05 → decommission DNR hosts
Comment 10•13 years ago
|
||
try-mac-slave11 p3-linux01 (old dell or hp workstation likely in the mtv third floor server room) bm-l10n-pmac-01 (see bug 625512) try-pmac-unit-01 (see bug 625512) cm-bbot-leopard-003 is sick (see bug 631966)
Assignee | ||
Comment 11•13 years ago
|
||
I will go and pull all these machines today/tomorrow and build a fort around my desk with them till I can wipe there drives and pass them off.
Assignee | ||
Comment 12•13 years ago
|
||
try-mac-slave11 cm-bbot-leopard-003 Unable to find the above machines. Will work with netops to switch hunt them down.
Comment 13•13 years ago
|
||
bug 631966 covers cm-bbot-leopard-003. it got put on zandr's desk sometime back. bug 655126 covers try-mac-slave11. it was also on zandr's desk.
Comment 14•13 years ago
|
||
moz2-darwin9-slave10
Assignee | ||
Comment 15•13 years ago
|
||
all these machines have been pulled and are at my desk. I will figure out if we are going to wipe or drill these drives.
Updated•13 years ago
|
Severity: normal → minor
Comment 16•13 years ago
|
||
Closing because these machines are now tracked at https://docs.google.com/spreadsheet/ccc?key=0AnwZMiMU7toNdE5VQ2swUDJmLUhRR1gwb1pmd0FtMWc&hl=en_US#gid=0
Status: REOPENED → RESOLVED
Closed: 13 years ago → 13 years ago
Resolution: --- → FIXED
Comment 17•13 years ago
|
||
Being removed from slave-alloc in bug 700705.
Updated•11 years ago
|
Component: Server Operations: RelEng → RelOps
Product: mozilla.org → Infrastructure & Operations
You need to log in
before you can comment on or make changes to this bug.
Description
•