Closed Bug 629763 Opened 13 years ago Closed 13 years ago

decommission DNR hosts

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: dustin, Assigned: mlarrain)

References

(
URL
)

Details

Dustin J. Mitchell [:dustin] (he/him)

Reporter

Description

•

13 years ago

This system seems to be having disk problems?  It took forever to start up, refusing NRPE and SSH connections, and once it did start accepting those, it did so very slowly.  The first time running 'uptime' took a good 5-10 seconds.

This will need some investigation from server ops, I think.

Dustin J. Mitchell [:dustin] (he/him)

Reporter

Comment 1

•

13 years ago

moz2-darwin9-slave05 seems to have similar problems.

Summary: moz2-darwin9-slave40 *very* slow → moz2-darwin9-slave40, moz2-darwin9-slave05 *very* slow

Dustin J. Mitchell [:dustin] (he/him)

Reporter

Comment 2

•

13 years ago

Machines are now DNR'd

Status: NEW → RESOLVED

Closed: 13 years ago

Resolution: --- → WONTFIX

Nick Thomas [:nthomas] (UTC+12)

Comment 3

•

13 years ago

So what does DNR mean from the point of view nagios/dns/etc ? I ask because moz2-darwin9-slave40 is still chirruping away in #build:
[77] moz2-darwin9-slave40.build.sjc1:PING is CRITICAL: PING CRITICAL - Packet loss = 100%

Amy Rich [:arr] [:arich]

Comment 4

•

13 years ago

Hm, imho, DNR means we should decommission them and pull them from service.  I'll remove them from nagios.

These should be pulled from the racks.

Assignee: server-ops-releng → mlarrain

Status: RESOLVED → REOPENED

colo-trip: --- → sjc1

Resolution: WONTFIX → ---

Summary: moz2-darwin9-slave40, moz2-darwin9-slave05 *very* slow → decommission moz2-darwin9-slave40 and moz2-darwin9-slave05

Amy Rich [:arr] [:arich]

Comment 5

•

13 years ago

The following are marked as DNR and should be decommissioned:

moz2-darwin9-slave05
moz2-darwin9-slave20
moz2-darwin9-slave40
moz2-darwin9-slave59

Summary: decommission moz2-darwin9-slave40 and moz2-darwin9-slave05 → decommission DNR hosts

Amy Rich [:arr] [:arich]

Comment 10

•

13 years ago

try-mac-slave11
p3-linux01 (old dell or hp workstation likely in the mtv third floor server room)
bm-l10n-pmac-01 (see bug 625512) 
try-pmac-unit-01 (see bug 625512)
cm-bbot-leopard-003 is sick (see bug 631966)

Nick Thomas [:nthomas] (UTC+12)

Updated

•

13 years ago

Blocks: 683011

Matthew Larrain[:MaRu]

Assignee

Comment 11

•

13 years ago

I will go and pull all these machines today/tomorrow and build a fort around my desk with them till I can wipe there drives and pass them off.

Matthew Larrain[:MaRu]

Assignee

Comment 12

•

13 years ago

    try-mac-slave11
    cm-bbot-leopard-003 

Unable to find the above machines. Will work with netops to switch hunt them down.

Amy Rich [:arr] [:arich]

Comment 13

•

13 years ago

bug 631966 covers cm-bbot-leopard-003.  it got put on zandr's desk sometime back.
bug 655126 covers try-mac-slave11.  it was also on zandr's desk.

Amy Rich [:arr] [:arich]

Comment 14

•

13 years ago

moz2-darwin9-slave10

Matthew Larrain[:MaRu]

Assignee

Comment 15

•

13 years ago

all these machines have been pulled and are at my desk. I will figure out if we are going to wipe or drill these drives.

Amy Rich [:arr] [:arich]

Updated

•

13 years ago

Severity: normal → minor

Amy Rich [:arr] [:arich]

Comment 16

•

13 years ago

Closing because these machines are now tracked at 
https://docs.google.com/spreadsheet/ccc?key=0AnwZMiMU7toNdE5VQ2swUDJmLUhRR1gwb1pmd0FtMWc&hl=en_US#gid=0

URL: https://docs.google.com/spreadsheet/c...

Status: REOPENED → RESOLVED

Closed: 13 years ago → 13 years ago

Resolution: --- → FIXED

Armen [:armenzg]

Comment 17

•

13 years ago

Being removed from slave-alloc in bug 700705.

Nobody; OK to take it and work on it

Updated

•

11 years ago

Component: Server Operations: RelEng → RelOps

Product: mozilla.org → Infrastructure & Operations

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

decommission DNR hosts

Categories

(Infrastructure & Operations :: RelOps: General, task)

Tracking

(Not tracked)

People

(Reporter: dustin, Assigned: mlarrain)

References

(
URL
)

Details

Crash Data

Security

(public)

User Story

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 10

Updated

Comment 11

Comment 12

Comment 13

Comment 14

Comment 15

Updated

Comment 16

Comment 17

Updated