Closed Bug 720167 (w32-ix-slave16) Opened 12 years ago Closed 12 years ago

w32-ix-slave16 problem tracking

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: nthomas, Unassigned)

Details

(Whiteboard: [buildduty][capacity][buildslaves])

Nick Thomas [:nthomas] (UTC+12)

Reporter

Description

•

12 years ago

From bug 719284:
(In reply to Jake Watkins [:dividehex] from comment #6)
> (In reply to Jake Watkins [:dividehex] from comment #2)
> > Also missed this one in that last reboot bug
> > w32-ix-slave16 (w32-ix-slave16-mgmt isn't responding, this is one of the
> > machines that didn't like the extra RAM in bug 672969)
> 
> This slave couldn't keep a solid net link going.  Cable tested ok.  A reboot
> seems to have cleared this up but if it happens again, we should pull it for
> repairs.

Nagios thinks it's down again.

Amy Rich [:arr] [:arich]

Updated

•

12 years ago

Assignee: server-ops-releng → jwatkins

colo-trip: --- → scl1

QA Contact: mrz → zandr

Jake Watkins [:dividehex]

Comment 1

•

12 years ago

Net link was going up and down.  I've pulled this for repairs.

Jake Watkins [:dividehex]

Comment 2

•

12 years ago

IX Ticket ID: IGS-317706

IX will pick this up when they drop off the repaired systems.

Jake Watkins [:dividehex]

Comment 3

•

12 years ago

IX will repair/remove for repairs tomorrow (1/31) when they visit SCL1 for the memory upgrades that didn't take.

Jake Watkins [:dividehex]

Comment 4

•

12 years ago

Chris from IX has taken this for repairs

Amy Rich [:arr] [:arich]

Updated

•

12 years ago

Assignee: jwatkins → mlarrain

Matthew Larrain[:MaRu]

Comment 5

•

12 years ago

Talked to iX today; they will be at SCL1 either Wednesday or Thursday to drop off this system.

Status: NEW → ASSIGNED

Matthew Larrain[:MaRu]

Comment 6

•

12 years ago

This machine was still broken when iX brought it back, so they took it again for service.

Jake Watkins [:dividehex]

Comment 7

•

12 years ago

Digipengi has been emailing with Matt Finney at IX, and apparently they lost track of the repair ticket and closed it before the machine was returned to us.  It has been repaired and we are arranging a time/date for them to return it.

Jake Watkins [:dividehex]

Comment 8

•

12 years ago

We just received this slave back from IX.  Net link issue is fixed but it refuses to detect a HDD attached to it now.  I have emailed IX about it.

Jake Watkins [:dividehex]

Comment 9

•

12 years ago

Paul from IX came back out and found the HDD power cable was not actually attached to the cable coming from the PSU.  He reattached it and it is now detecting the drive.

It is currently being re-imaged.

Matthew Larrain[:MaRu]

Comment 10

•

12 years ago

Machine has been imaged and is ready to go back into the pool

Assignee: mlarrain → nobody

Component: Server Operations: RelEng → Release Engineering

QA Contact: zandr → release

Aki Sasaki (not active)

Updated

•

12 years ago

Component: Release Engineering → Release Engineering: Machine Management

QA Contact: release → armenzg

Whiteboard: [buildduty][capacity]

Aki Sasaki (not active)

Comment 11

•

12 years ago

Updated hostname, deleted from opsi; it appears to now be installing opsi packages.
Also disabled in slavealloc atm.

Alias: w32-ix-slave16

Summary: Pull w32-ix-slave16 for repairs → w32-ix-slave16 problem tracking

Aki Sasaki (not active)

Comment 12

•

12 years ago

Cleared opsi log (per alert dialog) and added that to https://wiki.mozilla.org/ReleaseEngineering/How_To/Set_Up_a_Freshly_Imaged_Slave#Reimaged .

Reenabled in slavealloc and rebooted.

Status: ASSIGNED → RESOLVED

Closed: 12 years ago

Resolution: --- → FIXED

Nick Thomas [:nthomas] (UTC+12)

Reporter

Comment 13

•

12 years ago

It didn't manage to sync with opsi past that first time, and has the wrong ssh keys. I've fixed the latter, and am taking a quick look at opsi.

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

Nick Thomas [:nthomas] (UTC+12)

Reporter

Comment 14

•

12 years ago

According to c:\tmp\logonlog.txt, production-opsi is returning HTTP 401 Unauthorized errors to this slave. What's the fix for that, armenzg?
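
For anyone repeating this check on another slave, here is a rough sketch of scanning the logon log for 401 responses. The log format is not shown in this bug, so the marker string and the helper names (find_unauthorized, scan_log) are assumptions for illustration only, not part of OPSI.

```python
from pathlib import Path

# Assumption: the log is plain text and failed requests mention this marker
# somewhere on the line; the real logonlog.txt format is not shown in this bug.
MARKER = "401 Unauthorized"


def find_unauthorized(lines):
    """Return (line_number, text) pairs for lines mentioning the 401 marker."""
    return [
        (number, line.rstrip("\r\n"))
        for number, line in enumerate(lines, start=1)
        if MARKER.lower() in line.lower()
    ]


def scan_log(path):
    """Scan a log file on disk; return an empty list if the file is missing."""
    log = Path(path)
    if not log.is_file():
        return []
    with log.open(encoding="utf-8", errors="replace") as handle:
        return find_unauthorized(handle)


if __name__ == "__main__":
    # Path taken from the comment above.
    for number, text in scan_log(r"c:\tmp\logonlog.txt"):
        print(f"{number}: {text}")
```

The match is case-insensitive, and a missing file is treated as "no hits" rather than an error, so the script can be run on slaves that never wrote the log.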

Armen [:armenzg]

Comment 15

•

12 years ago

I believe we have to follow this info:
https://wiki.mozilla.org/ReleaseEngineering/OPSI#error:_HTTP.2F1.1_401_Unauthorized

Let me know.

Mike Taylor [:bear]

Updated

•

12 years ago

Whiteboard: [buildduty][capacity] → [buildduty][capacity][buildslaves]

Chris AtLee [:catlee]

Comment 16

•

12 years ago

Deleted C:\Program Files\opsi.org\preloginloader\cfg\locked.cfg and rebooted. Looks to be back in business.
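
The lock-file step could be scripted roughly as follows. This is only a sketch of the manual fix described above: the path is the one from this comment, the helper name clear_lock is made up for illustration, and the reboot is deliberately left as a manual step.

```python
from pathlib import Path

# Path taken from the comment above; adjust if OPSI is installed elsewhere.
LOCK_FILE = Path(r"C:\Program Files\opsi.org\preloginloader\cfg\locked.cfg")


def clear_lock(lock_file=LOCK_FILE):
    """Delete the lock file if it exists. Return True if a file was removed."""
    path = Path(lock_file)
    if not path.is_file():
        return False
    path.unlink()
    return True
```

Calling clear_lock() with no arguments targets the path above; it returns False when there is nothing to delete, so it is safe to run more than once before rebooting.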

Status: REOPENED → RESOLVED

Closed: 12 years ago → 12 years ago

Resolution: --- → FIXED

Nobody; OK to take it and work on it

Assignee

Updated

•

11 years ago

Product: mozilla.org → Release Engineering

BMO Automation

Updated

•

6 years ago

Product: Release Engineering → Infrastructure & Operations

BMO Automation

Updated

•

4 years ago

Product: Infrastructure & Operations → Infrastructure & Operations Graveyard


Categories

(Infrastructure & Operations Graveyard :: CIDuty, task)
