Closed
Bug 711166
(linux64-ix-slave14)
Opened 13 years ago
Closed 13 years ago
linux64-ix-slave14 problem tracking
Categories
(Infrastructure & Operations Graveyard :: CIDuty, task, P2)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: arich, Unassigned)
References
Details
(Whiteboard: [buildduty][buildslave][capacity])
linux64-ix-slave14 is reporting the following errors and should be debugged (try reseating the RAM) and sent back to iX for repairs if it's still non-functional:
CPU 0: Machine Check Exception 4 Bank 5: 00000000000000
TSC 0
Comment 1•13 years ago
The RAM has been reseated and the machine returned to the rack for testing.
Reporter
Comment 2•13 years ago
This hasn't been acting up again, so I'm going to close this bug and we can open a new one if the issue occurs again.
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Comment 3•13 years ago
This slave is down again. Let's get it fixed.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Summary: hardware errors on linux64-ix-slave14 → Please repair linux64-ix-slave14
Comment 4•13 years ago
To be yoinked from scl1, then over to matt to handle the repairs.
colo-trip: --- → scl1
Comment 5•13 years ago
I'll be bringing this to mtv today.
Comment 7•13 years ago
Please give this machine to desktop for repair.
Assignee: mlarrain → hlangi
Component: Server Operations: RelEng → Server Operations: Desktop Issues
QA Contact: zandr → tfairfield
Comment 8•13 years ago
My bad, this should go to iX per me. I guess I thought it was a mini at first. My bad.
Assignee: hlangi → mlarrain
Component: Server Operations: Desktop Issues → Server Operations
QA Contact: tfairfield → cshields
Comment 9•13 years ago
This machine has been given to iX systems for repair.
Updated•13 years ago
Assignee: mlarrain → jwatkins
Component: Server Operations → Server Operations: RelEng
QA Contact: cshields → zandr
Reporter
Updated•13 years ago
Assignee: jwatkins → mlarrain
Comment 10•13 years ago
Talked to iX today; they will be at SCL1 either Wednesday or Thursday to drop off this system.
Comment 11•13 years ago
This machine is back at scl1 and racked. It needs to be imaged once DSPC is working again.
Status: REOPENED → ASSIGNED
Comment 12•13 years ago
Jake is getting DSPC working again and will get this up and running soon.
Assignee: mlarrain → jwatkins
Comment 13•13 years ago
DSPC is working now. I have re-imaged this slave and it is ready to be set up for production.
Assignee: jwatkins → nobody
Component: Server Operations: RelEng → Release Engineering
QA Contact: zandr → release
Updated•13 years ago
Component: Release Engineering → Release Engineering: Machine Management
Priority: -- → P3
Whiteboard: [buildduty][buildslave][capacity]
Updated•13 years ago
Alias: linux64-ix-slave14
Summary: Please repair linux64-ix-slave14 → linux64-ix-slave14 problem tracking
Comment 14•13 years ago
I was going to try to bring this back up, but it's not pingable, and going in through IPMI won't let me power cycle it.
Status: ASSIGNED → NEW
Depends on: 734909
Comment 15•13 years ago
This didn't come back up, so I clicked the "reset" button on IPMI a bunch of times. The machine came back up, but is inaccessible via the network.
Reporter
Comment 16•13 years ago
The console preview is also showing a bunch of USB device errors. I'm thinking this needs to go back to iX again.
Comment 17•13 years ago
c&p from 738418:
> I didn't have any problems accessing the ipmi or the java remote console although
> I did find the eth0 mac did not match what was in inventory/dhcp. Inventory has
> been updated and is retrieving the correct IP now.
I haven't seen any errors on this slave since I worked on it on Mon 3/26. The USB errors may have been from plugging and unplugging the USB keyboard during the reimaging. We will need more evidence of a hardware failure before we send it back to iX. I think maybe we should put this back into production first and see how it does.
Updated•13 years ago
Assignee: nobody → coop
Status: NEW → ASSIGNED
OS: Mac OS X → Linux
Priority: P3 → P2
Comment 18•13 years ago
(In reply to Jake Watkins [:dividehex] from comment #17)
> I think maybe we should put this back into production first
> and see how it does.
It's back in production now.
Status: ASSIGNED → RESOLVED
Closed: 13 years ago → 13 years ago
Resolution: --- → FIXED
Assignee
Updated•11 years ago
Product: mozilla.org → Release Engineering
Updated•11 years ago
Assignee: coop → nobody
Updated•7 years ago
Product: Release Engineering → Infrastructure & Operations
Updated•5 years ago
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard