Closed Bug 890317 (t-w732-ix-064) Opened 12 years ago Closed 10 years ago

t-w732-ix-064 problem tracking

Categories

(Infrastructure & Operations Graveyard :: CIDuty, task, P3)

x86
Windows 7

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: bhearsum, Unassigned)

References

Details

(Whiteboard: [buildduty][buildslaves][capacity])

Attachments

(1 file)

Trying a PDU reboot.
Didn't work; needs IT help.
Depends on: 890333
Per bug 890333, the NIC is still being "weird", but it is taking jobs right now. I gave IT the OK to yank it as needed, and I've disabled it in slavealloc so future boots don't start jobs.
cc'ing the sheriffs in case the issue mentioned in comment 2 gets on our radars
re-enabled in slavealloc
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED
Status: RESOLVED → REOPENED
Depends on: 894440
Resolution: FIXED → ---
Product: mozilla.org → Release Engineering
Slave is ready for production...but didn't come back from a reboot.
Depends on: 912206
Slave is in production, but not ready for it - failing webgl tests, timing out xperf tests, timing out cloning talos, generally acting like even after a reimage to catch it up with all the things it missed over the last two months, it'll still be in pretty bad shape. Disabled in slavealloc.
Depends on: 913469
(In reply to Phil Ringnalda (:philor) from comment #6)
> Slave is in production, but not ready for it - failing webgl tests, timing
> out xperf tests, timing out cloning talos, generally acting like even after
> a reimage to catch it up with all the things it missed over the last two
> months, it'll still be in pretty bad shape.
>
> Disabled in slavealloc.
Cheers. Back to diagnostics.
Diagnostics showed no errors. Not sure what to do now.
Since diags did a reimage and saw no errors I tried a reboot, but that failed too, so we'll need human touch again anyway.
[jwood@cruncher.srv.releng.scl3 ~]$ for i in 064; do curl http://slaveapi-dev1.srv.releng.scl3.mozilla.com:8000/slave/t-w732-ix-$i/action/reboot; done
{
  "reboots": {
    "55975568": {
      "state": 3,
      "text": "Attempting SSH reboot...Failed.\nAttempting IPMI reboot...Failed.\nCan't do anything else, human intervention needed."
    }
  }
}
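For anyone triaging these by script, here is a rough Python sketch of how the reboot response quoted above could be summarized. It relies only on the JSON shape and message wording shown in that one response. The function name `summarize_reboots` is made up for illustration, the meaning of the numeric `state` field is not assumed, and nothing here is confirmed as slaveapi's documented behavior.

```python
import json

# Response body copied from the comment above. The raw string keeps the
# "\n" escapes intact so that json.loads decodes them into real newlines.
RESPONSE = r'''
{ "reboots": { "55975568": { "state": 3, "text": "Attempting SSH reboot...Failed.\nAttempting IPMI reboot...Failed.\nCan't do anything else, human intervention needed." } } }
'''


def summarize_reboots(body):
    """Hypothetical helper: list failed reboot methods per request id.

    Assumes each attempt is logged as "Attempting <METHOD> reboot...Failed."
    on its own line, as in the single response quoted in this bug.
    """
    summary = {}
    for request_id, info in json.loads(body)["reboots"].items():
        text = info["text"]
        failed = [
            line[len("Attempting "):line.index(" reboot...")]
            for line in text.split("\n")
            if line.startswith("Attempting ") and line.endswith("...Failed.")
        ]
        summary[request_id] = {
            "failed": failed,
            "needs_human": "human intervention needed" in text,
        }
    return summary


if __name__ == "__main__":
    print(summarize_reboots(RESPONSE))
```

If the message wording differs for other slaves or other outcomes, the string matching would need adjusting; this only covers the failure case seen here.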
Depends on: 929180
Status: REOPENED → RESOLVED
Closed: 12 years ago → 11 years ago
Resolution: --- → FIXED
I rebooted it again.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
2 monitors. dxdiag shows acceleration disabled. Need IT's intervention.
Depends on: 933957
The graphics setup should be correct now - I put the machine back in production.
Status: REOPENED → RESOLVED
Closed: 11 years ago → 11 years ago
Resolution: --- → FIXED
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
(In reply to Ryan VanderMeulen [:RyanVM UTC-5] from comment #14)
> Sure about that?
> https://tbpl.mozilla.org/php/getParsedLog.php?id=30437374&tree=Mozilla-Central
>
> Disabled.
Of course I'm not.
Depends on: 938274
Hard drive has been replaced and the machine has been re-imaged. Back into production.
Status: REOPENED → RESOLVED
Closed: 11 years ago → 11 years ago
Resolution: --- → FIXED
Attempting SSH reboot...Failed.
Attempting IPMI reboot...Failed.
Filed IT bug for reboot (bug 1188656)
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Reenabled after another disk replacement/reimage.
Status: REOPENED → RESOLVED
Closed: 11 years ago → 10 years ago
Resolution: --- → FIXED
But after multiple reboots, it still isn't taking jobs.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Re-imaged the slave and enabled it in slavealloc. So far it has successfully completed two jobs.
Status: REOPENED → RESOLVED
Closed: 10 years ago → 10 years ago
Resolution: --- → FIXED
Product: Release Engineering → Infrastructure & Operations
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard