Closed
Bug 635416
Opened 13 years ago
Closed 13 years ago
reboot requests
Categories
(Infrastructure & Operations :: RelOps: General, task)
Infrastructure & Operations
RelOps: General
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: dustin, Assigned: zandr)
References
Details
(Whiteboard: [slaveduty])
bug 629511 took care of a bunch of reboots, but a few were missed:
mv-moz2-linux-ix-slave07
mv-moz2-linux-ix-slave21
mv-moz2-linux-ix-slave22
w32-ix-slave05
I can't get to the IPMI interface on any of these. The first three are unpingable, while the last failed doing a MozBuildTools install, and seems to be hung without RDP, VNC, or SSH access available.
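For anyone triaging a list like this, the "no RDP, VNC, or SSH" check can be roughly sketched as below. This is only an illustration, not the tooling actually used for this bug: the function names (`tcp_probe`, `triage`) are hypothetical, the ports are the usual defaults for each service (an assumption about how these hosts are configured), and ping and IPMI are not covered because ICMP normally needs elevated privileges.

```python
import socket
from typing import Callable, Dict

# Usual default ports for the three remote-access services mentioned above.
# Assumption: the hosts use the defaults; adjust if they do not.
SERVICE_PORTS: Dict[str, int] = {"ssh": 22, "rdp": 3389, "vnc": 5900}


def tcp_probe(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS lookup failures.
        return False


def triage(host: str, probe: Callable[[str, int], bool] = tcp_probe) -> str:
    """Summarise which remote-access services answer on a host."""
    open_services = [
        name for name, port in SERVICE_PORTS.items() if probe(host, port)
    ]
    if open_services:
        return "reachable via " + ", ".join(sorted(open_services))
    return "no remote access: candidate for reboot request"
```

Calling `triage("w32-ix-slave05")` would probe the three ports in turn; the `probe` argument exists so the classification logic can be exercised without touching the network.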
Reporter
Updated•13 years ago
Alias: reboots
Reporter
Comment 1•13 years ago
no ping: talos-r3-fed-003 talos-r3-fed-018
Reporter
Comment 2•13 years ago
no ping: talos-r3-fed-024 talos-r3-fed-008 talos-r3-w7-032
Reporter
Comment 3•13 years ago
no ping: talos-r3-w7-036
Reporter
Comment 4•13 years ago
^^ w7-036 may deserve its own bug for further investigation - it seems to fail to reboot more often than others. I'll leave that to server ops.
Reporter
Comment 5•13 years ago
rebooted in bug 634368, but I didn't manage to catch it before it disappeared again: moz2-darwin9-slave51
Reporter
Comment 6•13 years ago
ping, but no SSH or VNC. Not running buildslave, so no worries: talos-r3-fed64-040
Reporter
Comment 7•13 years ago
no ping: talos-r3-fed64-044
Reporter
Comment 8•13 years ago
argh, my eyes are getting bleary. ** IGNORE comment 7 ** no ping: talos-r3-fed-044 talos-r3-fed-039
Reporter
Comment 9•13 years ago
the two hosts in comment #8 are part of the mass die-off in bug 636051. I'll re-add them here if we decide a reboot is the appropriate solution.
Assignee
Comment 10•13 years ago
fed-003: date problem
fed-008: date problem
fed-018: DHCP failure at 19 Feb 07:07
fed-024: DHCP failure at 20 Feb 18:57
fed-039: got it in bug 636051
fed-044: got it in bug 636051
fed64-040: looked like it never got rebooted after imaging?
w7-032: gray screen -> reboot
w7-036: gray screen -> reboot
Reporter
Comment 11•13 years ago
After zandr's impromptu scl trip, the list is:
moz2-darwin9-slave51
mv-moz2-linux-ix-slave07
mv-moz2-linux-ix-slave21
mv-moz2-linux-ix-slave22
talos-r3-fed-042
w32-ix-slave05
(and yes, talos-r3-fed64-040 hasn't been set up yet)
Assignee
Updated•13 years ago
Assignee: server-ops-releng → zandr
Reporter
Comment 12•13 years ago
add mv-moz2-linux-ix-slave10 (no ping)
Reporter
Comment 13•13 years ago
add linux-ix-slave16 (fallout from bug 636342)
Reporter
Comment 14•13 years ago
add w32-ix-slave10 (stuck at the OPSI prompt; needs a reboot and the event log needs to be cleared (run -> eventvwr, clear out the OPSI list))
Reporter
Comment 15•13 years ago
add talos-r3-fed64-030 (no ping)
Comment 16•13 years ago
(In reply to comment #12)
> add
> mv-moz2-linux-ix-slave10 (no ping)
Managed to reset this using IPMI, there was barf on the console from puppetd.
Reporter
Comment 17•13 years ago
add: talos-r3-xp-024 (no ping)
Reporter
Comment 18•13 years ago
add w32-ix-slave10 (stuck at the OPSI prompt; needs a reboot and the event log needs to be cleared (run -> eventvwr, clear out the OPSI list))
Reporter
Comment 19•13 years ago
add w32-ix-slave14 (same reason)
Reporter
Comment 20•13 years ago
add talos-r3-fed-022 (no ping)
Reporter
Comment 21•13 years ago
add cm-bbot-linux-002.mozilla.org (if you'd like this one on a separate bug, let me know)
Reporter
Comment 22•13 years ago
add win64-ix-ref (no ping) (see bug 635416)
Comment 24•13 years ago
add talos-r3-w7-036.build.scl1.mozilla.com no ping or ssh
Reporter
Comment 25•13 years ago
linux-ix-slave16 is on the list, but also has slow io, so maybe it should just be bundled off to IX while it's down?
Reporter
Comment 26•13 years ago
add w32-ix-slave08 (no ping, IPMI doesn't work)
Comment 27•13 years ago
add w32-ix-slave18 (no ping or ssh)
Comment 28•13 years ago
(In reply to comment #27)
> add
> w32-ix-slave18 (no ping or ssh)
ignore this - nick can reach it via vnc and it appears stopped
Reporter
Comment 29•13 years ago
add talos-r3-fed-023 (no ping)
Assignee
Comment 30•13 years ago
talos-r3-xp-024: gray screen -> reboot
talos-r3-w7-036: gray screen -> reboot
talos-r3-fed-022: date problem
talos-r3-fed-023: gray screen -> reboot
talos-r3-fed-042: gray screen -> reboot
talos-r3-fed-051: date problem
talos-r3-fed64-030: gray screen -> reboot
talos-r3-fed64-038: gray screen -> reboot
Assignee
Comment 31•13 years ago
w32-ix-slave05: Hung in MozillaBuild install -> rebooted, came up normally
  IPMI is pending inventory update: correct address is 10.250.50.229
w32-ix-slave08: S.M.A.R.T. status BAD -> powered off, added to repair list
  IPMI is pending inventory update: correct address is 10.250.50.232
w32-ix-slave10: Hung at OPSI prompt -> rebooted, came up normally
  IPMI is pending inventory update: correct address is 10.250.50.234
w32-ix-slave14: Up, responsive.
  IPMI is pending inventory update: correct address is 10.250.50.238
linux-ix-slave16: rebooted during (inadvertent) move to scl1.
mv-moz2-linux-ix-slave07: No address on eth0, rebooted. bug 636390?
mv-moz2-linux-ix-slave10: Up and responsive
mv-moz2-linux-ix-slave21: host down, IPMI wedged -> bug 639424
mv-moz2-linux-ix-slave22: No address on eth0, rebooted. bug 636390?
I've sent rtucker the inventory update file, should get applied in a day or so.
That leaves:
moz2-darwin9-slave51
cm-bbot-linux-002
which are MPT, so I filed bug 639425
Thus endeth another reboots bug. :)
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Comment 32•13 years ago
Could you remove the itrequest flag on bug 639425 ?
Comment 33•13 years ago
And thanks!
Assignee
Comment 34•13 years ago
(In reply to comment #32)
> Could you remove the itrequest flag on bug 639425 ?
done
Reporter
Updated•13 years ago
Alias: reboots
Updated•11 years ago
Component: Server Operations: RelEng → RelOps
Product: mozilla.org → Infrastructure & Operations