Closed
Bug 635416
Opened 15 years ago
Closed 15 years ago
reboot requests
Categories
(Infrastructure & Operations :: RelOps: General, task)
Infrastructure & Operations
RelOps: General
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: dustin, Assigned: zandr)
References
Details
(Whiteboard: [slaveduty])
bug 629511 took care of a bunch of reboots, but a few were missed:
mv-moz2-linux-ix-slave07
mv-moz2-linux-ix-slave21
mv-moz2-linux-ix-slave22
w32-ix-slave05
I can't get to the IPMI interface on any of these. The first three are unpingable, while the last failed doing a MozBuildTools install, and seems to be hung without RDP, VNC, or SSH access available.
| Reporter | ||
Updated•15 years ago
|
Alias: reboots
| Reporter | ||
Comment 1•15 years ago
|
||
no ping:
talos-r3-fed-003
talos-r3-fed-018
| Reporter | ||
Comment 2•15 years ago
|
||
no ping:
talos-r3-fed-024
talos-r3-fed-008
talos-r3-w7-032
| Reporter | ||
Comment 3•15 years ago
|
||
no ping:
talos-r3-w7-036
| Reporter | ||
Comment 4•15 years ago
|
||
^^ w7-036 may deserve its own bug for further - it seems to fail to reboot more often than others. I'll leave that to server ops..
| Reporter | ||
Comment 5•15 years ago
|
||
rebooted in bug 634368, but I didn't manage to catch it before it disappeared again:
moz2-darwin9-slave51
| Reporter | ||
Comment 6•15 years ago
|
||
ping, but no SSH or VNC. Not running buildslave, so no worries:
talos-r3-fed64-040
| Reporter | ||
Comment 7•15 years ago
|
||
no ping:
talos-r3-fed64-044
| Reporter | ||
Comment 8•15 years ago
|
||
argh, my eyes are getting bleary. ** IGNORE comment 7 **
no ping:
talos-r3-fed-044
talos-r3-fed-039
| Reporter | ||
Comment 9•15 years ago
|
||
the two hosts in comment #8 are part of the mass die-off in bug 636051. I'll re-add them here if we decide a reboot is the appropriate solution.
| Assignee | ||
Comment 10•15 years ago
|
||
fed-003: date problem
fed-008: date problem
fed-018: DHCP failure at 19 Feb 07:07
fed-024: DHCP failure at 20 Feb 18:57
fed-039: got it in bug 636051
fed-044: got in in bug 636051
fed64-040: looked like it never got rebooted after imaging?
w7-032: gray screen -> reboot
w7-036: gray screen -> reboot
| Reporter | ||
Comment 11•15 years ago
|
||
After zandr's impromptu scl trip, the list is:
moz2-darwin9-slave51
mv-moz2-linux-ix-slave07
mv-moz2-linux-ix-slave21
mv-moz2-linux-ix-slave22
talos-r3-fed-042
w32-ix-slave05
(and yes, talos-r3-fed64-040 hasn't been set up yet)
| Assignee | ||
Updated•15 years ago
|
Assignee: server-ops-releng → zandr
| Reporter | ||
Comment 12•15 years ago
|
||
add
mv-moz2-linux-ix-slave10 (no ping)
| Reporter | ||
Comment 13•15 years ago
|
||
add
linux-ix-slave16 (fallout from bug 636342)
| Reporter | ||
Comment 14•15 years ago
|
||
add
w32-ix-slave10 (stuck at the OPSI prompt; needs a reboot and the event log needs to be cleared (run -> eventvwr, clear out the OPSI list))
| Reporter | ||
Comment 15•15 years ago
|
||
add
talos-r3-fed64-030 (no ping)
Comment 16•15 years ago
|
||
(In reply to comment #12)
> add
> mv-moz2-linux-ix-slave10 (no ping)
Managed to reset this using IPMI, there was barf on the console from puppetd.
| Reporter | ||
Comment 17•15 years ago
|
||
add:
talos-r3-xp-024 (no ping)
| Reporter | ||
Comment 18•15 years ago
|
||
add
w32-ix-slave10 (stuck at the OPSI prompt; needs a reboot and the event log
needs to be cleared (run -> eventvwr, clear out the OPSI list))
| Reporter | ||
Comment 19•15 years ago
|
||
add
w32-ix-slave14 (same reason)
| Reporter | ||
Comment 20•15 years ago
|
||
add
talos-r3-fed-022 (no ping)
| Reporter | ||
Comment 21•15 years ago
|
||
add
cm-bbot-linux-002.mozilla.org
(if you'd like this one on a separate bug, let me know)
| Reporter | ||
Comment 22•15 years ago
|
||
add
win64-ix-ref (no ping)
(see bug 635416)
Comment 24•15 years ago
|
||
add
talos-r3-w7-036.build.scl1.mozilla.com no ping or ssh
| Reporter | ||
Comment 25•15 years ago
|
||
linux-ix-slave16 is on the list, but also has slow io, so maybe it should just be bundled off to IX while it's down?
| Reporter | ||
Comment 26•15 years ago
|
||
add
w32-ix-slave08 (no ping, IMPI doesn't work)
Comment 27•15 years ago
|
||
add
w32-ix-slave18 (no ping or ssh)
Comment 28•15 years ago
|
||
(In reply to comment #27)
> add
> w32-ix-slave18 (no ping or ssh)
ignore this - nick can reach it via vnc and it appears stopped
| Reporter | ||
Comment 29•15 years ago
|
||
add
talos-r3-fed-023 (no ping)
| Assignee | ||
Comment 30•15 years ago
|
||
talos-r3-xp-024: gray screen -> reboot
talos-r3-w7-036: gray screen -> reboot
talos-r3-fed-022: date problem
talos-r3-fed-023: gray screen -> reboot
talos-r3-fed-042: gray screen -> reboot
talos-r3-fed-051: date problem
talos-r3-fed64-030: gray screen -> reboot
talos-r3-fed64-038: gray screen -> reboot
| Assignee | ||
Comment 31•15 years ago
|
||
w32-ix-slave05: Hung in MozillaBuild install -> rebooted, came up normally
IPMI is pending inventory update: correct address is 10.250.50.229
w32-ix-slave08: S.M.A.R.T. status BAD -> powered off, added to repair list
IPMI is pending inventory update: correct address is 10.250.50.232
w32-ix-slave10: Hung at OPSI prompt -> rebooted, came up normally
IPMI is pending inventory update: correct address is 10.250.50.234
w32-ix-slave14: Up, responsive.
IPMI is pending inventory update: correct address is 10.250.50.238
linux-ix-slave16: rebooted during (inadvertent) move to scl1.
mv-moz2-linux-ix-slave07: No address on eth0, rebooted. bug 636390?
mv-moz2-linux-ix-slave10: Up and responsive
mv-moz2-linux-ix-slave21: host down, IPMI wedged -> bug 639424
mv-moz2-linux-ix-slave22: No address on eth0, rebooted. bug 636390?
I've sent rtucker the inventory update file, should get applied in a day or so.
That leaves:
moz2-darwin9-slave51
cm-bbot-linux-002
which are MPT, so I filed bug 639425
Thus endeth another reboots bug. :)
Status: NEW → RESOLVED
Closed: 15 years ago
Resolution: --- → FIXED
Comment 32•15 years ago
|
||
Could you remove the itrequest flag on bug 639425 ?
Comment 33•15 years ago
|
||
And thanks!
| Assignee | ||
Comment 34•15 years ago
|
||
(In reply to comment #32)
> Could you remove the itrequest flag on bug 639425 ?
done
| Reporter | ||
Updated•15 years ago
|
Alias: reboots
Updated•12 years ago
|
Component: Server Operations: RelEng → RelOps
Product: mozilla.org → Infrastructure & Operations
You need to log in
before you can comment on or make changes to this bug.
Description
•