Tue 21:54:41 UTC  [Unknown] admin2a.private.tpe1.mozilla.com:HP RAID is UNKNOWN: RAID UNKNOWN - /usr/sbin/hpacucli did not execute properly : Error: The specified device does not have any logical drives. (http://m.mozilla.org/HP+RAID) checked the host and saw this: [email@example.com ~]$ sudo hpacucli ctrl all show config Dynamic Smart Array B140i in Slot 0b () Port Name: 1I Port Name: 2I Port Name: 3I Port Name: 4I rebooted host checked the RAID configs/BIOS settings and everything looks fine. the host is complaining of XFS errors upon reboot and checking /etc/fstab, there's a /sdb and a msdos partition which doesn't match admin2b.private.tpe1. i'd like to rekick this host if possible.
:ashish, can we kickstart from tpe1?
The fs looks pretty trashed :/
host has been reimaged, puppetized, and back online. there *might* be an issue with this host so we should keep it on a short leash. 1) system lost raid for no reason 2) when trying to reimage, it couldn't detect a raid or any drives. had to change a config in bios, save, then revert. took a couple of tries. 3) 6+ hours to reimage, 2+ hours to pupppetize with multiple failures. installing a simple package or two can take tens of minutes. not sure if this is due to the pacific ocean and we're kickstarting and puppetizing from scl3. however, I ran the HP Insight Diagnostics and everything passed.
the host has been online for over 3 days and the RAID doesnt show any issues although i notice the always present load. going to close bug, reopen if any issues please. [firstname.lastname@example.org ~]$ uptime 17:57:41 up 3 days, 12:54, 1 user, load average: 8.28, 9.52, 9.91 [email@example.com ~]$ sudo hpacucli ctrl all show config Dynamic Smart Array B140i in Slot 0b () Port Name: 1I Port Name: 2I array A (SATA, Unused Space: 0 MB) logicaldrive 1 (465.7 GB, RAID 1, OK) physicaldrive 1I:0:1 (port 1I:box 0:bay 1, SATA, 500 GB, OK) physicaldrive 1I:0:2 (port 1I:box 0:bay 2, SATA, 500 GB, OK)