Closed Bug 783922 (b-linux64-hp-0026) Opened 12 years ago Closed 10 years ago

b-linux64-hp-0026 problem tracking

Categories

(Infrastructure & Operations Graveyard :: CIDuty, task, P3)

Tracking

(Not tracked)

RESOLVED WONTFIX

People

(Reporter: nthomas, Unassigned)

References

Details

(Whiteboard: [buildduty][buildslaves][capacity])

No description provided.
This machine is running builds, not sure if the hardware issues still persist.
Now sitting at the MoCo Network Installer boot screen.
Fixed the RAID config, back in the production pool.
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED
Nagios reported this box as down, and this was on the console:
  [Firmware Bug]: the BIOS has corrupted hw-PMU resources (MSR 38d is 330)
When I rebooted it, it got past grub and the same message flashed by, but the boot continued to the login prompt.

Turns out the firmware message is present many times in /var/log/messages on this slave and on bld-centos6-hp-008, so it seems to be normal for this class of slaves. No idea why it didn't complete the reboot this one time, though.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
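For anyone repeating the log check described in the comment above, here is a minimal sketch of counting the hw-PMU firmware message in a syslog-style file. The function names and the regular expression are assumptions written for illustration, not an existing tool; only the message text and the /var/log/messages path come from this bug.

```python
import re
from collections import Counter

# Matches the kernel message quoted in this bug; the MSR number and value
# are captured so that different variants are counted separately.
PMU_PATTERN = re.compile(
    r"\[Firmware Bug\]: the BIOS has corrupted hw-PMU resources "
    r"\(MSR ([0-9a-f]+) is ([0-9a-f]+)\)"
)


def count_pmu_messages(lines):
    """Count hw-PMU firmware messages, keyed by (msr, value)."""
    counts = Counter()
    for line in lines:
        match = PMU_PATTERN.search(line)
        if match:
            counts[(match.group(1), match.group(2))] += 1
    return counts


def count_in_file(path="/var/log/messages"):
    """Run the count over a log file (hypothetical helper; reading the
    default path usually requires root)."""
    with open(path, errors="replace") as handle:
        return count_pmu_messages(handle)
```

Running count_in_file() on two slaves of the same class and comparing the totals is one way to confirm that the message is routine rather than specific to this machine.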
(In reply to Nick Thomas [:nthomas] from comment #4)
> Nagios reported this box as down, and this was on the console:
>   [Firmware Bug]: the BIOS has corrupted hw-PMU resources (MSR 38d is 330)
> When I rebooted it it got past grub and there was a flash of the same
> message, but it continued to the login prompt. 
> 
> Turns out the Firmware message is present many times in /var/log/messages on
> this slave, and bld-centos6-hp-008, so it seems to be normal for this class
> of slaves. No idea why it didn't complete reboot this one time though.

Did you mean to re-open this bug, given that the slave is back in production? Maybe we should file a separate bug for the firmware message?
Closing since the machine is back in production.
Status: REOPENED → RESOLVED
Closed: 12 years ago → 11 years ago
Resolution: --- → FIXED
Hit the same issue as in comment #4 again; power cycled it.

This looks like the same bug known at HP:
 http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?lang=en&cc=us&objectID=c03265132
Basically a harmless RHEL6 issue on HP ProLiants. Over in bug 773390 we decided not to bother about it for a Socorro machine. So it is unlikely to be related to the occasional failure to boot.
Product: mozilla.org → Release Engineering
Reboot needed.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Depends on: 933784
It came back!
No longer depends on: 933784
running green builds
Status: REOPENED → RESOLVED
Closed: 11 years ago → 11 years ago
Resolution: --- → FIXED
https://tbpl.mozilla.org/php/getParsedLog.php?id=35654562&tree=Mozilla-Central

Out of space on device. Disabled in slavealloc.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
deleted some old build dirs, re-enabled, rebooted
Status: REOPENED → RESOLVED
Closed: 11 years ago → 10 years ago
Resolution: --- → FIXED
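The cleanup above (out of space, then deleting old build dirs) could be preceded by a check along these lines. This is a rough sketch only: the helper names, the age threshold, and any path passed in are assumptions, and it only lists candidate directories rather than deleting anything.

```python
import os
import shutil
import time


def percent_used(path):
    """Return the percentage of the filesystem holding `path` that is in use."""
    usage = shutil.disk_usage(path)
    return 100.0 * usage.used / usage.total


def stale_dirs(root, max_age_days, now=None):
    """Return immediate subdirectories of `root` whose modification time is
    older than `max_age_days`, oldest first. Nothing is deleted."""
    now = time.time() if now is None else now
    cutoff = now - max_age_days * 86400
    found = []
    with os.scandir(root) as entries:
        for entry in entries:
            if entry.is_dir(follow_symlinks=False):
                mtime = entry.stat(follow_symlinks=False).st_mtime
                if mtime < cutoff:
                    found.append((mtime, entry.path))
    return [path for _, path in sorted(found)]
```

For example, calling stale_dirs on the slave's build directory (whatever its real location is) with max_age_days=14 would list build dirs untouched for two weeks, which can then be reviewed before removal.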
Follow up in bug 980091.
Alias: bld-centos6-hp-007 → b-linux64-hp-0026
Summary: bld-centos6-hp-007 problem tracking → b-linux64-hp-0026 problem tracking
Out of space. Disabled.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Cleaned up, re-enabled.
Status: REOPENED → RESOLVED
Closed: 10 years ago → 10 years ago
Resolution: --- → FIXED
Please do not re-enable this slave. We are retiring linux hardware build slaves in bug 1106922.
Blocks: 1106922
Resolution: FIXED → WONTFIX
Product: Release Engineering → Infrastructure & Operations
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard