Closed Bug 829014 Opened 13 years ago Closed 13 years ago

repair talos-r4-snow-041

Categories

(Infrastructure & Operations :: DCOps, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: bhearsum, Unassigned)

References

Details

Hardware diagnostics came up clean on this slave, but even after a re-image it's still failing tons of tests. Can we do any repairs even when diagnostics come up clean?
colo-trip: --- → scl1
>Hardware diagnostics came up clean on this slave, but even after a re-image it's still failing tons of tests. Can we do any repairs even when diagnostics come up clean? :bhearsum, I'm not sure what you mean. Do we spend hundreds of dollars randomly replacing parts? You have to keep in mind we're short staffed as everyone else so we don't really have time to play around with these boxes. Do you have any logs that would help us out? BTW, what is the threshold for decommissioning these nodes?
Should we just collect all of the ones that have come up with clean diags but that seem to be burnign builds and send them off to Computer Care for further diagnosis? I'm not sure if they'd do more than we're doing in house to diagnose...
Diagnostic test resulted with: Completed:Volume:Jan 24, 2013 1:40 PMMacintosh HD (319.73 GB /) CHECKED 3 suspected files found out of 1557 total files checked. Suspected files cannot be repaired and should either be deleted or replaced with a known good backup copy. File is corrupt or is an unsupported file format: /Users/cltbld/Library/Caches/TemporaryItems/device-storage-testing/RBERRSTV0CBKNZQXF45H.png Unknown character '{' (0x7b) in <integer> on line 10: /Library/Ruby/Gems/1.8/gems/facter-1.5.6/conf/osx/PackageInfo.plist Unknown character '{' (0x7b) in <integer> on line 10: /Library/Ruby/Gems/1.8/gems/puppet-0.24.8/conf/osx/PackageInfo.plist The File Structures test checks a wide variety of file types to ensure that they are structured correctly. If a file is not, the program reports the name of the file. Looks like some file structures are corrupt. I will reimage the host tomorrow.
These can be reimaged remotely as long as you can get root access to the machine. I've kicked off a reimage.
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Product: mozilla.org → Infrastructure & Operations
You need to log in before you can comment on or make changes to this bug.