linux-ix-slave10 reporting a bad disk

RESOLVED FIXED

Status

Infrastructure & Operations
RelOps
RESOLVED FIXED
6 years ago
5 years ago

People

(Reporter: nthomas, Assigned: arr)

Tracking

Details

(Whiteboard: iX Systems RMA Ticket # XKP-590227)

Attachments

(1 attachment)

(Reporter)

Description

6 years ago
Created attachment 658028 [details]
BIOS screenshot

[I'm not sure whether to file this direct with DCOps, or if we need to source a replacement disk first]

On BIOS POST:

SATA Port 0 ST3250318AS  CC38
       S.M.A.R.T Capable and Status BAD
....
AHCI Port0 Device Error
Press F1 to Resume
----

Interestingly it does boot if you use F1 but seems like a bad idea to trust it, even on try.
(Reporter)

Comment 1

6 years ago
The nagios log has:
[09-03-2012 21:11:26] SERVICE ALERT: linux-ix-slave10.build.scl1;IDE S.M.A.R.T - /dev/sda1;CRITICAL;SOFT;1;CRITICAL - 1 Harddrive PreFailure Detected! 1/22 tests failed.
(Assignee)

Comment 2

6 years ago
DCOps: You guys handle drive replacements, correct?  This is an iX machine, so they'd be the folks to contact about a replacement.
Assignee: server-ops-releng → server-ops
Component: Server Operations: RelEng → Server Operations: DCOps
QA Contact: arich → dmoore

Updated

6 years ago
colo-trip: --- → scl1

Comment 3

6 years ago
An RMA ticket has been submitted with IX Systems.  

Ticket ID:  XKP-590227
(Reporter)

Comment 4

6 years ago
Is wiping the drive the normal process before it leaves the colo ?

Updated

6 years ago
Whiteboard: iX Systems RMA Ticket # XKP-590227

Comment 5

6 years ago
Nick,
Let me know when Linux-IX-slave10 is offline and I will remove the hard drive for replacement.

-Vinh
(Reporter)

Comment 6

6 years ago
Vinh, you can go ahead any time. It's sitting at a prompt during the BIOS starting.

Comment 7

6 years ago
Nick,
The hard drive has been replaced. Do you need us to do anything specific after?
(Reporter)

Comment 8

6 years ago
Great! Just a reimage to get the right content on it. Also, did you see comment #4 ?

Comment 9

6 years ago
Yes, I've wiped the drive before shipping it back.

Updated

6 years ago
Assignee: server-ops → vhua
Status: NEW → ASSIGNED
(Assignee)

Comment 10

6 years ago
I've reimaged the server.
Assignee: vhua → arich
Status: ASSIGNED → RESOLVED
Last Resolved: 6 years ago
Component: Server Operations: DCOps → Server Operations: RelEng
QA Contact: dmoore → arich
Resolution: --- → FIXED
Component: Server Operations: RelEng → RelOps
Product: mozilla.org → Infrastructure & Operations
You need to log in before you can comment on or make changes to this bug.