Closed Bug 863985 Opened 12 years ago Closed 12 years ago

RMA disk for node26.peach.metrics.scl3

Categories

(Infrastructure & Operations :: DCOps, task)

x86
Linux
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: ericz, Assigned: sal)

Details

node26.peach.metrics.scl3 /data2 had a bad entry in /etc/fstab. The box refused to boot fully because it couldn't find the bad UUID. I commented it out temporarily so the box would boot.
It looks like the root cause here is a bad disk.
Summary: node26.peach.metrics.scl3 /data2 bad entry in /etc/fstab → node26.peach.metrics.scl3 /data2 bad disk
Punting to DC Ops. This disk needs to be RMA'd: Event: 7 Added: 04/20/2013 07:53 CRITICAL: Drive Array Subsystem - Internal Storage Enclosure Device Failure (Bay 2, Box 1, Port 1I, Slot 0) Also, there is this issue: Event: 1 Added: 04/20/2013 07:51 CRITICAL: CPU - Uncorrectable Machine Check Exception (Board 0, Processor 1, APIC ID 0x00000020, Bank 0x00000004, Status 0xBA000080'00020C0F, Address 0x00000000'00000000, Misc 0xC0040FFE'01000000). Event: 5 Added: 04/20/2013 07:51 CAUTION: POST Messages - POST Error: The system experienced an unexpected reboot. The Integrated Management Log (IML) may contain an entry indicating additional information about this reboot..
Assignee: nobody → server-ops-dcops
Group: metrics-private
Component: Metrics Operations → Server Operations: DCOps
Product: Mozilla Metrics → mozilla.org
QA Contact: dmoore
Target Milestone: Unreviewed → ---
Version: unspecified → other
Summary: node26.peach.metrics.scl3 /data2 bad disk → node26.peach.metrics.scl3 /data2 bad disk and uncorrectable MCE
colo-trip: --- → scl3
Addressing MCE error in Bug 864019. Renaming this as just a drive RMA.
Summary: node26.peach.metrics.scl3 /data2 bad disk and uncorrectable MCE → RMA disk for node26.peach.metrics.scl3
case number 4644681954 opened for HDD replacement.
pinged ericz after replacing drive.
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Case ID is 4644718628 for new drive.
It should arrive by Monday. [Friday, April 26, 2013 9:59 AM] -- Venkatesh A says: Van, I see the server has Next business day coverage warranty with us and hence you can expect the hard drive delivery latest by EOD of monday. (04/29/2013). [Friday, April 26, 2013 9:59 AM] -- Van Le says: ok if you can get it to us sooner, be awesome [Friday, April 26, 2013 10:00 AM] -- Venkatesh A says: I will update the same on the case notes to priortise the shipment Van, but would not be able to assure on the same.
Swapped drive (Bay 2, Box 1) please close the bug after you confirm.
Working now.
Status: REOPENED → RESOLVED
Closed: 12 years ago12 years ago
Resolution: --- → FIXED
Assignee: server-ops-dcops → sespinoza
Product: mozilla.org → Infrastructure & Operations
You need to log in before you can comment on or make changes to this bug.