Closed Bug 759147 Opened 12 years ago Closed 12 years ago

kvm1.build.mtv1 has a bad disk

Categories

(mozilla.org Graveyard :: Server Operations, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: ericz, Assigned: ericz)

Details

(Whiteboard: [Ticket ID: XSB-290687])

kvm1.build.mtv1 has a bad 1.82TB SATA disk in it's array:

[root@kvm1.build.mtv1 ~]# tw_cli
//kvm1> show

Ctl   Model        (V)Ports  Drives   Units   NotOpt  RRate   VRate  BBU
------------------------------------------------------------------------
c6    9690SA-4I    2         2        1       1       1       1      OK       

//kvm1> show c6
Error: (CLI:041) Invalid shell command.

//kvm1> focus c6

//kvm1/c6> show

Unit  UnitType  Status         %RCmpl  %V/I/M  Stripe  Size(GB)  Cache  AVrfy
------------------------------------------------------------------------------
u0    RAID-1    DEGRADED       -       -       -       1862.63   RiW    ON     

VPort Status         Unit Size      Type  Phy Encl-Slot    Model
------------------------------------------------------------------------------
p0    OK             u0   1.82 TB   SATA  0   -            Hitachi HUA722020AL 
p1    DEVICE-ERROR   u0   1.82 TB   SATA  1   -            Hitachi HUA722020AL 

Name  OnlineState  BBUReady  Status    Volt     Temp     Hours  LastCapTest
---------------------------------------------------------------------------
bbu   On           Yes       OK        OK       OK       0      xx-xxx-xxxx
Open a ticket with IX Systems for a replacement.
Whiteboard: [Ticket ID: XSB-290687]
Assignee: server-ops → phong
I guess we took too long and it recovered on its own already.

[root@kvm1.build.mtv1 ~]# tw_cli 
//kvm1> show

Ctl   Model        (V)Ports  Drives   Units   NotOpt  RRate   VRate  BBU
------------------------------------------------------------------------
c6    9690SA-4I    2         2        1       0       1       1      OK       

//kvm1> focus c6

//kvm1/c6> show

Unit  UnitType  Status         %RCmpl  %V/I/M  Stripe  Size(GB)  Cache  AVrfy
------------------------------------------------------------------------------
u0    RAID-1    OK             -       -       -       1862.63   RiW    ON     

VPort Status         Unit Size      Type  Phy Encl-Slot    Model
------------------------------------------------------------------------------
p0    OK             u0   1.82 TB   SATA  0   -            Hitachi HUA722020AL 
p1    OK             u0   1.82 TB   SATA  1   -            Hitachi HUA722020AL 

Name  OnlineState  BBUReady  Status    Volt     Temp     Hours  LastCapTest
---------------------------------------------------------------------------
bbu   On           Yes       OK        OK       OK       0      xx-xxx-xxxx  

//kvm1/c6>
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → WORKSFORME
It's warning again.

Unit     UnitType  Status         %RCmpl  %V/I/M  VPort Stripe  Size(GB)
------------------------------------------------------------------------
u0       RAID-1    DEGRADED*      -       -       -     -       1862.63   
u0-0     DISK      OK             -       -       p0    -       1862.63   
u0-1     DISK      DEGRADED       -       -       p1    -       1862.63   
u0/v0    Volume    -              -       -       -     -       1862.63
Status: RESOLVED → REOPENED
Resolution: WORKSFORME → ---
It's the disk in p1 again:

[eziegenhorn@kvm1.build.mtv1 ~]$ sudo tw_cli 
//kvm1> show

Ctl   Model        (V)Ports  Drives   Units   NotOpt  RRate   VRate  BBU
------------------------------------------------------------------------
c6    9690SA-4I    2         2        1       1       1       1      OK       

//kvm1> show c6
Error: (CLI:041) Invalid shell command.

//kvm1> focus c6

//kvm1/c6> show

Unit  UnitType  Status         %RCmpl  %V/I/M  Stripe  Size(GB)  Cache  AVrfy
------------------------------------------------------------------------------
u0    RAID-1    DEGRADED       -       -       -       1862.63   RiW    ON     

VPort Status         Unit Size      Type  Phy Encl-Slot    Model
------------------------------------------------------------------------------
p0    OK             u0   1.82 TB   SATA  0   -            Hitachi HUA722020AL 
p1    DEVICE-ERROR   u0   1.82 TB   SATA  1   -            Hitachi HUA722020AL 

Name  OnlineState  BBUReady  Status    Volt     Temp     Hours  LastCapTest
---------------------------------------------------------------------------
bbu   On           Yes       OK        OK       OK       0      xx-xxx-xxxx  

I will open a ticket with IX Systems.
Assignee: phong → eziegenhorn
I just sent the drive back.
IX Systems ticket FWU-654089 opened for this.
Still waiting for the disk to ship from IX Systems.
Drive replaced.
Please verify the rebuilding status.
Looks like it is rebuilding:

//kvm1/c6> show

Unit  UnitType  Status         %RCmpl  %V/I/M  Stripe  Size(GB)  Cache  AVrfy
------------------------------------------------------------------------------
u0    RAID-1    REBUILDING     30%     -       -       1862.63   RiW    ON     

VPort Status         Unit Size      Type  Phy Encl-Slot    Model
------------------------------------------------------------------------------
p0    OK             u0   1.82 TB   SATA  0   -            Hitachi HUA722020AL 
p1    DEGRADED       u0   1.82 TB   SATA  1   -            Hitachi HUA722020AL 

I will check back later.
nagios-releng-scl1
kvm1.build.mtv1:TW RAID is OK: RAID Status on c6 u0: OK: 0 warnings, 0 errors

Raid just finished rebuilding.
Status: REOPENED → RESOLVED
Closed: 12 years ago12 years ago
Resolution: --- → FIXED
Product: mozilla.org → mozilla.org Graveyard
You need to log in before you can comment on or make changes to this bug.