596366 - (ix-drive-issues) latest batch of ix machines have slow and failing drives

Reporter

Description

•

14 years ago

I noticed during the 3.6.10 release that the latest batch of ix machines seem to run slower in terms of disk speed than the other ones. Some timing from hdparm: [root@mv-moz2-linux-ix-slave03 ~]# hdparm -tT /dev/sda /dev/sda: Timing cached reads: 29532 MB in 2.00 seconds = 14801.87 MB/sec Timing buffered disk reads: 360 MB in 3.00 seconds = 119.85 MB/sec [root@mv-moz2-linux-ix-slave03 ~]# hdparm -tT /dev/sda /dev/sda: Timing cached reads: 29524 MB in 2.00 seconds = 14797.74 MB/sec Timing buffered disk reads: 360 MB in 3.01 seconds = 119.80 MB/sec [root@mv-moz2-linux-ix-slave03 ~]# hdparm -tT /dev/sda /dev/sda: Timing cached reads: 29480 MB in 2.00 seconds = 14776.34 MB/sec Timing buffered disk reads: 356 MB in 3.01 seconds = 118.21 MB/sec --------------- [root@linux-ix-slave07 ~]# hdparm -tT /dev/sda /dev/sda: Timing cached reads: 29336 MB in 1.99 seconds = 14738.42 MB/sec Timing buffered disk reads: 256 MB in 3.02 seconds = 84.76 MB/sec [root@linux-ix-slave07 ~]# hdparm -tT /dev/sda /dev/sda: Timing cached reads: 29336 MB in 1.99 seconds = 14738.13 MB/sec Timing buffered disk reads: 262 MB in 3.01 seconds = 86.98 MB/sec [root@linux-ix-slave07 ~]# hdparm -tT /dev/sda /dev/sda: Timing cached reads: 29332 MB in 1.99 seconds = 14738.71 MB/sec Timing buffered disk reads: 270 MB in 3.03 seconds = 89.09 MB/sec As far as I can tell, they're set-up exactly the same as the other ones, down to the hard drive firmware level. The filesystems are ext3, mounted with noatime. Haven't dug further than this.

Mike Taylor [:bear]

Updated

•

14 years ago

Whiteboard: [buildslaves][hardware]

John O'Duinn [:joduinn] (please use "needinfo?" flag)

Comment 1

•

14 years ago

mrz: I thought these machines were identical to the last batch. What is different about these new ix machines?

Assignee: nobody → mrz

Status: ASSIGNED → NEW

Component: Release Engineering → Server Operations

OS: Mac OS X → All

QA Contact: release → mrz

matthew zeier [:mrz]

Comment 2

•

14 years ago

Nothing.

Assignee: mrz → nobody

Component: Server Operations → Release Engineering

QA Contact: mrz → release

bhearsum@mozilla.com (:bhearsum)

Reporter

Comment 3

•

14 years ago

linux-ix-slave17 is repeatedly getting hg into an uninterruptible sleep when cloning a mozilla-1.9.2 for 3.6.11 tagging. Breaks tagging and requires a reboot. We should get IT to run diagnostics on at least one of these machines.

Severity: normal → major

bhearsum@mozilla.com (:bhearsum)

Reporter

Updated

•

14 years ago

Depends on: 601123

Aki Sasaki (not active)

Comment 4

•

14 years ago

I strongly suspect this bug for wasting most of my day today. (timeout after rm -rf of ~17mb took >20min; 20min timeouts in several compiles in a row on linux-ix-slave02 that worked perfectly on mv-moz2-linux-ix-slave01.)

John Ford [:jhford] CET/CEST Berlin Time

Comment 5

•

14 years ago

I did a quick set of tests and it looks like this might be a more widespread issue affecting all new IX boxes. While nothing else was running on the machines, I ran the following two commands on both the linux and win32 machines: time hg clone http://hg.mozilla.org/mozilla-central freshclone time hg clone --pull --uncompressed freshclone copy I found that the new batch of machines are in both cases slower than the original batch of ix machines. On linux-ix-slave02, the second command took 10 times longer than the old machines. The windows tests showed that the local clone operation took nearly twice as long. The breakdown of real, user and sys times was only available on the linux machines. More detailed results below. Win32 ==================================================== on mw32-ix-slave01, hg clone http://.../mozilla-central freshclone took 12m38. on w32-ix-slave02, hg clone http://.../mozilla-central freshclone took 14m31. on mw32-ix-slave01, hg clone --pull --uncompressed freshclone copy took 10m50 on w32-ix-slave02, hg clone --pull --uncompressed freshclone copy took 18m52 Linux ==================================================== on mv-moz2-linux-ix-slave04, hg clone http://.../mozilla-central freshclone took real 4m5 user 2m38 sys 0m11 on linux-ix-slave02, hg clone http://.../mozilla-central freshclone took real 43m27 user 3m0 sys 0m11 on mv-moz2-linux-ix-slave04, hg clone --pull --uncompressed freshclone copy took real 4m35 user 3m23 sys 0m12 on linux-ix-slave02, hg clone --pull --uncompressed freshclone copy took real 14m56 user 3m49 sys 0m12

Summary: latest batch of linux ix machines seem to have slower disks → latest batch of ix machines have slow i/o

John O'Duinn [:joduinn] (please use "needinfo?" flag)

Comment 6

•

14 years ago

jabba/jlazaro: from comment#2, mrz asserts the hardware is identical. Is there any diagnostics that can be run on these to explain the performance different? Or is there anything different about how these machines were imaged ? Marking as critical, as its causing intermittent timeouts/hangs in production.

Assignee: nobody → server-ops

Severity: major → critical

Component: Release Engineering → Server Operations

QA Contact: release → mrz

Justin Lazaro [:jlaz] (use needinfo)

Updated

•

14 years ago

Assignee: server-ops → jlazaro

Justin Lazaro [:jlaz] (use needinfo)

Comment 7

•

14 years ago

Contacted IX support via email, since this is hardware related

John O'Duinn [:joduinn] (please use "needinfo?" flag)

Comment 8

•

14 years ago

linux-ix-slave16 was taken out of production by nthomas last night, because its was taking 6 hours for a Linux maple leak test build (clobber) last night. See attached bug#601623 for history of linux-ix-slave16 being sick a couple of weeks ago.

Comment 9

•

14 years ago

Just took mv-moz2-linux-ix-slave02 and linux-ix-slave31 offline to loan out for investigation.

Lukas Blakk [:lsblakk] use ?needinfo

Comment 10

•

14 years ago

also handed off linux-ix-slave14

Justin Lazaro [:jlaz] (use needinfo)

Comment 11

•

14 years ago

Confirming Lukas's comment mv-moz2-linux-slave02 linux-ix-slave14 linux-ix-slave31 (scl) These machines were taken by Chris Williams from IX Systems today to investigate the i/o issues, will report back when I receive an update from IX

Justin Lazaro [:jlaz] (use needinfo)

Updated

•

14 years ago

Assignee: jlazaro → server-ops

Dave Miller [:justdave]

Updated

•

14 years ago

Assignee: server-ops → jlazaro

John O'Duinn [:joduinn] (please use "needinfo?" flag)

Comment 12

•

14 years ago

from email with matt@ix systems: They've confirmed performance differences on the few machines they took back from office to test. More debugging ongoing.

John Ford [:jhford] CET/CEST Berlin Time

Comment 13

•

14 years ago

Some production fallout in bug 606716.

Blocks: 606716

Justin Lazaro [:jlaz] (use needinfo)

Comment 14

•

14 years ago

In order for IX to continue debugging these issues, we'll need to run tests on 37 machines with these serial numbers: Group A A1-14132 A1-14134 A1-14136 A1-14138 A1-14139 A1-14141 A1-16051 A1-16063 A1-16094 A1-16098 A1-16105 A1-16188 A1-16189 Group B A1-14147 A1-14154 A1-14168 A1-14171 A1-14174 A1-14175 A1-16056 A1-16114 A1-16132 A1-16171 A1-16205 A1-16213 Group C A1-14128 A1-14145 A1-14146 A1-14152 A1-14153 A1-14166 A1-16061 A1-16082 A1-16095 A1-16149 A1-16151 A1-16212 Test: hdparm -I /dev/sda hdparm -tT /dev/sda We're hoping most of the machines from each group are linux machines since we don't have a tool for testing i/o on Windows. Would we need to schedule a downtime for this? Is this a concern that we won't have accurate results if these machines are in active production?

John Ford [:jhford] CET/CEST Berlin Time

Comment 15

•

14 years ago

hdparm is available for windows iirc. I dont know how to map those serials to hostnames, do we have a way to do that? I can remove the machines from production so that we get mostly-idle testing done.

John Ford [:jhford] CET/CEST Berlin Time

Comment 16

•

14 years ago

Attached file hdparm for windows — Details

Here is hdparm for windows. It requires administrator permissions to run and will require cygwin1.dll to either be in the same directory or in the %PATH% system variable. I have checked and it looks like cygwin1.dll is the only dependency (other than kernel32.dll).

Justin Lazaro [:jlaz] (use needinfo)

Comment 17

•

14 years ago

It looks like joduinn/buildduty will be working to get these tests/results to IX. Although these machines are in inventory, the "quick search" option does not allow us to search by serial number ( https://bugzilla.mozilla.org/show_bug.cgi?id=607050 ) We might have a spreadsheet with the hostnames and serial numbers to reference by, and forward that to buildduty/joduinn once I find this info.

Justin Lazaro [:jlaz] (use needinfo)

Updated

•

14 years ago

Assignee: jlazaro → joduinn

Justin Lazaro [:jlaz] (use needinfo)

Updated

•

14 years ago

Component: Server Operations → Release Engineering

QA Contact: mrz → release

John Ford [:jhford] CET/CEST Berlin Time

Comment 18

•

14 years ago

Do you have the list of slaves which those serial numbers?

John Ford [:jhford] CET/CEST Berlin Time

Comment 19

•

14 years ago

Please throw back to Release Engineering when you have the list.

Assignee: joduinn → server-ops

Component: Release Engineering → Server Operations

QA Contact: release → mrz

Justin Lazaro [:jlaz] (use needinfo)

Updated

•

14 years ago

Assignee: server-ops → jlazaro

Justin Lazaro [:jlaz] (use needinfo)

Comment 20

•

14 years ago

Group A A1-14132 mv-moz2-linux-ix-slave12 A1-14134 mw32-ix-slave17 A1-14136 mw32-ix-slave13 A1-14138 mv-moz2-linux-ix-slave15 A1-14139 mv-moz2-linux-ix-slave02 A1-14141 mv-moz2-linux-ix-slave11 A1-16051 w32-ix-slave05 A1-16063 w32-ix-slave17 A1-16094 w32-ix-slave31 A1-16098 w32-ix-slave35 A1-16105 w32-ix-slave42 A1-16188 linux64-ix-slave16 A1-16189 linux64-ix-slave17 Group B A1-14147 mv-moz2-linux-ix-slave08 A1-14154 mw64-ix-slave01 A1-14168 mw32-ix-slave07 A1-14171 mw32-ix-slave05 A1-14174 mw32-ix-slave10 A1-14175 mw32-ix-slave18 A1-16056 w32-ix-slave10 A1-16114 w64-ix-slave09 A1-16132 w64-ix-slave27 A1-16171 linux-ix-slave41 A1-16205 linux64-ix-slave33 A1-16213 linux64-ix-slave41 Group C A1-14128 mw32-ix-slave23 A1-14145 mw32-ix-slave11 A1-14146 mw32-ix-slave22 A1-14152 mv-moz2-linux-ix-slave23 A1-14153 mv-moz2-linux-ix-slave19 A1-14166 mv-moz2-linux-ix-slave13 A1-16061 w32-ix-slave15 A1-16082 linux-ix-slave11 A1-16095 w32-ix-slave32 A1-16149 linux-ix-slave19 A1-16151 linux-ix-slave21 A1-16212 linux64-ix-slave40

Justin Lazaro [:jlaz] (use needinfo)

Updated

•

14 years ago

Assignee: jlazaro → nobody

Component: Server Operations → Release Engineering

QA Contact: mrz → release

John Ford [:jhford] CET/CEST Berlin Time

Comment 21

•

14 years ago

I will start looking at this

Assignee: nobody → jhford

John Ford [:jhford] CET/CEST Berlin Time

Comment 22

•

14 years ago

[root@mv-moz2-linux-ix-slave12 ~]# hdparm -I /dev/sda /dev/sda: ATA device, with non-removable media Model Number: ST3250318AS Serial Number: 5VY0LB7E Firmware Revision: CC45 Transport: Serial Standards: Supported: 8 7 6 5 Likely used: 8 Configuration: Logical max current cylinders 16383 16383 heads 16 16 sectors/track 63 63 -- CHS current addressable sectors: 16514064 LBA user addressable sectors: 268435455 LBA48 user addressable sectors: 488397168 device size with M = 1024*1024: 238475 MBytes device size with M = 1000*1000: 250059 MBytes (250 GB) Capabilities: LBA, IORDY(can be disabled) Queue depth: 32 Standby timer values: spec'd by Standard, no device specific minimum R/W multiple sector transfer: Max = 16 Current = ? Recommended acoustic management value: 208, current value: 208 DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 Cycle time: min=120ns recommended=120ns PIO: pio0 pio1 pio2 pio3 pio4 Cycle time: no flow control=120ns IORDY flow control=120ns Commands/features: Enabled Supported: * SMART feature set Security Mode feature set * Power Management feature set * Write cache * Look-ahead * Host Protected Area feature set * WRITE_BUFFER command * READ_BUFFER command * DOWNLOAD_MICROCODE SET_MAX security extension * Automatic Acoustic Management feature set * 48-bit Address feature set * Device Configuration Overlay feature set * Mandatory FLUSH_CACHE * FLUSH_CACHE_EXT * SMART error logging * SMART self-test * General Purpose Logging feature set * WRITE_{DMA|MULTIPLE}_FUA_EXT * 64-bit World wide name Write-Read-Verify feature set * WRITE_UNCORRECTABLE command * {READ,WRITE}_DMA_EXT_GPL commands * Segmented DOWNLOAD_MICROCODE * SATA-I signaling speed (1.5Gb/s) * SATA-II signaling speed (3.0Gb/s) * Native Command Queueing (NCQ) * Phy event counters Device-initiated interface power management * Software settings preservation Security: Master password revision code = 65534 supported not enabled not locked not frozen not expired: security count supported: enhanced erase 40min for SECURITY ERASE UNIT. 40min for ENHANCED SECURITY ERASE UNIT. Checksum: correct [root@mv-moz2-linux-ix-slave12 ~]# hdparm -tT /dev/sda /dev/sda: Timing cached reads: 29384 MB in 1.99 seconds = 14728.98 MB/sec Timing buffered disk reads: 376 MB in 3.00 seconds = 125.13 MB/sec

John Ford [:jhford] CET/CEST Berlin Time

Comment 23

•

14 years ago

A1-14134 mw32-ix-slave17 A1-14136 mw32-ix-slave13 are both unreachable

John Ford [:jhford] CET/CEST Berlin Time

Comment 24

•

14 years ago

[root@mv-moz2-linux-ix-slave15 ~]# hdparm -I /dev/sda /dev/sda: ATA device, with non-removable media Model Number: ST3250318AS Serial Number: 5VY17LN3 Firmware Revision: CC45 Transport: Serial Standards: Supported: 8 7 6 5 Likely used: 8 Configuration: Logical max current cylinders 16383 16383 heads 16 16 sectors/track 63 63 -- CHS current addressable sectors: 16514064 LBA user addressable sectors: 268435455 LBA48 user addressable sectors: 488397168 device size with M = 1024*1024: 238475 MBytes device size with M = 1000*1000: 250059 MBytes (250 GB) Capabilities: LBA, IORDY(can be disabled) Queue depth: 32 Standby timer values: spec'd by Standard, no device specific minimum R/W multiple sector transfer: Max = 16 Current = ? Recommended acoustic management value: 208, current value: 208 DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 Cycle time: min=120ns recommended=120ns PIO: pio0 pio1 pio2 pio3 pio4 Cycle time: no flow control=120ns IORDY flow control=120ns Commands/features: Enabled Supported: * SMART feature set Security Mode feature set * Power Management feature set * Write cache * Look-ahead * Host Protected Area feature set * WRITE_BUFFER command * READ_BUFFER command * DOWNLOAD_MICROCODE SET_MAX security extension * Automatic Acoustic Management feature set * 48-bit Address feature set * Device Configuration Overlay feature set * Mandatory FLUSH_CACHE * FLUSH_CACHE_EXT * SMART error logging * SMART self-test * General Purpose Logging feature set * WRITE_{DMA|MULTIPLE}_FUA_EXT * 64-bit World wide name Write-Read-Verify feature set * WRITE_UNCORRECTABLE command * {READ,WRITE}_DMA_EXT_GPL commands * Segmented DOWNLOAD_MICROCODE * SATA-I signaling speed (1.5Gb/s) * SATA-II signaling speed (3.0Gb/s) * Native Command Queueing (NCQ) * Phy event counters Device-initiated interface power management * Software settings preservation Security: Master password revision code = 65534 supported not enabled not locked not frozen not expired: security count supported: enhanced erase 40min for SECURITY ERASE UNIT. 40min for ENHANCED SECURITY ERASE UNIT. Checksum: correct [root@mv-moz2-linux-ix-slave15 ~]# hdparm -tT /dev/sda /dev/sda: Timing cached reads: 29336 MB in 2.00 seconds = 14704.02 MB/sec Timing buffered disk reads: 278 MB in 3.00 seconds = 92.62 MB/sec

John Ford [:jhford] CET/CEST Berlin Time

Comment 25

•

14 years ago

A1-14139 mv-moz2-linux-ix-slave02 is unreachable

John Ford [:jhford] CET/CEST Berlin Time

Comment 26

•

14 years ago

[root@mv-moz2-linux-ix-slave11 ~]# hdparm -I /dev/sda /dev/sda: ATA device, with non-removable media Model Number: ST3250318AS Serial Number: 5VY0LAK8 Firmware Revision: CC45 Transport: Serial Standards: Supported: 8 7 6 5 Likely used: 8 Configuration: Logical max current cylinders 16383 16383 heads 16 16 sectors/track 63 63 -- CHS current addressable sectors: 16514064 LBA user addressable sectors: 268435455 LBA48 user addressable sectors: 488397168 device size with M = 1024*1024: 238475 MBytes device size with M = 1000*1000: 250059 MBytes (250 GB) Capabilities: LBA, IORDY(can be disabled) Queue depth: 32 Standby timer values: spec'd by Standard, no device specific minimum R/W multiple sector transfer: Max = 16 Current = ? Recommended acoustic management value: 208, current value: 208 DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 Cycle time: min=120ns recommended=120ns PIO: pio0 pio1 pio2 pio3 pio4 Cycle time: no flow control=120ns IORDY flow control=120ns Commands/features: Enabled Supported: * SMART feature set Security Mode feature set * Power Management feature set * Write cache * Look-ahead * Host Protected Area feature set * WRITE_BUFFER command * READ_BUFFER command * DOWNLOAD_MICROCODE SET_MAX security extension * Automatic Acoustic Management feature set * 48-bit Address feature set * Device Configuration Overlay feature set * Mandatory FLUSH_CACHE * FLUSH_CACHE_EXT * SMART error logging * SMART self-test * General Purpose Logging feature set * WRITE_{DMA|MULTIPLE}_FUA_EXT * 64-bit World wide name Write-Read-Verify feature set * WRITE_UNCORRECTABLE command * {READ,WRITE}_DMA_EXT_GPL commands * Segmented DOWNLOAD_MICROCODE * SATA-I signaling speed (1.5Gb/s) * SATA-II signaling speed (3.0Gb/s) * Native Command Queueing (NCQ) * Phy event counters Device-initiated interface power management * Software settings preservation Security: Master password revision code = 65534 supported not enabled not locked not frozen not expired: security count supported: enhanced erase 42min for SECURITY ERASE UNIT. 42min for ENHANCED SECURITY ERASE UNIT. Checksum: correct [root@mv-moz2-linux-ix-slave11 ~]# hdparm -tT /dev/sda /dev/sda: Timing cached reads: 29480 MB in 1.99 seconds = 14776.95 MB/sec Timing buffered disk reads: 370 MB in 3.01 seconds = 123.04 MB/sec

Chris Cooper [:coop] (he/him)

Updated

•

14 years ago

Whiteboard: [buildslaves][hardware] → [buildslaves][hardware][triagefollowup]

Chris Cooper [:coop] (he/him)

Comment 27

•

14 years ago

This is pretty important and we need to make progress here. Can we start taking these machines offline in batches, i.e. gracefully shutdown one slave from each platform (linux, linux64, win32, win64), run the diagnostics, add those slave back to the pool, and then move on to the next batch? Also, it's probably not ideal to post the results for each slave in the bug. I'd suggest creating a subdir for the output logs on people.mozilla.com and linking to it from the bug. Not fun, I realize, but required.

Priority: -- → P3

Whiteboard: [buildslaves][hardware][triagefollowup] → [buildslaves][hardware]

Nick Thomas [:nthomas] (UTC+12)

Comment 28

•

14 years ago

I really think we should put some time into this bug ? Some examples of wrongness I've seen today * linux-ix-slave13 taking 2+ hours to do a 1.9.2 unit test build, holding rs up * w32-ix-slave16 taking 4hrs 20 mins to compile a try opt build Need some data so that IX know what to fix up.

John Ford [:jhford] CET/CEST Berlin Time

Comment 29

•

14 years ago

maybe we could do this during the mega-downtime this comming friday

Assignee: jhford → nobody

Chris Cooper [:coop] (he/him)

Comment 30

•

14 years ago

(In reply to comment #29) > maybe we could do this during the mega-downtime this comming friday Sure, but let's be specific here: * who's going to be around to do this on Friday, given that many of us are traveling? * will it be IT or RelEng running the tests? * is it just hdparm output we're looking for, or are there other tests we could/should be running?

Chris AtLee [:catlee]

Comment 31

•

14 years ago

I think I posted this somewhere else, but can't find it right now... Since the IX machines can boot off an image provided via the ipmi interface, we could boot off something like http://www.sysresccd.org/, which provides hdparm.

John O'Duinn [:joduinn] (please use "needinfo?" flag)

Comment 32

•

14 years ago

(In reply to comment #30) > (In reply to comment #29) > > maybe we could do this during the mega-downtime this comming friday > > Sure, but let's be specific here: > > * who's going to be around to do this on Friday, given that many of us are > traveling? > * will it be IT or RelEng running the tests? Debugging this hardware difference is something for IT. Pushing to zandr after talking with him on irc. > * is it just hdparm output we're looking for, or are there other tests we > could/should be running?

Assignee: nobody → zandr

Chris Cooper [:coop] (he/him)

Comment 33

•

14 years ago

zandr: if we need to take (more of) these out of service at some point to get this done, just let me know.

Component: Release Engineering → Server Operations

QA Contact: release → mrz

hdparm for windows 14 years ago John Ford [:jhford] CET/CEST Berlin Time 948.59 KB, application/octet-stream		Details
Data collected from Linux64 part 1 14 years ago Spencer Hui 1.67 MB, application/zip		Details
Data collected from Linux64 part 2 14 years ago Spencer Hui 1.73 MB, application/zip		Details
Data collected from W64 part 1 14 years ago Spencer Hui 1.31 MB, application/zip		Details
Data collected from W64 part 2 14 years ago Spencer Hui 1.57 MB, application/zip		Details