Closed Bug 615744 Opened 15 years ago Closed 15 years ago

w32-ix-slave41 possible hard drive issue?

Categories

(mozilla.org Graveyard :: Server Operations, task)

Hardware: x86_64
OS: Linux
Type: task
Priority: Not set
Severity: normal

Tracking

(Not tracked)

RESOLVED DUPLICATE of bug 596366

People

(Reporter: catlee, Assigned: zandr)

References

Details

(Whiteboard: [badslave?][buildduty])

Maybe it has a slow disk? Needs some love in any case.
Disconnected it from the master. Looks like NTFS has indexing turned on, and none of the other tweaks listed here have been applied: https://wiki.mozilla.org/ReferencePlatforms/Win32#Add_drive_E
Disk reformatted, tweaks applied, and re-connected to buildbot-master1:8010.
It's done a full clone of m-c since reattaching, which took 38 minutes to complete vs. 18 minutes on ix-slave32. We should probably take this down to see if the hard drive is ok.
It's also failing make upload on shadow-central builds due to a host key error.
Whiteboard: [badslave?] → [badslave?][buildduty]
In http://tinderbox.mozilla.org/showlog.cgi?log=TraceMonkey/1291929389.1291934569.15182.gz, is it doing a mochitest-other run with something stealing focus?
I logged onto it and didn't find any dialog or focus stealing, but to be safe I've taken it offline for now. It also needs the host key error for shadow-central fixed.
Forwarding this to IT to look into the hard drive - see if it's one of the 'slow' ix machines? Also please re-image if the hard drive seems fine so we can try again with this one.
Assignee: nobody → server-ops
Component: Release Engineering → Server Operations
QA Contact: release → mrz
Summary: w32-ix-slave41 is failing a lot of jobs because of hg timing out → w32-ix-slave41 possible hard drive issue?
Assignee: server-ops → jdow
Assignee: jdow → zandr
Assignee: zandr → server-ops
Please don't assign bugs directly to me. Please assign them to the default for Server Operations. They'll get noticed sooner that way. Please do cc: me if you like.
Assignee: server-ops → zandr
Presumed drive failure. Imaging at 20MB/*minute* and falling.
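To put that imaging rate in perspective, here is a rough back-of-the-envelope sketch. It assumes the whole 250 GB drive (250059 MB, the size hdparm reports for this disk) would have to be copied and that the rate holds steady at 20 MB/minute. Neither assumption is confirmed in this bug, and since the rate is reported as falling, the real figure would likely be worse.

```python
# Rough estimate only. Assumptions (not confirmed in the bug):
#  - the full disk is imaged, not just the used blocks
#  - the rate stays at 20 MB/minute (the comment says it is falling)
DISK_SIZE_MB = 250059          # "device size with M = 1000*1000" from hdparm -I
IMAGING_RATE_MB_PER_MIN = 20   # imaging rate quoted in the comment


def imaging_minutes(size_mb: float, rate_mb_per_min: float) -> float:
    """Return the minutes needed to copy size_mb at a constant rate."""
    if rate_mb_per_min <= 0:
        raise ValueError("rate must be positive")
    return size_mb / rate_mb_per_min


minutes = imaging_minutes(DISK_SIZE_MB, IMAGING_RATE_MB_PER_MIN)
days = minutes / (60 * 24)
rate_mb_per_sec = IMAGING_RATE_MB_PER_MIN / 60

print(f"{minutes:.0f} minutes, about {days:.1f} days")
print(f"equivalent to {rate_mb_per_sec:.2f} MB/sec")
```

Under those assumptions a full image would take more than a week, which is why the drive is presumed to be failing rather than merely slow.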
root@sysresccd /root % hdparm -I /dev/sda

/dev/sda:

ATA device, with non-removable media
        Model Number:       ST3250318AS
        Serial Number:      6VY6XTQ3
        Firmware Revision:  CC38
        Transport:          Serial
Standards:
        Used: unknown (minor revision code 0x0029)
        Supported: 8 7 6 5
        Likely used: 8
Configuration:
        Logical         max     current
        cylinders       16383   16383
        heads           16      16
        sectors/track   63      63
        --
        CHS current addressable sectors:   16514064
        LBA    user addressable sectors:  268435455
        LBA48  user addressable sectors:  488397168
        Logical/Physical Sector size:           512 bytes
        device size with M = 1024*1024:      238475 MBytes
        device size with M = 1000*1000:      250059 MBytes (250 GB)
        cache/buffer size  = 8192 KBytes
        Nominal Media Rotation Rate: 7200
Capabilities:
        LBA, IORDY(can be disabled)
        Queue depth: 32
        Standby timer values: spec'd by Standard, no device specific minimum
        R/W multiple sector transfer: Max = 16  Current = 16
        Recommended acoustic management value: 254, current value: 0
        DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6
             Cycle time: min=120ns recommended=120ns
        PIO: pio0 pio1 pio2 pio3 pio4
             Cycle time: no flow control=120ns  IORDY flow control=120ns
Commands/features:
        Enabled Supported:
           *    SMART feature set
                Security Mode feature set
           *    Power Management feature set
           *    Write cache
           *    Look-ahead
           *    Host Protected Area feature set
           *    WRITE_BUFFER command
           *    READ_BUFFER command
           *    DOWNLOAD_MICROCODE
                SET_MAX security extension
           *    Automatic Acoustic Management feature set
           *    48-bit Address feature set
           *    Device Configuration Overlay feature set
           *    Mandatory FLUSH_CACHE
           *    FLUSH_CACHE_EXT
           *    SMART error logging
           *    SMART self-test
           *    General Purpose Logging feature set
           *    WRITE_{DMA|MULTIPLE}_FUA_EXT
           *    64-bit World wide name
                Write-Read-Verify feature set
           *    WRITE_UNCORRECTABLE_EXT command
           *    {READ,WRITE}_DMA_EXT_GPL commands
           *    Segmented DOWNLOAD_MICROCODE
           *    Gen1 signaling speed (1.5Gb/s)
           *    Gen2 signaling speed (3.0Gb/s)
           *    Native Command Queueing (NCQ)
           *    Phy event counters
                Device-initiated interface power management
           *    Software settings preservation
           *    SMART Command Transport (SCT) feature set
           *    SCT Long Sector Access (AC1)
           *    SCT LBA Segment Access (AC2)
           *    SCT Error Recovery Control (AC3)
           *    SCT Features Control (AC4)
           *    SCT Data Tables (AC5)
                unknown 206[12] (vendor specific)
Security:
        Master password revision code = 65534
                supported
        not     enabled
        not     locked
                frozen
        not     expired: security count
                supported: enhanced erase
        40min for SECURITY ERASE UNIT. 40min for ENHANCED SECURITY ERASE UNIT.
Logical Unit WWN Device Identifier: 5000c50027ecd6c7
        NAA             : 5
        IEEE OUI        : 000c50
        Unique ID       : 027ecd6c7
Checksum: correct

root@sysresccd /root % hdparm -tT /dev/sda

/dev/sda:
 Timing cached reads:   12796 MB in  2.00 seconds = 6411.91 MB/sec
 Timing buffered disk reads:   36 MB in  3.04 seconds =  11.83 MB/sec

root@sysresccd /root % hdparm -tT /dev/sda

/dev/sda:
 Timing cached reads:   2 MB in  2.19 seconds = 937.13 kB/sec
 Timing buffered disk reads:   54 MB in  3.01 seconds =  17.96 MB/sec

root@sysresccd /root % hdparm -tT /dev/sda

/dev/sda:
 Timing cached reads:   2 MB in  2.85 seconds = 718.25 kB/sec
 Timing buffered disk reads:   50 MB in  3.03 seconds =  16.47 MB/sec

root@sysresccd /root %

Note that with speed numbers like that, SMART is clean.
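For anyone wanting to check other slaves for the same symptom, the hdparm -tT timings above could be pulled out and compared with a small script. The following is a minimal sketch, not a tool used in this bug: the function names, the 50 MB/sec threshold, and the assumption that hdparm's kB means 1024 bytes are all illustrative guesses. The sample text is copied from the three runs above.

```python
import re

# Timing lines copied from the three hdparm -tT runs in this bug.
SAMPLE = """\
 Timing cached reads:   12796 MB in  2.00 seconds = 6411.91 MB/sec
 Timing buffered disk reads:   36 MB in  3.04 seconds =  11.83 MB/sec
 Timing cached reads:   2 MB in  2.19 seconds = 937.13 kB/sec
 Timing buffered disk reads:   54 MB in  3.01 seconds =  17.96 MB/sec
 Timing cached reads:   2 MB in  2.85 seconds = 718.25 kB/sec
 Timing buffered disk reads:   50 MB in  3.03 seconds =  16.47 MB/sec
"""

# Matches one timing line and captures the kind, the rate, and its unit.
LINE_RE = re.compile(
    r"Timing (cached|buffered disk) reads:.*=\s*([\d.]+)\s*(kB|MB)/sec"
)

# Assumption: kB here means 1024 bytes and MB means 1024 kB.
UNIT_TO_MB = {"kB": 1 / 1024, "MB": 1.0}

# Hypothetical cut-off chosen only for illustration; pick a value based on
# what known-good machines of the same model actually report.
MIN_BUFFERED_MB_PER_SEC = 50.0


def parse_hdparm_timings(text):
    """Return {'cached': [...], 'buffered disk': [...]} with rates in MB/sec."""
    results = {"cached": [], "buffered disk": []}
    for match in LINE_RE.finditer(text):
        kind, value, unit = match.groups()
        results[kind].append(float(value) * UNIT_TO_MB[unit])
    return results


def looks_slow(text, threshold=MIN_BUFFERED_MB_PER_SEC):
    """True if the mean buffered disk read rate is below the threshold."""
    rates = parse_hdparm_timings(text)["buffered disk"]
    if not rates:
        raise ValueError("no buffered disk read timings found")
    return sum(rates) / len(rates) < threshold


timings = parse_hdparm_timings(SAMPLE)
print("buffered disk reads (MB/sec):", timings["buffered disk"])
print("looks slow:", looks_slow(SAMPLE))
```

The sketch averages several runs instead of trusting a single one because individual readings vary, as the three runs above show. It only looks at throughput; as noted, SMART reported nothing wrong on this drive.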
Status: NEW → RESOLVED
Closed: 15 years ago
Resolution: --- → DUPLICATE
Product: mozilla.org → mozilla.org Graveyard