please run hardware diagnostics on talos-mtnlion-r5-022 and reimage

RESOLVED FIXED

Status

Infrastructure & Operations
DCOps
RESOLVED FIXED
3 years ago
3 years ago

People

(Reporter: philor, Unassigned)

Tracking

Details

(Whiteboard: drive replaced)

(Reporter)

Description

3 years ago
Though I don't know why, other than autocomplete, I'm saying "and reimage" - according to a screenshot from a failed test (bug 799799 comment 9) it has a dialog up saying that it has mounted the disk read-only, so it's going to turn out bad.

Comment 1

3 years ago
Saw a message about HDD error.  Going to run HDD diags first. SATA(0,0)
colo-trip: --- → scl3
Whiteboard: Running hard disk diagnostics. SATA(0,0)

Comment 2

3 years ago
results are inconclusive as both drives passed diags when tested individually... running diags again on both disks at same time.
Whiteboard: Running hard disk diagnostics. SATA(0,0) → Running hard disk diagnostics on both disks

Comment 3

3 years ago
both drives passed several tests, reimaged host. please give it another try and let me know if the RO issues persist.

vans-MacBook-Pro:~ vle$ fping talos-mtnlion-r5-022.test.releng.scl3.mozilla.com
talos-mtnlion-r5-022.test.releng.scl3.mozilla.com is alive
vans-MacBook-Pro:~ vle$ ssh !$
ssh talos-mtnlion-r5-022.test.releng.scl3.mozilla.com
The authenticity of host 'talos-mtnlion-r5-022.test.releng.scl3.mozilla.com (10.26.56.42)' can't be established.
RSA key fingerprint is 5c:1c:6d:34:ba:c7:38:29:5a:50:e7:43:6b:a6:62:13.
Are you sure you want to continue connecting (yes/no)?
Status: NEW → RESOLVED
Last Resolved: 3 years ago
Resolution: --- → FIXED
Whiteboard: Running hard disk diagnostics on both disks
(Reporter)

Comment 4

3 years ago
Exact same thing again with the "you can read from this disk, but I'm not going to let you write to it" dialog after a dozen jobs.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---

Comment 5

3 years ago
we'll replace the drives on this host and hopefully the results are different.
Whiteboard: bad drive

Comment 6

3 years ago
(In reply to Van Le [:van] from comment #5)
> we'll replace the drives on this host and hopefully the results are
> different.

Did this happen? The bug is still open, so I'm not sure.
Depends on: 1118354

Comment 7

3 years ago
replaced the drives but we arent able to image until 1118354 is resolved.
Whiteboard: bad drive → drive replaced

Comment 8

3 years ago
Host has been reimaged.


ssh talos-mtnlion-r5-022.test.releng.scl3.mozilla.com
The authenticity of host 'talos-mtnlion-r5-022.test.releng.scl3.mozilla.com (10.26.56.42)' can't be established.
RSA key fingerprint is e5:e2:d5:b1:55:cb:c5:17:ef:8c:98:a6:59:fb:c4:16.
Are you sure you want to continue connecting (yes/no)?
Status: REOPENED → RESOLVED
Last Resolved: 3 years ago3 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.