Closed Bug 1483746 (t-yosemite-r7-092) Opened 6 years ago Closed 6 years ago

[MDC2] t-yosemite-r7-092 problem tracking

Categories

(Infrastructure & Operations Graveyard :: CIDuty, defect)

defect
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: rmutter, Unassigned)

References

Details

When trying to ssh into machine, we're facing stdio error.
Blocks: 1473358
the problem still persists. Today, I've tried to connect to it and I've received the following error : Stdio forwarding request failed: Session open refused by peer ssh_exchange_identification: Connection closed by remote host
Alias: t-yosemite-r7-092
Depends on: 1485140
This machine is running tasks.
Status: NEW → RESOLVED
Closed: 6 years ago
Resolution: --- → FIXED
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Depends on: 1492152
machine seems to be up and running tasks. we will close the bug for now. If the problem will persist in the future, we will re-open the bug.
Status: REOPENED → RESOLVED
Closed: 6 years ago6 years ago
Resolution: --- → FIXED
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Depends on: 1496263
the machine seems to be up and running and taking jobs https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/gecko-t-osx-1010/workers/mdc2/t-yosemite-r7-092 We will close the bug for now. If the problem will persist in the future, we will re-open this bug.
Status: REOPENED → RESOLVED
Closed: 6 years ago6 years ago
Resolution: --- → FIXED
Worker is in a bad state once again. I'm unable to ssh and machine is not responding to ping.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Depends on: 1513668
reimaged
Status: REOPENED → RESOLVED
Closed: 6 years ago6 years ago
Resolution: --- → FIXED

Machine is in a bad state and has been quarantined.
Taskcluster link
The worker is burning tests and cause is not known ATM.

Status: RESOLVED → REOPENED
Resolution: FIXED → ---

Compared to other workers, this is the only odd thing I can see in the logs atm:
Mar 23 04:30:25 t-yosemite-r7-092.test.releng.mdc2.mozilla.com identityservicesd: [Warning] DeviceHeartbeat - Failed loading device heartbeat data from keychain -25300
I'm going to reimage the machine for the moment and I'll see the outcome.

Few hours later, everything looks green.

Status: REOPENED → RESOLVED
Closed: 6 years ago6 years ago
Resolution: --- → FIXED

I've quarantined this machine, it was running multiple tasks at the same time:

https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/gecko-t-osx-1010/workers/mdc2/t-yosemite-r7-092

Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Depends on: 1540372
Type: task → defect

removed from quarantine,rebooted and blessed the machine...will keep on monitoring it

I've reimaged the machine and removed it from quarantine. Monitored it.
It didn't took multiple jobs any longer.

https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/gecko-t-osx-1010/workers/mdc2/t-yosemite-r7-092

I will close the bug for now. If the issue will persist in the future, we will re-open the bug.

Status: REOPENED → RESOLVED
Closed: 6 years ago6 years ago
Resolution: --- → FIXED
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in before you can comment on or make changes to this bug.