Closed
Bug 1480658
(t-yosemite-r7-045)
Opened 7 years ago
Closed 7 years ago
[MDC2] t-yosemite-r7-045 Problem tracking
Categories
(Infrastructure & Operations Graveyard :: CIDuty, task)
Infrastructure & Operations Graveyard
CIDuty
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: apop, Unassigned)
References
Details
(Whiteboard: REQ0252727, REQ0246655)
Today, when I wanted to log on the machine t-yosemite-r7-045, I've received the following error :
Stdio forwarding request failed: Session open refused by peer
ssh_exchange_identification: Connection closed by remote host
Comment 1•7 years ago
|
||
Problem persists, also the machine doesn't appear in taskcluster.
Flags: needinfo?(dhouse)
Rebooted and it is taking tasks:
https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/gecko-t-osx-1010/workers/mdc2/t-yosemite-r7-045
Status: NEW → RESOLVED
Closed: 7 years ago
Flags: needinfo?(dhouse)
Resolution: --- → FIXED
Comment 3•7 years ago
|
||
Hey Van,
Seems like the machine has the STDIO issue again. Can you please restart it (or someone in dcops. Dave is in PTO and not accepting NI?s
Tried to SSH into it - failed
Tried to use roller to restart - failed.
Status: RESOLVED → REOPENED
Flags: needinfo?(vle)
Resolution: FIXED → ---
Updated•7 years ago
|
Assignee: nobody → vle
Whiteboard: REQ0246655
Comment 4•7 years ago
|
||
opened REQ0246655 with QTS for reimage.
Comment 5•7 years ago
|
||
back online.
vle@DESKTOP-3HK51T3:~$ fping t-yosemite-r7-045.test.releng.mdc2.mozilla.com
t-yosemite-r7-045.test.releng.mdc2.mozilla.com is alive
Status: REOPENED → RESOLVED
Closed: 7 years ago → 7 years ago
Flags: needinfo?(vle)
Resolution: --- → FIXED
Comment 6•7 years ago
|
||
Reopening this as it looks like the server is offline. It won't respond to pings.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 7•7 years ago
|
||
Looks good now , responds to ping and can ssh into it.
Status: REOPENED → RESOLVED
Closed: 7 years ago → 7 years ago
Resolution: --- → FIXED
Comment 8•7 years ago
|
||
Seems like the host is offline once again. Won't respond to pings.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 9•7 years ago
|
||
The machine looks good now:
https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/gecko-t-osx-1010/workers/mdc2/t-yosemite-r7-045
Status: REOPENED → RESOLVED
Closed: 7 years ago → 7 years ago
Resolution: --- → FIXED
Comment 10•7 years ago
|
||
Seems like we are facing STDIO problem on this machine once again. Reopening.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Reporter | ||
Comment 11•7 years ago
|
||
It seems that the machine still has STDIO issue. Can you please check again ?
Flags: needinfo?(vle)
Comment 12•7 years ago
|
||
opened REQ0252727 with QTS for reimage.
Flags: needinfo?(vle)
Whiteboard: REQ0246655 → REQ0252727, REQ0246655
Comment 13•7 years ago
|
||
Machine is back and running jobs:
https://tools.taskcluster.net/groups/CUMQ99uhRYKZh_X5yGpFRQ/tasks/f_RhIiVYS4KJHoERKCaHmA/runs/0
Updated•7 years ago
|
Status: REOPENED → RESOLVED
Closed: 7 years ago → 7 years ago
Resolution: --- → FIXED
Comment 14•7 years ago
|
||
Seems like the machine is in a bad state once again. Last job ended in exception ,we're receiving stdio issue when trying to ssh and machine doesn't respond to ping.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 15•7 years ago
|
||
Added this machine back in TC using the quarantine_tc.py script [1]. I've rebooted it by roller.
It looks good now:
https://tools.taskcluster.net/groups/c28OglDCQQSqMgIZg3jr7g/tasks/bNTxPfuXQbuXklJ95sF-jA/runs/0
[1] - https://wiki.mozilla.org/Connect_and_Troubleshoot_workers_in_CI#How_to_add.2Fdefine_a_worker_if_it_is_missing_from_Taskcluster
Status: REOPENED → RESOLVED
Closed: 7 years ago → 7 years ago
Resolution: --- → FIXED
Comment 16•7 years ago
|
||
Reopening this as it seems that we're hitting STDIO issue and the worker is not pingable.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Status: REOPENED → RESOLVED
Closed: 7 years ago → 7 years ago
Resolution: --- → FIXED
Comment 17•7 years ago
|
||
Reopening this as it seems like we're hitting STDIO issue and workers seems unreachable.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 18•7 years ago
|
||
https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/gecko-t-osx-1010/workers/mdc2/t-yosemite-r7-045
fixed. rebooted
Status: REOPENED → RESOLVED
Closed: 7 years ago → 7 years ago
Resolution: --- → FIXED
Comment 19•7 years ago
|
||
Reopening this as it seems like we're hitting STDIO issue and worker seems unreachable.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Reporter | ||
Comment 20•7 years ago
|
||
the machine seems to be up and running and taking jobs.
https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/gecko-t-osx-1010/workers/mdc2/t-yosemite-r7-045
We will close the bug for now. If the problem will persist in the future, we will re-open this bug.
Status: REOPENED → RESOLVED
Closed: 7 years ago → 7 years ago
Resolution: --- → FIXED
Updated•6 years ago
|
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in
before you can comment on or make changes to this bug.
Description
•