Closed
Bug 1462820
(T-W1064-MS-118)
Opened 7 years ago
Closed 6 years ago
[MDC1] T-W1064-MS-118 problem tracking
Categories
(Infrastructure & Operations Graveyard :: CIDuty, task)
Infrastructure & Operations Graveyard
CIDuty
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: riman, Unassigned)
References
Details
This worker is not taking tasks since 5 hours ago.
Checked in Nagios:
Host Status: DOWN (for 0d 5h 5m 49s)
Comment 1•7 years ago
|
||
The host status is Critical and has been down for 23H as of now with a packet loss of 100%. Tried rebooting the worker on tools.taskcluster.net but i'm getting a 404 error. Yes i am behind VPN and yes i added the ssl certificate.
Comment 2•7 years ago
|
||
Currently the worker can not be found on Taskcluster. I have rebooted the machine via iLO.
We will continue monitor it.
Comment 3•7 years ago
|
||
Checked the worker. It seems that is back on Taskcluster and it took jobs.
I will close the ticket for now as it seems the problem is fixed
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
Comment 4•7 years ago
|
||
re-opened bug for continue tracking it
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 5•7 years ago
|
||
rebooted and re-imaged the machine ( as it wasn't visible on Task Cluster ). Currently it is available and it took jobs
Updated•7 years ago
|
Summary: T-W1064-MS-118 problem tracking → [MDC1]T-W1064-MS-118 problem tracking
Updated•7 years ago
|
Summary: [MDC1]T-W1064-MS-118 problem tracking → [MDC1] T-W1064-MS-118 problem tracking
Reporter | ||
Comment 6•6 years ago
|
||
I've rebooted the worker and it's running jobs now.
Status: REOPENED → RESOLVED
Closed: 7 years ago → 6 years ago
Resolution: --- → FIXED
Comment 7•6 years ago
|
||
Worker was not taking jobs. Logs where showing:
T-W1064-MS-118.mdc1.mozilla.com Service_Control_Manager: The sshd service terminated unexpectedly.
Reboot did nothing to it so I reimaged. Has started working.
Comment 8•6 years ago
|
||
Re-opend bug. Worker is not taking jobs.
Tried rebooting, then reset bios and then reimage.
It seems that nothing helped
On papertrail the last entry is from 27.08.2018
Updated•6 years ago
|
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 9•6 years ago
|
||
Worker is active and taking jobs.
https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/gecko-t-win10-64-hw/workers/mdc1/T-W1064-MS-118
Status: REOPENED → RESOLVED
Closed: 6 years ago → 6 years ago
Resolution: --- → FIXED
Updated•6 years ago
|
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 10•6 years ago
|
||
machine seems to be up and running tasks. we will close the bug for now. If the problem will persist in the future, we will re-open the bug.
Status: REOPENED → RESOLVED
Closed: 6 years ago → 6 years ago
Resolution: --- → FIXED
Comment 11•6 years ago
|
||
Re-opening the bug. The machine is not available on Taskcluster. There is no document on ServiceNow portal and I couldn't find any last logs on papertrail.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 12•6 years ago
|
||
the machine seems to be up and running and taking jobs.
https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/gecko-t-win10-64-hw/workers/mdc1/T-W1064-MS-118
We will close the bug for now. If the problem will persist in the future, we will re-open this bug.
Status: REOPENED → RESOLVED
Closed: 6 years ago → 6 years ago
Resolution: --- → FIXED
Updated•5 years ago
|
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in
before you can comment on or make changes to this bug.
Description
•