Closed Bug 1481068 (T-W1064-MS-131) Opened 6 years ago Closed 6 years ago

[MDC1] T-W1064-MS-131 problem tracking

Categories

(Infrastructure & Operations Graveyard :: CIDuty, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: bcrisan, Unassigned)

References

Details

Missing from taskcluster. The worker has been re-imaged, needs verification.
Status: NEW → RESOLVED
Closed: 6 years ago
Resolution: --- → FIXED
Worker had no jobs 1 day+, last log was: T-W1064-MS-131.mdc1.mozilla.com Service_Control_Manager: The sshd service terminated unexpectedly. It has done this 1 time(s).#015 after reboot T-W1064-MS-131.mdc1.mozilla.com Service_Control_Manager: The sshd service terminated unexpectedly. It has done this 1 time(s).#015 Reimaged it and has since taken jobs.
Worker can't be found on taskcluster. Tried to solve the problem by rebooting, reset bios and reimage, but didn't solved the issue. On papertrail I've found the following entries : Aug 28 19:56:56 T-W1064-MS-131.mdc1.mozilla.com mlx4_bus: Port type registry value for device Native_14_0_0 could not be modified to value (PortType = none,auto). Previous value will be set.#015 Aug 28 19:56:56 T-W1064-MS-131.mdc1.mozilla.com mlx4_bus: Native_14_0_0: EXT_QP_MAX_RETRY_LIMIT/EXT_QP_MAX_RETRY_PERIOD registry keys were requested by user but FW does not support this feature. Please upgrade your firmware to support it. For more details, please refer to WinOF User Manual.#015 Aug 28 19:56:59 T-W1064-MS-131.mdc1.mozilla.com mlx4eth63: Mellanox ConnectX-3 Pro Ethernet Adapter #2 device detected that the link connected to port 2 is down. This can occur if the physical link is disconnected or damaged, or if the other end-port is down.#015
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Status: REOPENED → RESOLVED
Closed: 6 years ago6 years ago
Resolution: --- → FIXED
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Depends on: 1490319
Status: REOPENED → RESOLVED
Closed: 6 years ago6 years ago
Resolution: --- → FIXED
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Depends on: 1540497

the machine seems to be up and running and taking jobs.
https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/gecko-t-win10-64-hw/workers/mdc1/T-W1064-MS-131
We will close the bug for now. If the problem will persist in the future, we will re-open this bug.

Status: REOPENED → RESOLVED
Closed: 6 years ago6 years ago
Resolution: --- → FIXED
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in before you can comment on or make changes to this bug.