Workers in terraform-packet worker type getting stuck with exception
Categories
(Taskcluster :: Workers, defect)
Tracking
(Not tracked)
People
(Reporter: zfay, Unassigned)
Details
While monitoring the https://tools.taskcluster.net/provisioners/terraform-packet/worker-types/gecko-t-linux pool today, I noticed that a large number of workers had stopped picking up tasks. All the affected workers finished their last job with an exception status.
Treeherder link from earlier:
https://treeherder.mozilla.org/#/jobs?repo=autoland&selectedJob=252607557&searchStr=android%2C7.0%2Cx86-64%2Copt%2Cweb%2Cplatform%2Ctests%2Ctest-android-em-7.0-x86_64%2Fopt-web-platform-tests-reftests-e10s-1%2Cw%28wr1%29&tochange=71d648e912ef3e45b9faf3c63894152e1874c037&fromchange=c497c45f090c25395b1b07680034e7653fb94d7f
Comment 1•6 years ago
Any idea what's going on with those packet.net instances?
Comment 2•6 years ago
Let's see what :wcosta knows.
Comment 3•6 years ago
wcosta is on PTO this week, but I can take a look.
Do we have a list of the affected workerIDs and/or instances?
Comment 4•6 years ago
Hi Adrian, are there any stuck machines left? The link in comment 0 shows them running as normal.
Comment 5•6 years ago
I've checked the machines in gecko-t-linux. Currently, all the active workers are running. I'll keep monitoring them.
Updated•6 years ago