Closed Bug 1092902 (talos-mtnlion-r5-093) Opened 10 years ago Closed 10 years ago

talos-mtnlion-r5-093 problem tracking

Categories

(Infrastructure & Operations Graveyard :: CIDuty, task, P3)

x86_64
macOS

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: nthomas, Unassigned)

References

Details

(Whiteboard: [buildduty][buildslaves][capacity])

Burning a lot of builds, errors like:

bash -c 'basename "$PWD"'
 in dir /builds/slave/talos-slave/test/. (timeout 1200 secs)
...
command timed out: 1200 seconds without output running ['bash', '-c', 'basename "$PWD"'], attempting to kill
SIGKILL failed to kill process
using fake rc=-1
program finished with exit code -1

remoteFailed: [Failure instance: Traceback from remote host -- Traceback (most recent call last):
Failure: exceptions.RuntimeError: SIGKILL failed to kill process

and:
rm -rf scripts
 in dir /builds/slave/talos-slave/test/. (timeout 1200 secs)
...
rm: scripts/.gitignore: Invalid argument
rm: scripts/.hg/00changelog.i: Invalid argument
...
rm: scripts: Directory not empty
program finished with exit code 1

Trying a reboot, may have a disk issue.
Blocks: 1092901
Attempting SSH reboot...Failed.
Attempting PDU reboot...Failed.
Filed IT bug for reboot (bug 1092903)
Buildbot is stopped at least; it's still enabled in slavealloc.
Maybe all it really wanted was a reboot.
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Could be, the reimage it got in bug 1092903 was requested explicitly.
Product: Release Engineering → Infrastructure & Operations
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in before you can comment on or make changes to this bug.