Closed Bug 1095059 (b-2008-ix-0078) Opened 10 years ago Closed 9 years ago

b-2008-ix-0078 problem tracking

Categories

(Infrastructure & Operations Graveyard :: CIDuty, task, P3)

x86_64
Windows Server 2008

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: RyanVM, Unassigned)

References

Details

(Whiteboard: [buildduty][buildslaves][capacity])

Corrupted builds and an awful-looking recent health. Disabled for reimaging.
https://treeherder.mozilla.org/ui/logviewer.html#?job_id=379008&repo=mozilla-aurora
Depends on: 1095137
Diagnostics saw nothing, maybe the reimage helped. Reenabled.
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Still seems pretty burntastic to me. I've seen a couple like the one below since it was put back into production:
https://treeherder.mozilla.org/ui/logviewer.html#?job_id=31332&repo=mozilla-b2g34_v2_1
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Re-imaged. If we still see failures, we'll try another pass of diagnostics.
Status: REOPENED → RESOLVED
Closed: 10 years ago10 years ago
Resolution: --- → FIXED
https://treeherder.mozilla.org/ui/logviewer.html#?job_id=59297&repo=mozilla-b2g34_v2_1

 LINK : fatal error LNK1123: failure during conversion to COFF: file invalid or corrupt
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Re-imaged and returned to production.
Status: REOPENED → RESOLVED
Closed: 10 years ago9 years ago
Resolution: --- → FIXED
Status: RESOLVED → REOPENED
Depends on: 1026870
Resolution: FIXED → ---
Re-imaged and re-enabled.
Status: REOPENED → RESOLVED
Closed: 9 years ago9 years ago
Resolution: --- → FIXED
This host has been showing as down for 33 days. I'm not sure why slaveapi hasn't filed a bug for it?
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
It's now a staging slave, in the dev/pp environment and attached to dev-bhearsum-builds, so slaveapi has no reason to touch it.
Attempting SSH reboot...Failed.
Attempting IPMI reboot...Failed.
Filed IT bug for reboot (bug 1139927)
Re-imaged and returned to the production environment.
Status: REOPENED → RESOLVED
Closed: 9 years ago9 years ago
Resolution: --- → FIXED
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
This is burning Windows nightly jobs:
https://treeherder.mozilla.org/logviewer.html#?job_id=877823&repo=mozilla-aurora

I've disabled it in slavealloc.
Flags: needinfo?(mcornmesser)
I am looking into this. I am not seeing a clear cut reason for the failure. There is the "03:48:52 INFO - IOError: [Errno 13] Permission denied: 'c:/builds/crash-stats-api.token'" error, but the permissions are consistent with other machines. 

RyanVM: any thoughts on why it burned?
Flags: needinfo?(mcornmesser) → needinfo?(ryanvm)
Nope. I asked Ted about it and he said it was a slave issue.
Flags: needinfo?(ryanvm)
Are the running processes consistent with other machines? "I downloaded a file, and when I went to use it I got a permission denied error" on a Windows machine, my first thought is AV or search indexing grabbing onto the newly-created file, including maybe something that is running on other machines too, but on other machines has been told to keep its hands off c:/builds/.
This machine, due to troubleshooting, had puppet ran by 2 different local accounts, system and cltbld. I suspect that may have caused an issue with permissions.  I am going to reimage the machine and reenable it.
No longer depends on: 1165457
Depends on: 1165771
Re-imaged slave, did not enable it in slavealloc.
Status: REOPENED → RESOLVED
Closed: 9 years ago9 years ago
Resolution: --- → FIXED
Product: Release Engineering → Infrastructure & Operations
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in before you can comment on or make changes to this bug.