Closed
Bug 1455534
Opened 7 years ago
Closed 7 years ago
Intermittent IOError: [Errno 13] y:\hg-shared\8ba995b74e18334ab3707f27e9eb8f4e37ba3d29\.hg/store\data/modules/libpref/init/_static_pref_list.h.i: Access is denied
Categories
(Infrastructure & Operations :: RelOps: General, task, P5)
Infrastructure & Operations
RelOps: General
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: intermittent-bug-filer, Unassigned)
References
Details
(Keywords: intermittent-failure, Whiteboard: [stockwell infra])
Attachments
(1 file)
58 bytes,
patch
|
grenade
:
review+
|
Details | Diff | Splinter Review |
Filed by: ncsoregi [at] mozilla.com
https://treeherder.mozilla.org/logviewer.html#?job_id=174707743&repo=autoland
https://queue.taskcluster.net/v1/task/LvLYufiiRIaAK7IvyIwKMA/runs/0/artifacts/public/logs/live_backing.log
Traceback (most recent call last):
File "mercurial\scmutil.pyc", line 150, in callcatch
File "mercurial\dispatch.pyc", line 292, in _runcatchfunc
File "mercurial\dispatch.pyc", line 896, in _dispatch
File "mercurial\dispatch.pyc", line 658, in runcommand
File "mercurial\dispatch.pyc", line 904, in _runcommand
File "mercurial\dispatch.pyc", line 893, in <lambda>
File "mercurial\util.pyc", line 1077, in check
File "C:/mozilla-build/robustcheckout.py", line 239, in robustcheckout
File "C:/mozilla-build/robustcheckout.py", line 516, in _docheckout
File "mercurial\exchange.pyc", line 1251, in pull
File "mercurial\exchange.pyc", line 1393, in _pullbundle2
File "mercurial\bundle2.pyc", line 364, in processbundle
File "mercurial\bundle2.pyc", line 464, in _processpart
File "mercurial\bundle2.pyc", line 1552, in handlechangegroup
File "mercurial\bundle2.pyc", line 405, in _processchangegroup
File "mercurial\changegroup.pyc", line 327, in apply
File "mercurial\changegroup.pyc", line 986, in _addchangegroupfiles
File "mercurial\revlog.pyc", line 1886, in addgroup
File "mercurial\store.pyc", line 488, in __call__
File "mercurial\vfs.pyc", line 405, in __call__
File "mercurial\windows.pyc", line 140, in posixfile
IOError: [Errno 13] y:\hg-shared\8ba995b74e18334ab3707f27e9eb8f4e37ba3d29\.hg/store\data/modules/libpref/init/_static_pref_list.h.i: Access is denied
abort: y:\hg-shared\8ba995b74e18334ab3707f27e9eb8f4e37ba3d29\.hg/store\data/modules/libpref/init/_static_pref_list.h.i: Access is denied
[taskcluster:error] Exit Code: 255
[taskcluster:error] User Time: 15.625ms
[taskcluster:error] Kernel Time: 0s
[taskcluster:error] Wall Time: 11.918s
[taskcluster:error] Result: FAILED
[taskcluster 2018-04-20T06:32:16.368Z] === Task Finished ===
[taskcluster 2018-04-20T06:32:16.368Z] Task Duration: 12.1849674s
[taskcluster:error] Uploading error artifact public/build from file public/build with message "Could not read directory 'Z:\\task_1524202998\\public\\build'", reason "file-missing-on-worker" and expiry 2019-04-20T06:30:54.782Z
[taskcluster:error] TASK FAILURE during artifact upload: file-missing-on-worker: Could not read directory 'Z:\task_1524202998\public\build'
[taskcluster 2018-04-20T06:32:17.074Z] Uploading artifact public/logs/certified.log from file generic-worker\certified.log with content encoding "gzip", mime type "text/plain; charset=utf-8" and expiry 2019-04-20T06:30:54.782Z
[taskcluster 2018-04-20T06:32:17.751Z] Uploading artifact public/chainOfTrust.json.asc from file generic-worker\chainOfTrust.json.asc with content encoding "gzip", mime type "text/plain; charset=utf-8" and expiry 2019-04-20T06:30:54.782Z
[taskcluster 2018-04-20T06:32:18.331Z] Uploading redirect artifact public/logs/live.log to URL https://queue.taskcluster.net/v1/task/LvLYufiiRIaAK7IvyIwKMA/runs/0/artifacts/public/logs/live_backing.log with mime type "text/plain; charset=utf-8" and expiry 2019-04-20T06:30:54.782Z
[taskcluster:error] Task not successful due to following exception(s):
[taskcluster:error] Exception 1)
[taskcluster:error] exit status 255
[taskcluster:error] Exception 2)
[taskcluster:error] file-missing-on-worker: Could not read directory 'Z:\task_1524202998\public\build'
[taskcluster:error]
Updated•7 years ago
|
Component: General → Worker
Product: Firefox Build System → Taskcluster
QA Contact: pmoore
Version: Version 3 → unspecified
Comment 1•7 years ago
|
||
Pete cleared the gecko-3-b-win2012, this shall resolve the issue.
Updated•7 years ago
|
Component: Worker → Task Configuration
Product: Taskcluster → Firefox Build System
QA Contact: pmoore
Comment 2•7 years ago
|
||
This would be solved with bug 1350956.
Comment hidden (Intermittent Failures Robot) |
Comment 4•7 years ago
|
||
This continued to occurred on today's nightlys: https://treeherder.mozilla.org/#/jobs?repo=mozilla-central&revision=dd0e54d786743974a50a338059bcd68a09b6d5b2&selectedJob=174941784
All failures ran on worker i-0b15cd926915cb97d, i terminated the machine.
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Comment 7•7 years ago
|
||
It looks like https://github.com/mozilla-releng/OpenCloudConfig/blob/7eccd4a9085f3cb4e2fbde1d8cac81b260258d00/userdata/Configuration/GenericWorker/run-generic-worker-format-and-reboot.bat#L58 is creating the flag file C:\dsc\task-claim-state.valid before rebooting.
This is that flag file that OCC uses to delay the launch of generic-worker.exe in https://github.com/mozilla-releng/OpenCloudConfig/blob/master/userdata/Configuration/GenericWorker/run-generic-worker-format-and-reboot.bat#L22
This file should only be created when OCC run has completed. I believe that is done here: https://github.com/mozilla-releng/OpenCloudConfig/blob/dbd82051b22f3cd336153d257f5509bf838652d0/userdata/rundsc.ps1#L1204
If that rundsc.ps1 line runs at the end of every OCC run, the solution is simply to delete line 58 of run-generic-worker-format-and-reboot.bat (the first link above).
Comment 8•7 years ago
|
||
Attachment #8970230 -
Flags: review?(rthijssen)
Updated•7 years ago
|
Attachment #8970230 -
Attachment is patch: true
Attachment #8970230 -
Attachment mime type: text/x-github-pull-request → text/plain
Attachment #8970230 -
Flags: review?(rthijssen) → review+
Comment 9•7 years ago
|
||
Kendall, is someone on your team available to deploy this? It is 7pm here, and it takes a few hours to completely roll out.
Assignee: nobody → relops
Component: Task Configuration → RelOps
Flags: needinfo?(klibby)
Product: Firefox Build System → Infrastructure & Operations
QA Contact: klibby
Comment 10•7 years ago
|
||
I'm *very* wary about rolling out changes to this outside of EU hours, to be honest, especially given how long it takes to completely roll out. I would like it if grenade can deploy, and the two of you keep an eye on the rollout.
Flags: needinfo?(klibby) → needinfo?(rthijssen)
Comment hidden (Intermittent Failures Robot) |
Comment 12•7 years ago
|
||
merged by pmoore last night:
https://github.com/mozilla-releng/OpenCloudConfig/commit/f9296e2195f5d367c32581c54fb3116c9928860c
rollout of new gecko-(1,2,3)-b-win2012 amis in progress:
https://tools.taskcluster.net/groups/dz9lxKKsRHafzfQx_Zl0uA
monitoring...
Flags: needinfo?(rthijssen)
Updated•7 years ago
|
Whiteboard: [stockwell disable-recommended] → [stockwell infra]
Comment hidden (Intermittent Failures Robot) |
Comment hidden (Intermittent Failures Robot) |
Updated•7 years ago
|
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
You need to log in
before you can comment on or make changes to this bug.
Description
•