Closed Bug 1455534 Opened 7 years ago Closed 7 years ago

Intermittent IOError: [Errno 13] y:\hg-shared\8ba995b74e18334ab3707f27e9eb8f4e37ba3d29\.hg/store\data/modules/libpref/init/_static_pref_list.h.i: Access is denied

Categories

(Infrastructure & Operations :: RelOps: General, task, P5)

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: intermittent-bug-filer, Unassigned)

References

Details

(Keywords: intermittent-failure, Whiteboard: [stockwell infra])

Attachments

(1 file)

Filed by: ncsoregi [at] mozilla.com https://treeherder.mozilla.org/logviewer.html#?job_id=174707743&repo=autoland https://queue.taskcluster.net/v1/task/LvLYufiiRIaAK7IvyIwKMA/runs/0/artifacts/public/logs/live_backing.log Traceback (most recent call last): File "mercurial\scmutil.pyc", line 150, in callcatch File "mercurial\dispatch.pyc", line 292, in _runcatchfunc File "mercurial\dispatch.pyc", line 896, in _dispatch File "mercurial\dispatch.pyc", line 658, in runcommand File "mercurial\dispatch.pyc", line 904, in _runcommand File "mercurial\dispatch.pyc", line 893, in <lambda> File "mercurial\util.pyc", line 1077, in check File "C:/mozilla-build/robustcheckout.py", line 239, in robustcheckout File "C:/mozilla-build/robustcheckout.py", line 516, in _docheckout File "mercurial\exchange.pyc", line 1251, in pull File "mercurial\exchange.pyc", line 1393, in _pullbundle2 File "mercurial\bundle2.pyc", line 364, in processbundle File "mercurial\bundle2.pyc", line 464, in _processpart File "mercurial\bundle2.pyc", line 1552, in handlechangegroup File "mercurial\bundle2.pyc", line 405, in _processchangegroup File "mercurial\changegroup.pyc", line 327, in apply File "mercurial\changegroup.pyc", line 986, in _addchangegroupfiles File "mercurial\revlog.pyc", line 1886, in addgroup File "mercurial\store.pyc", line 488, in __call__ File "mercurial\vfs.pyc", line 405, in __call__ File "mercurial\windows.pyc", line 140, in posixfile IOError: [Errno 13] y:\hg-shared\8ba995b74e18334ab3707f27e9eb8f4e37ba3d29\.hg/store\data/modules/libpref/init/_static_pref_list.h.i: Access is denied abort: y:\hg-shared\8ba995b74e18334ab3707f27e9eb8f4e37ba3d29\.hg/store\data/modules/libpref/init/_static_pref_list.h.i: Access is denied [taskcluster:error] Exit Code: 255 [taskcluster:error] User Time: 15.625ms [taskcluster:error] Kernel Time: 0s [taskcluster:error] Wall Time: 11.918s [taskcluster:error] Result: FAILED [taskcluster 2018-04-20T06:32:16.368Z] === Task Finished === [taskcluster 2018-04-20T06:32:16.368Z] Task Duration: 12.1849674s [taskcluster:error] Uploading error artifact public/build from file public/build with message "Could not read directory 'Z:\\task_1524202998\\public\\build'", reason "file-missing-on-worker" and expiry 2019-04-20T06:30:54.782Z [taskcluster:error] TASK FAILURE during artifact upload: file-missing-on-worker: Could not read directory 'Z:\task_1524202998\public\build' [taskcluster 2018-04-20T06:32:17.074Z] Uploading artifact public/logs/certified.log from file generic-worker\certified.log with content encoding "gzip", mime type "text/plain; charset=utf-8" and expiry 2019-04-20T06:30:54.782Z [taskcluster 2018-04-20T06:32:17.751Z] Uploading artifact public/chainOfTrust.json.asc from file generic-worker\chainOfTrust.json.asc with content encoding "gzip", mime type "text/plain; charset=utf-8" and expiry 2019-04-20T06:30:54.782Z [taskcluster 2018-04-20T06:32:18.331Z] Uploading redirect artifact public/logs/live.log to URL https://queue.taskcluster.net/v1/task/LvLYufiiRIaAK7IvyIwKMA/runs/0/artifacts/public/logs/live_backing.log with mime type "text/plain; charset=utf-8" and expiry 2019-04-20T06:30:54.782Z [taskcluster:error] Task not successful due to following exception(s): [taskcluster:error] Exception 1) [taskcluster:error] exit status 255 [taskcluster:error] Exception 2) [taskcluster:error] file-missing-on-worker: Could not read directory 'Z:\task_1524202998\public\build' [taskcluster:error]
Component: General → Worker
Product: Firefox Build System → Taskcluster
QA Contact: pmoore
Version: Version 3 → unspecified
Pete cleared the gecko-3-b-win2012, this shall resolve the issue.
Component: Worker → Task Configuration
Product: Taskcluster → Firefox Build System
QA Contact: pmoore
This would be solved with bug 1350956.
This continued to occurred on today's nightlys: https://treeherder.mozilla.org/#/jobs?repo=mozilla-central&revision=dd0e54d786743974a50a338059bcd68a09b6d5b2&selectedJob=174941784 All failures ran on worker i-0b15cd926915cb97d, i terminated the machine.
It looks like https://github.com/mozilla-releng/OpenCloudConfig/blob/7eccd4a9085f3cb4e2fbde1d8cac81b260258d00/userdata/Configuration/GenericWorker/run-generic-worker-format-and-reboot.bat#L58 is creating the flag file C:\dsc\task-claim-state.valid before rebooting. This is that flag file that OCC uses to delay the launch of generic-worker.exe in https://github.com/mozilla-releng/OpenCloudConfig/blob/master/userdata/Configuration/GenericWorker/run-generic-worker-format-and-reboot.bat#L22 This file should only be created when OCC run has completed. I believe that is done here: https://github.com/mozilla-releng/OpenCloudConfig/blob/dbd82051b22f3cd336153d257f5509bf838652d0/userdata/rundsc.ps1#L1204 If that rundsc.ps1 line runs at the end of every OCC run, the solution is simply to delete line 58 of run-generic-worker-format-and-reboot.bat (the first link above).
Attachment #8970230 - Attachment is patch: true
Attachment #8970230 - Attachment mime type: text/x-github-pull-request → text/plain
Attachment #8970230 - Flags: review?(rthijssen) → review+
Kendall, is someone on your team available to deploy this? It is 7pm here, and it takes a few hours to completely roll out.
Assignee: nobody → relops
Component: Task Configuration → RelOps
Flags: needinfo?(klibby)
Product: Firefox Build System → Infrastructure & Operations
QA Contact: klibby
I'm *very* wary about rolling out changes to this outside of EU hours, to be honest, especially given how long it takes to completely roll out. I would like it if grenade can deploy, and the two of you keep an eye on the rollout.
Flags: needinfo?(klibby) → needinfo?(rthijssen)
Flags: needinfo?(rthijssen)
Whiteboard: [stockwell disable-recommended] → [stockwell infra]
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: