Intermittent build failure IOError: [Errno 28] No space left on device
Categories
(Firefox Build System :: Task Configuration, defect, P2)
Tracking
(Not tracked)
People
(Reporter: intermittent-bug-filer, Assigned: wcosta)
References
Details
(Keywords: intermittent-failure, Whiteboard: [stockwell unknown])
Attachments
(2 files)
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
Comment 8•7 years ago
|
||
Comment 9•7 years ago
|
||
Recent failure log: https://treeherder.mozilla.org/logviewer.html#/jobs?job_id=225920386&repo=mozilla-inbound&lineNumber=109044
[task 2019-02-04T19:08:20.245Z] IOError: [Errno 28] No space left on device
[task 2019-02-04T19:08:20.245Z] Logged from file log.py, line 585
[task 2019-02-04T19:08:20.245Z] 19:08:20 INFO - Starting Mac pre-processing on file: /builds/worker/workspace/build/src/obj-firefox/toolkit/library/gtest/XUL
[task 2019-02-04T19:08:20.245Z] Traceback (most recent call last):
[task 2019-02-04T19:08:20.245Z] File "/usr/lib/python2.7/logging/init.py", line 883, in emit
[task 2019-02-04T19:08:20.245Z] self.flush()
[task 2019-02-04T19:08:20.245Z] File "/usr/lib/python2.7/logging/init.py", line 843, in flush
[task 2019-02-04T19:08:20.245Z] self.stream.flush()
[task 2019-02-04T19:08:20.246Z] IOError: [Errno 28] No space left on device
[task 2019-02-04T19:08:20.246Z] Logged from file log.py, line 585
[task 2019-02-04T19:08:20.246Z] Traceback (most recent call last):
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 883, in emit
[task 2019-02-04T19:08:20.246Z] self.flush()
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 843, in flush
[task 2019-02-04T19:08:20.246Z] self.stream.flush()
[task 2019-02-04T19:08:20.246Z] IOError: [Errno 28] No space left on device
[task 2019-02-04T19:08:20.246Z] Logged from file log.py, line 585
[task 2019-02-04T19:08:20.246Z] 19:08:20 INFO - Running Mac pre-processing on file: /builds/worker/workspace/build/src/obj-firefox/toolkit/library/gtest/XUL
[task 2019-02-04T19:08:20.246Z] Traceback (most recent call last):
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 883, in emit
[task 2019-02-04T19:08:20.246Z] self.flush()
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 843, in flush
[task 2019-02-04T19:08:20.246Z] self.stream.flush()
[task 2019-02-04T19:08:20.246Z] IOError: [Errno 28] No space left on device
[task 2019-02-04T19:08:20.246Z] Logged from file log.py, line 585
[task 2019-02-04T19:08:20.246Z] Traceback (most recent call last):
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 883, in emit
[task 2019-02-04T19:08:20.246Z] self.flush()
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 843, in flush
[task 2019-02-04T19:08:20.246Z] self.stream.flush()
[task 2019-02-04T19:08:20.246Z] IOError: [Errno 28] No space left on device
[task 2019-02-04T19:08:20.246Z] Logged from file log.py, line 585
[task 2019-02-04T19:08:20.246Z] 19:08:20 INFO - /builds/worker/workspace/build/src/build/macosx/llvm-dsymutil --arch=x86_64 /builds/worker/workspace/build/src/obj-firefox/toolkit/library/gtest/XUL
[task 2019-02-04T19:08:20.246Z] Traceback (most recent call last):
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 883, in emit
[task 2019-02-04T19:08:20.246Z] self.flush()
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 843, in flush
[task 2019-02-04T19:08:20.246Z] self.stream.flush()
[task 2019-02-04T19:08:20.246Z] IOError: [Errno 28] No space left on device
[task 2019-02-04T19:08:20.246Z] Logged from file log.py, line 585
[task 2019-02-04T19:08:20.246Z] Traceback (most recent call last):
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 883, in emit
[task 2019-02-04T19:08:20.246Z] self.flush()
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 843, in flush
[task 2019-02-04T19:08:20.246Z] self.stream.flush()
[task 2019-02-04T19:08:20.246Z] IOError: [Errno 28] No space left on device
[task 2019-02-04T19:08:20.246Z] Logged from file log.py, line 585
[task 2019-02-04T19:08:20.246Z] 19:08:20 INFO - warning: could not find referenced DIE
[task 2019-02-04T19:08:20.246Z] Traceback (most recent call last):
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 883, in emit
[task 2019-02-04T19:08:20.246Z] self.flush()
[task 2019-02-04T19:08:20.246Z] File "/usr/lib/python2.7/logging/init.py", line 843, in flush
[task 2019-02-04T19:08:20.246Z] self.stream.flush()
[task 2019-02-04T19:08:20.246Z] IOError: [Errno 28] No space left on device
[task 2019-02-04T19:08:20.246Z] Logged from file log.py, line 585
[task 2019-02-04T19:08:20.247Z] Traceback (most recent call last):
[task 2019-02-04T19:08:20.247Z] File "/usr/lib/python2.7/logging/init.py", line 883, in emit
[task 2019-02-04T19:08:20.247Z] self.flush()
[task 2019-02-04T19:08:20.247Z] File "/usr/lib/python2.7/logging/init.py", line 843, in flush
[task 2019-02-04T19:08:20.247Z] self.stream.flush()
[task 2019-02-04T19:08:20.247Z] IOError: [Errno 28] No space left on device
[task 2019-02-04T19:08:20.247Z] Logged from file log.py, line 585
[task 2019-02-04T19:08:20.247Z] Traceback (most recent call last):
[task 2019-02-04T19:08:20.247Z] File "/usr/lib/python2.7/logging/init.py", line 883, in emit
[task 2019-02-04T19:08:20.247Z] self.flush()
[task 2019-02-04T19:08:20.247Z] File "/usr/lib/python2.7/logging/init.py", line 843, in flush
[task 2019-02-04T19:08:20.247Z] self.stream.flush()
[task 2019-02-04T19:08:20.247Z] IOError: [Errno 28] No space left on device
[task 2019-02-04T19:08:20.247Z] Logged from file log.py, line 585
| Comment hidden (Intermittent Failures Robot) |
Comment 11•7 years ago
|
||
This seems to be happening more regularly lately with OSX builds, including today on mozilla-release. Can we please try to find someone to investigate?
Comment 12•7 years ago
|
||
Redirecting to fubar, keeper of the workers...
Ryan, is this happening more often than is being starred? I only see 2 occurrences of this in the past week according to comment #10 above.
Comment 13•7 years ago
|
||
The latest log there is from a macOS cross-compile build, so it's a docker host in AWS; Wander, any ideas? It's not clear to me if it's an issue within the docker image or the underlying host..
| Assignee | ||
Comment 14•7 years ago
|
||
(In reply to Kendall Libby [:fubar] (he/him) from comment #13)
The latest log there is from a macOS cross-compile build, so it's a docker
host in AWS; Wander, any ideas? It's not clear to me if it's an issue within
the docker image or the underlying host..
I reran the task a kept measuring disk usage, it kept around 50-65%, and the worker capacity is set to one. No clue what is going on.
| Comment hidden (Intermittent Failures Robot) |
Comment 16•7 years ago
|
||
The following workers have been terminated because of this failure hitting today on beta:
Failure log: https://treeherder.mozilla.org/logviewer.html#/jobs?job_id=229014156&repo=mozilla-beta&lineNumber=109081
Could someone look into this issue? Thank you.
| Comment hidden (Intermittent Failures Robot) |
| Assignee | ||
Comment 18•7 years ago
|
||
(In reply to Cosmin Sabou [:CosminS] from comment #16)
The following workers have been terminated because of this failure hitting
today on beta:https://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-linux/workers/us-west-1/i-096ad97909fa7e400https://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-android/workers/us-east-1/i-0a675ab5a9aa6b75ahttps://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-linux/workers/us-east-1/i-0d93ac233d6533558https://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-linux/workers/us-west-2/i-0dd362434c4106361Th push:
https://treeherder.mozilla.org/#/jobs?repo=mozilla-
beta&resultStatus=busted&searchStr=nightly&fromchange=bb9e3868a7bf0e1eeca0209
d9943732ffb855958&tochange=bce0092f646c52d0402531a5b5a860905dfe7ad8&selectedJ
ob=229014156Failure log:
https://treeherder.mozilla.org/logviewer.html#/
jobs?job_id=229014156&repo=mozilla-beta&lineNumber=109081Could someone look into this issue? Thank you.
I am investigating this.
| Assignee | ||
Comment 19•7 years ago
|
||
(In reply to Cosmin Sabou [:CosminS] from comment #16)
The following workers have been terminated because of this failure hitting
today on beta:https://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-linux/workers/us-west-1/i-096ad97909fa7e400https://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-android/workers/us-east-1/i-0a675ab5a9aa6b75ahttps://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-linux/workers/us-east-1/i-0d93ac233d6533558https://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-linux/workers/us-west-2/i-0dd362434c4106361Th push:
https://treeherder.mozilla.org/#/jobs?repo=mozilla-
beta&resultStatus=busted&searchStr=nightly&fromchange=bb9e3868a7bf0e1eeca0209
d9943732ffb855958&tochange=bce0092f646c52d0402531a5b5a860905dfe7ad8&selectedJ
ob=229014156Failure log:
https://treeherder.mozilla.org/logviewer.html#/
jobs?job_id=229014156&repo=mozilla-beta&lineNumber=109081Could someone look into this issue? Thank you.
I investigated this particular one [1]. When I click on "Inpect Task", it shows the task with status "Completed" and no sign of "No space left" in the logs. I am quite confused now :/
[1] https://treeherder.mozilla.org/#/jobs?repo=mozilla-beta&resultStatus=busted&searchStr=nightly&selectedJob=229014156&revision=bce0092f646c52d0402531a5b5a860905dfe7ad8
[2] https://tools.taskcluster.net/groups/TbzbfqqYSVu7yoVp-U7kLg/tasks/AIC4rVRkS32dTgsFePIjTA/runs/1/logs/public%2Flogs%2Flive.log
| Assignee | ||
Comment 20•7 years ago
|
||
(In reply to Wander Lairson Costa [:wcosta] from comment #19)
(In reply to Cosmin Sabou [:CosminS] from comment #16)
The following workers have been terminated because of this failure hitting
today on beta:https://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-linux/workers/us-west-1/i-096ad97909fa7e400https://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-android/workers/us-east-1/i-0a675ab5a9aa6b75ahttps://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-linux/workers/us-east-1/i-0d93ac233d6533558https://tools.taskcluster.net/provisioners/aws-provisioner-v1/worker-types/
gecko-3-b-linux/workers/us-west-2/i-0dd362434c4106361Th push:
https://treeherder.mozilla.org/#/jobs?repo=mozilla-
beta&resultStatus=busted&searchStr=nightly&fromchange=bb9e3868a7bf0e1eeca0209
d9943732ffb855958&tochange=bce0092f646c52d0402531a5b5a860905dfe7ad8&selectedJ
ob=229014156Failure log:
https://treeherder.mozilla.org/logviewer.html#/
jobs?job_id=229014156&repo=mozilla-beta&lineNumber=109081Could someone look into this issue? Thank you.
I investigated this particular one [1]. When I click on "Inpect Task", it
shows the task with status "Completed" and no sign of "No space left" in the
logs. I am quite confused now :/[1]
https://treeherder.mozilla.org/#/jobs?repo=mozilla-
beta&resultStatus=busted&searchStr=nightly&selectedJob=229014156&revision=bce
0092f646c52d0402531a5b5a860905dfe7ad8
[2]
https://tools.taskcluster.net/groups/TbzbfqqYSVu7yoVp-U7kLg/tasks/
AIC4rVRkS32dTgsFePIjTA/runs/1/logs/public%2Flogs%2Flive.log
Update: shame on me, I was looking at Run 1 instead of Run 0.
Comment 22•7 years ago
|
||
We've had the same problem, but perma-failing in bug 1530682. In that case, there's a difference between level 1 workers (which do work) and level 3 ones. More details in there.
Comment 23•7 years ago
|
||
Upgrading importance because the situation has changed over the past 2 weeks. Please change it again, if I misinterpreted the thread.
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
Updated•6 years ago
|
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Assignee | ||
Comment 33•6 years ago
|
||
Bug 1447695 reports an intermittent "No space left" error
in CI. Given that the job runs in a docker container and it
is destroy as soon as the task finishes, it makes difficult
to diagnose the root cause of the bug. We then dump the disk
usage to logs so we can perform a post mortem analysis.
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
Comment 42•6 years ago
|
||
There are 31 failures associated to this bug in the last 7 days. These are occurring on: android-4-0-armv7-api16, android-4-2-x86, android-5-0-aarch64, linux64, linux64-shippable, windows-mingw32, osx-shippable all builds.
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Assignee | ||
Updated•6 years ago
|
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
Comment 64•6 years ago
|
||
This started to fail quite frequently today:
https://treeherder.mozilla.org/#/jobs?repo=autoland&resultStatus=success%2Ctestfailed%2Cbusted%2Cexception&searchStr=c93018f1e173e285a117f34a528343cb8536c34f&fromchange=d58db9c67aae38c26425a0a70de1a5df1a64f721&tochange=d07675191e795e0d3b41535d6e0a5b01fc47c702&selectedJob=279762373
| Assignee | ||
Comment 65•6 years ago
|
||
Comment 66•6 years ago
•
|
||
(In reply to Noemi Erli[:noemi_erli] from comment #64)
This started to fail quite frequently today:
https://treeherder.mozilla.org/#/jobs?repo=autoland&resultStatus=success%2Ctestfailed%2Cbusted%2Cexception&searchStr=c93018f1e173e285a117f34a528343cb8536c34f&fromchange=d58db9c67aae38c26425a0a70de1a5df1a64f721&tochange=d07675191e795e0d3b41535d6e0a5b01fc47c702&selectedJob=279762373
This is a different bug. It could have been bug 1570522, but that one got duped here because the first log posted there happened to get the same symptom (error compiling gdb-tests) from a different cause (out of space). The above build, and the majority of the ones that are now getting classified as this bug, are different and do not appear to be related to "No space left on device." The relevant portion of the log is:
[task 2019-12-05T12:00:13.605Z] 12:00:13 INFO - /builds/worker/workspace/build/src/obj-firefox/x86_64-linux-android/release/libjsrust.a(wrappers.o): In function `AnnotateMozCrashReason':
[task 2019-12-05T12:00:13.606Z] 12:00:13 INFO - /builds/worker/workspace/build/src/obj-firefox/dist/include/mozilla/Assertions.h:42: undefined reference to `__asan_report_store8'
[task 2019-12-05T12:00:13.606Z] 12:00:13 INFO - /builds/worker/workspace/build/src/obj-firefox/x86_64-linux-android/release/libjsrust.a(wrappers.o): In function `MOZ_Crash':
[task 2019-12-05T12:00:13.607Z] 12:00:13 INFO - /builds/worker/workspace/build/src/obj-firefox/dist/include/mozilla/Assertions.h:332: undefined reference to `__asan_report_store4'
[task 2019-12-05T12:00:13.607Z] 12:00:13 INFO - /builds/worker/workspace/build/src/obj-firefox/dist/include/mozilla/Assertions.h:332: undefined reference to `__asan_handle_no_return'
[task 2019-12-05T12:00:13.607Z] 12:00:13 INFO - /builds/worker/workspace/build/src/obj-firefox/x86_64-linux-android/release/libjsrust.a(wrappers.o): In function `asan.module_ctor':
[task 2019-12-05T12:00:13.608Z] 12:00:13 INFO - wrappers.cpp:(.text.asan.module_ctor+0x2): undefined reference to `__asan_init'
[task 2019-12-05T12:00:13.608Z] 12:00:13 INFO - wrappers.cpp:(.text.asan.module_ctor+0x7): undefined reference to `__asan_version_mismatch_check_v8'
[task 2019-12-05T12:00:13.608Z] 12:00:13 INFO - clang-9: error: linker command failed with exit code 1 (use -v to see invocation)
[task 2019-12-05T12:00:13.609Z] 12:00:13 INFO - /builds/worker/workspace/build/src/config/rules.mk:522: recipe for target '../../../dist/bin/gdb-tests' failed
[task 2019-12-05T12:00:13.609Z] 12:00:13 ERROR - make[4]: *** [../../../dist/bin/gdb-tests] Error 1
(edit: removed first line of pasted log with an LLVM gold plugin:...IDs have conflicting values message from log snippet above, since it also happens in successful builds.)
Comment 67•6 years ago
|
||
I filed bug 1601704 for this problem. Hopefully it will start to get the classifications now.
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Assignee | ||
Updated•6 years ago
|
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Assignee | ||
Comment 74•6 years ago
|
||
No failure in 2 months, closing.
Description
•