Closed
Bug 582821
Opened 14 years ago
Closed 11 years ago
[Win7][disabled on Windows] intermittent exception in test_nsIProcess.js : from nsILocalFile.moveTo in rename_and_test (causing the slave to fail with "rm: cannot remove directory `build/xpcshell/tests/xpcom/tests/unit': Directory not empty" till clobber)
Categories
(Core :: XPCOM, defect)
Tracking
()
RESOLVED
FIXED
People
(Reporter: dbaron, Unassigned)
References
Details
(Keywords: intermittent-failure, Whiteboard: [test which breaks the slave][test disabled on Windows])
http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1280361270.1280364347.13736.gz
Rev3 WINNT 6.1 mozilla-central opt test xpcshell
s: talos-r3-w7-037
TEST-UNEXPECTED-FAIL | c:\talos-slave\mozilla-central-win7-opt-u-xpcshell\build\xpcshell\tests\xpcom\unit\test_nsIProcess.js | test failed (with xpcshell return code: 0), see following log:
TEST-UNEXPECTED-FAIL | (xpcshell/head.js) | [Exception... "Component returned failure code: 0x80004005 (NS_ERROR_FAILURE) [nsILocalFile.moveTo]" nsresult: "0x80004005 (NS_ERROR_FAILURE)" location: "JS frame :: c:/talos-slave/mozilla-central-win7-opt-u-xpcshell/build/xpcshell/tests/xpcom/unit/test_nsIProcess.js :: rename_and_test :: line 159" data: no]
I don't think this has been observed before, but we've only had the Win7 xpcshell unhidden for a few days.
Comment 1•14 years ago
|
||
This may be similar to problems we've beeing seeing where Windows holds on to conftest.exe for a few seconds after you've run it. I've heard that making the "Application Experience" service start automatically instead of delay-start will help things (I've done that on my machine).
But I also think we could just use copyTo instead of moveTo in this test.
Comment 2•14 years ago
|
||
(In reply to comment #1)
> This may be similar to problems we've beeing seeing where Windows holds on to
> conftest.exe for a few seconds after you've run it. I've heard that making the
> "Application Experience" service start automatically instead of delay-start
> will help things (I've done that on my machine).
>
> But I also think we could just use copyTo instead of moveTo in this test.
For what it's worth, we sleep for 30 seconds before launching Buildbot on this platform.
I had a look through services.msc on a 32-bit Windows 7 machine and "Application Experience" has its Startup Type set to "Manual". On this machine, it has been started, though I'm not sure by what. Based on all the stop and start in the System log it seems like it launches on demand -- which I guess is what your comment implies, Benjamin.
Comment 3•14 years ago
|
||
philringnalda%gmail.com
http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1281496696.1281499913.4488.gz
Rev3 WINNT 6.1 mozilla-central opt test xpcshell on 2010/08/10 20:18:16
s: talos-r3-w7-034
TEST-UNEXPECTED-FAIL | c:\\talos-slave\\mozilla-central_win7_test-xpcshell\\build\\xpcshell\\tests\\xpcom\\unit\\test_nsIProcess.js | test failed (with xpcshell return code: 0), see following log:
TEST-UNEXPECTED-FAIL | (xpcshell/head.js) | [Exception... \"Component returned failure code: 0x80004005 (NS_ERROR_FAILURE) [nsILocalFile.moveTo]\" nsresult: \"0x80004005 (NS_ERROR_FAILURE)\" location: \"JS frame :: c:/talos-slave/mozilla-central_win7_test-xpcshell/build/xpcshell/tests/xpcom/unit/test_nsIProcess.js :: rename_and_test :: line 159\" data: no]
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment 10•14 years ago
|
||
http://tinderbox.mozilla.org/showlog.cgi?log=Mozilla-Aurora/1305062068.1305064280.24657.gz
Rev3 WINNT 6.1 mozilla-aurora opt test xpcshell on 2011/05/10 14:14:28
s: talos-r3-w7-025
TEST-UNEXPECTED-FAIL | c:\talos-slave\test\build\xpcshell\tests\xpcom\tests\unit\test_nsIProcess.js | test failed (with xpcshell return code: 0), see following log:
TEST-UNEXPECTED-FAIL | (xpcshell/head.js) | [Exception... "Component returned failure code: 0x80004005 (NS_ERROR_FAILURE) [nsILocalFile.moveTo]" nsresult: "0x80004005 (NS_ERROR_FAILURE)" location: "JS frame :: c:/talos-slave/test/build/xpcshell/tests/xpcom/tests/unit/test_nsIProcess.js :: rename_and_test :: line 159" data: no]
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment 19•13 years ago
|
||
And if you then look at the rest of that slave's morning in https://build.mozilla.org/buildapi/recent/talos-r3-w7-035 you'll see that it went on to turn all the jobs that feel the need for a rm -rf build red, because "rm: cannot remove directory `build/xpcshell/tests/xpcom/tests/unit': Directory not empty" though I don't really know whether this is the cause of that fairly frequent (sometimes mentioned in bug 692715, other times not) thing, or just the first victim.
Blocks: 692715
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment 21•13 years ago
|
||
(In reply to Benjamin Smedberg [:bsmedberg] from comment #1)
> This may be similar to problems we've beeing seeing where Windows holds on
> to conftest.exe for a few seconds after you've run it.
Seems to hold on for ever so slightly more than a few seconds - https://build.mozilla.org/buildapi/recent/talos-r3-w7-037 won't show it forever, but right now shows another 12 red runs after the comment 20 run, broken up by two green Talos runs because they don't try to rm -rf build and thus don't get "rm: cannot remove directory `build/xpcshell/tests/xpcom/tests/unit': Directory not empty"). I've never just let one go, but sometimes when I've sent whoever I could find off to help the slave, it has already healed, which I suspect means it did another xpcshell run, and my bet is that rather than "for a few seconds" the wedged state lasts "until this test runs again."
Whiteboard: [orange] → [orange][test which breaks the slave]
Comment 22•13 years ago
|
||
And indeed, 037 "burned" (a misnomer, since the tests run just fine, but colored and no bug suggested by tbpl means retrigger for most people, so practically they burn) another 31 jobs until I triggered 30 or 40 xpcshell jobs on a try push to trap it into having to do one, after which it has been green again.
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment 29•13 years ago
|
||
I'm no longer willing to make releng run around manually cleaning up after this test, so we need to do something about it, whether or not that something is to stop running it.
Severity: normal → critical
Comment 30•13 years ago
|
||
Comment 31•13 years ago
|
||
Comment 32•13 years ago
|
||
Comment 33•13 years ago
|
||
Updated•13 years ago
|
Summary: [Win7] intermittent exception in test_nsIProcess.js : from nsILocalFile.moveTo in rename_and_test → [Win7] intermittent exception in test_nsIProcess.js : from nsILocalFile.moveTo in rename_and_test (causing the slave to fail with "rm: cannot remove directory `build/xpcshell/tests/xpcom/tests/unit': Directory not empty" till clobber)
Comment 34•13 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=9741214&tree=Firefox
https://tbpl.mozilla.org/php/getParsedLog.php?id=9740530&tree=Mozilla-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=9739343&tree=Mozilla-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=9741379&tree=Mozilla-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=9741129&tree=Mozilla-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=9741240&tree=Mozilla-Inbound
and a bunch of other tests on the same slave
Comment 35•13 years ago
|
||
Comment 36•13 years ago
|
||
https://tbpl.mozilla.org/?tree=Try&rev=7357e37293a7 is a naive s/moveTo/copyTo/, just copying and leaving the copied file, since it looked to me at a glance like the existing scheme was "create it, move it, move it back, abandon it without deleting it."
Failed despite the test passing, on every flavor of Windows, because the copied file couldn't be removed.
Comment 37•13 years ago
|
||
Oh, awesomesauce: the reason comment 34 looked like more things than talos-r3-w7-034 should have been able to run overnight is because my try run created four more slaves in this state.
Comment 38•13 years ago
|
||
ahah, you broke our slaves :)
Btw, couldn't this test move/copy the files to the profile folder that is automatically thrown away?
As a side note Neil pointed out on IRC he had issues (basically the same bug) moving unicode named files in the past.
Comment 39•13 years ago
|
||
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment 41•13 years ago
|
||
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment 43•13 years ago
|
||
So that's another slave broken
Comment 44•13 years ago
|
||
Comment 45•13 years ago
|
||
Comment 46•13 years ago
|
||
Comment 47•13 years ago
|
||
Comment 48•13 years ago
|
||
Comment 49•13 years ago
|
||
Comment 50•13 years ago
|
||
Comment 51•13 years ago
|
||
Comment 52•13 years ago
|
||
Comment 53•13 years ago
|
||
Comment 54•13 years ago
|
||
Comment 55•13 years ago
|
||
Comment 56•13 years ago
|
||
Comment 57•13 years ago
|
||
Comment 58•13 years ago
|
||
Comment 59•13 years ago
|
||
Comment 60•13 years ago
|
||
Comment 61•13 years ago
|
||
Comment 62•13 years ago
|
||
Comment 63•13 years ago
|
||
Comment 64•13 years ago
|
||
Comment 65•13 years ago
|
||
Comment 66•13 years ago
|
||
Comment 67•13 years ago
|
||
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment 69•13 years ago
|
||
Comment 70•13 years ago
|
||
Comment 71•13 years ago
|
||
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment 73•13 years ago
|
||
Comment 74•13 years ago
|
||
Comment 75•13 years ago
|
||
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment 78•13 years ago
|
||
And that's enough of this. Disabled on Windows in https://hg.mozilla.org/integration/mozilla-inbound/rev/eed9e3c1ea13
Summary: [Win7] intermittent exception in test_nsIProcess.js : from nsILocalFile.moveTo in rename_and_test (causing the slave to fail with "rm: cannot remove directory `build/xpcshell/tests/xpcom/tests/unit': Directory not empty" till clobber) → clobber) [Win7][disabled on Windows] intermittent exception in test_nsIProcess.js : from nsILocalFile.moveTo in rename_and_test (causing the slave to fail with "rm: cannot remove directory `build/xpcshell/tests/xpcom/tests/unit': Directory not empty" till
Comment 79•13 years ago
|
||
Comment 80•13 years ago
|
||
Comment 81•13 years ago
|
||
Comment 82•13 years ago
|
||
Comment 83•13 years ago
|
||
Comment 84•13 years ago
|
||
Comment 85•13 years ago
|
||
Comment 86•13 years ago
|
||
Comment 87•13 years ago
|
||
Comment 88•13 years ago
|
||
Comment 89•13 years ago
|
||
(In reply to Phil Ringnalda (:philor) from comment #78)
> And that's enough of this. Disabled on Windows in
> https://hg.mozilla.org/integration/mozilla-inbound/rev/eed9e3c1ea13
Merged that to m-c:
https://hg.mozilla.org/mozilla-central/rev/eed9e3c1ea13
Comment 90•13 years ago
|
||
Comment 91•13 years ago
|
||
Comment 92•13 years ago
|
||
Comment 93•13 years ago
|
||
Comment 94•13 years ago
|
||
Comment 95•13 years ago
|
||
Comment 96•13 years ago
|
||
Comment 97•13 years ago
|
||
Comment 98•13 years ago
|
||
Comment 99•13 years ago
|
||
Comment 100•13 years ago
|
||
Comment 101•13 years ago
|
||
Comment 102•13 years ago
|
||
Comment 103•13 years ago
|
||
Comment 104•13 years ago
|
||
Comment 105•13 years ago
|
||
Comment 106•13 years ago
|
||
Comment 107•13 years ago
|
||
Comment hidden (Legacy TBPL/Treeherder Robot) |
Comment 109•13 years ago
|
||
Updated•12 years ago
|
Whiteboard: [orange][test which breaks the slave] → [orange][test which breaks the slave][test disabled on Windows]
Assignee | ||
Updated•12 years ago
|
Keywords: intermittent-failure
Assignee | ||
Updated•12 years ago
|
Whiteboard: [orange][test which breaks the slave][test disabled on Windows] → [test which breaks the slave][test disabled on Windows]
Updated•11 years ago
|
You need to log in
before you can comment on or make changes to this bug.
Description
•