Closed
Bug 785032
Opened 12 years ago
Closed 12 years ago
SD card issues with new Tegras ("INFO: attempting to create file /mnt/sdcard/writetest ... Push File Failed to Validate!")
Categories
(Release Engineering :: General, defect)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: emorley, Assigned: cmtalbert)
References
Details
(Keywords: intermittent-failure, Whiteboard: [red])
Attachments
(2 files, 1 obsolete file)
4.21 KB,
patch
|
cmtalbert
:
review+
|
Details | Diff | Splinter Review |
9.25 KB,
patch
|
jmaher
:
review+
|
Details | Diff | Splinter Review |
{ ========= Started Running verify.py failed (results: 2, elapsed: 34 secs) (at 2012-08-23 03:14:28.642449) ========= python /builds/sut_tools/verify.py in dir /builds/tegra-362/test/build (timeout 1200 secs) watching logfiles {} argv: ['python', '/builds/sut_tools/verify.py'] environment: HOME=/home/cltbld PATH=/tools/buildbot-0.8.4-pre-moz2/bin:/usr/local/bin:/usr/local/bin:/usr/local/bin:/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/sbin:/home/cltbld/bin PWD=/builds/tegra-362/test/build SUT_IP=10.250.51.202 SUT_NAME=tegra-362 using PTY: False DEBUG: updateSUT: Using tegra 'tegra-362' found in env variable INFO: Using tegra 'tegra-362' found in env variable INFO: attempting to ping tegra reconnecting socket INFO: updateSUT.py: We're running SUTAgentAndroid Version 1.11 INFO: Got expected SUTAgent version '1.11' INFO: attempting to create file /mnt/sdcard/writetest Push File Failed to Validate! program finished with exit code 1 } s: tegra-362 https://tbpl.mozilla.org/php/getParsedLog.php?id=14627585&tree=Firefox s: tegra-343 https://tbpl.mozilla.org/php/getParsedLog.php?id=14627866&tree=Firefox
Reporter | ||
Comment 1•12 years ago
|
||
slave: tegra-325 https://tbpl.mozilla.org/php/getParsedLog.php?id=14622443&tree=Mozilla-Inbound slave: tegra-330 https://tbpl.mozilla.org/php/getParsedLog.php?id=14623386&tree=Mozilla-Inbound slave: tegra-340 https://tbpl.mozilla.org/php/getParsedLog.php?id=14626225&tree=Mozilla-Inbound SD card related, but different error: slave: tegra-331 https://tbpl.mozilla.org/php/getParsedLog.php?id=14624360&tree=Mozilla-Inbound
Reporter | ||
Comment 2•12 years ago
|
||
Assorted errors: https://tbpl.mozilla.org/php/getParsedLog.php?id=14629835&tree=Firefox https://tbpl.mozilla.org/php/getParsedLog.php?id=14629837&tree=Firefox https://tbpl.mozilla.org/php/getParsedLog.php?id=14629837&tree=Firefox
Comment 3•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=14641782&tree=Mozilla-Inbound tegra-342
Comment 4•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=14708568&tree=Mozilla-Inbound s: tegra-347
Comment 5•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=14762559&tree=Firefox tegra-365 https://tbpl.mozilla.org/php/getParsedLog.php?id=14762549&tree=Firefox tegra-362
Comment 6•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=14780614&tree=Mozilla-Inbound s: tegra-119 (Not so sure about this one)
Comment 7•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=14785877&tree=Mozilla-Inbound s: tegra-293
Comment 8•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=14929591&tree=Mozilla-Inbound s: tegra-331
Comment 9•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=14957603&tree=Mozilla-Inbound s: tegra-342
Comment 10•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=14976726&tree=Mozilla-Aurora tegra-337
Comment 11•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=14999113&tree=Mozilla-Inbound s: tegra-325
Comment 12•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15006273&tree=Firefox s: tegra-338 https://tbpl.mozilla.org/php/getParsedLog.php?id=15006282&tree=Firefox s: tegra-330
Comment 13•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15017816&tree=Mozilla-Inbound tegra-320
Comment 14•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15022675&tree=Mozilla-Inbound tegra-346
Comment 15•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15022455&tree=Mozilla-Inbound tegra-365
Comment 16•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15022432&tree=Mozilla-Inbound tegra-345
Updated•12 years ago
|
Whiteboard: [orange]
Comment 17•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15028439&tree=Mozilla-Inbound tegra-341
Comment 18•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15031190&tree=Mozilla-Inbound tegra-355
Comment 19•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15033099&tree=Firefox tegra-342
Comment 20•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15033091&tree=Firefox tegra-322
Comment 21•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15061337&tree=Ionmonkey tegra-329
Comment 22•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15065096&tree=Mozilla-Inbound s: tegra-119
Comment 23•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15083945&tree=Mozilla-Inbound s: tegra-119
Comment 24•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15177826&tree=Firefox s: tegra-333
Comment 25•12 years ago
|
||
We've seen this problem on non-new tegras, so don't believe it's related to the way the new tegras are installed. Removing block on new tegra install.
No longer blocks: 767456
Comment 26•12 years ago
|
||
Don't you mean "on one and only one non-new tegra"? And "a problem other than this one"? This bug is INFO: attempting to create file /mnt/sdcard/writetest Push File Failed to Validate! and every one of the ones on 119, the only non-new tegra mentioned here, has instead been INFO: attempting to create file /mnt/sdcard/writetest removing file: /mnt/sdcard/writetest True reconnecting socket a thing which we've known about for some time, but have been calling bug 681861 rather than bothering to file the bug I really wanted to file, "Intermittent Android 'True'".
Blocks: 767456
Comment 27•12 years ago
|
||
Apparently I'm wrong, and old tegras do something else which isn't this, but is this, but they don't do it visibly, they do it invisibly, but their invisible not-it is this.
No longer blocks: 767456
Comment 28•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15273332&tree=Mozilla-Aurora tegra-339
Comment 29•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15275287&tree=Mozilla-Inbound tegra-346
Comment 30•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15281214&tree=Mozilla-Inbound tegra-358
Comment 31•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15285102&tree=Mozilla-Inbound s: tegra-311
Comment 32•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15307986&tree=Mozilla-Inbound tegra-343
Comment 33•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15344288&tree=Mozilla-Inbound tegra-340
Comment 34•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15375716&tree=Mozilla-Aurora tegra-348
Comment 35•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15386141&tree=Mozilla-Beta tegra-331
Comment 36•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15387243&tree=Mozilla-Inbound tegra-319
Comment 37•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15387727&tree=Mozilla-Inbound tegra-358
Comment 38•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15389088&tree=Mozilla-Inbound tegra-326
Comment 39•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15388503&tree=Mozilla-Inbound tegra-347
Comment 40•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15387948&tree=Mozilla-Beta tegra-343
Comment 41•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15392388&tree=Firefox tegra-340
Comment 42•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15388488&tree=Oak tegra-330
Comment 43•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15403532&tree=Mozilla-Beta tegra-308
Comment 44•12 years ago
|
||
Passing it this to ctalbert since he volunteered to look into this.
Assignee: nobody → ctalbert
Comment 45•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15419329&tree=Firefox tegra-324
Comment 46•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15433935&tree=Mozilla-Inbound tegra-329
Comment 47•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15535087&tree=Mozilla-Inbound tegra-351
Comment 48•12 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=15533423&tree=Mozilla-Beta tegra-336
Assignee | ||
Comment 49•12 years ago
|
||
So, given we don't know exactly what is happening here, the current hypothesis is that the agent is starting before the sdcard is mounted. That shouldn't happen. The agent waits on the android.intent.action.MEDIA_MOUNTED action to fire its startup receiver and start the agent.[1] However, I found that the agent would startup just fine with ONLY this code active when no sdcard was in the system (I disabled the code in the watcher). So, this patch adds a second check of the external storage mount state in all the places we start the agent -- both in the watcher and in the agent itself. It also outputs debugging information so we can see what is happening when we test this, and I ask very much that we test this before deploying it. If we get an r+ here I'd like to stage this an then we can do a final patch for deployment with less/better debug output.
Attachment #667280 -
Flags: review?(jmaher)
Comment 50•12 years ago
|
||
Comment on attachment 667280 [details] [diff] [review] Patch to help debug and possibly fix this Review of attachment 667280 [details] [diff] [review]: ----------------------------------------------------------------- just some nits ::: build/mobile/sutagent/android/watcher/WatcherService.java @@ +937,5 @@ > + if (Environment.MEDIA_MOUNTED.compareTo(state) != 0) { > + Log.i("SUTAgentWatcher", "SDcard not mounted, waiting another turn"); > + return; > + } else if (Environment.MEDIA_MOUNTED_READ_ONLY.equals(state)) { > + Log.e("SUTAgentWatcher", "SDCard mounted read only not starting agent now, try again in 60s"); nit: caps here should be SDcard @@ +945,3 @@ > if (bStartSUTAgent && !GetProcessInfo(sProgramName)) > { > + Log.i("SUTAgentWatcher", "Starting SUTAGent from watcher code"); NIT: caps here should be SUTAgent
Attachment #667280 -
Flags: review?(jmaher) → review+
Assignee | ||
Comment 51•12 years ago
|
||
Ok, this patch and the dependent bug patch in 797652 will allow you to test this. Callek can you stage this on some tegras and we can see if this works for us?
Attachment #667280 -
Attachment is obsolete: true
Attachment #667759 -
Flags: review+
Comment 52•12 years ago
|
||
(In reply to Clint Talbert ( :ctalbert ) from comment #51) > Created attachment 667759 [details] [diff] [review] > Addresses nits > > Ok, this patch and the dependent bug patch in 797652 will allow you to test > this. Callek can you stage this on some tegras and we can see if this works > for us? Sure, will just have you/joel give me the apk's and I'll get it up.
Assignee | ||
Comment 53•12 years ago
|
||
http://people.mozilla.org/~ctalbert/callek/
Assignee | ||
Comment 54•12 years ago
|
||
Note that this isn't the real fix we want. We have to deal with the fact that the agent needs to run on devices without an external sdcard and so we can't have it never come up if there is no sdcard. In that instance, we simply have to change the fallback behavior so that the agent never uses a fallback that is not world readable/writable. But this code in comment 53 should enable us to understand if this is the problem we're seeing on this bug and then we can fix it properly.
Assignee | ||
Comment 55•12 years ago
|
||
This is the final patch we should go with. I tried to fix the fallback method; however, I can't for the life of me get the agent to write anything into /data/local/tmp although the location appears to be (and is supposed to be) world writeable/readable. The fallback does work, but given that the getTestRoot code has never actually returned null (although that is what it was coded to do) it can cause the agent to crash, so I had to null guard pruneCommandLog against that. I think this should get us where we want to go, especially for the watcher, which is the more difficult one of the pair to update. We can continue working on the agent's fallback code in getTestRoot and see if we can actually get that to work properly.
Attachment #668302 -
Flags: review?(jmaher)
Comment 56•12 years ago
|
||
Comment on attachment 668302 [details] [diff] [review] Final patch Review of attachment 668302 [details] [diff] [review]: ----------------------------------------------------------------- this looks pretty good in general. I think we will be good in doing this, but I could see us changing the behaviour in the future as we hit other scenarios or find out more data. ::: build/mobile/sutagent/android/DoCommand.java @@ +1324,5 @@ > + tmpFile.createNewFile(); > + } catch (IOException e){ > + Log.i("SUTAgentAndroid", "Caught exception creating file in /data/local/tmp: " + e.getMessage()); > + } > + nit: extra whitespace here. @@ +1334,5 @@ > + return(Environment.getExternalStorageDirectory().getAbsolutePath()); > + } > + if (tmpFile.exists()) > + { > + Log.i("CLINT", "tmpfile exists"); extra debugging in here?
Attachment #668302 -
Flags: review?(jmaher) → review+
Reporter | ||
Comment 58•12 years ago
|
||
https://hg.mozilla.org/mozilla-central/rev/99889cc4affd
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED
Updated•12 years ago
|
Keywords: intermittent-failure
Updated•12 years ago
|
Whiteboard: [orange][red] → [red]
Updated•11 years ago
|
Product: mozilla.org → Release Engineering
You need to log in
before you can comment on or make changes to this bug.
Description
•