Closed
Bug 730857
(tegra-079)
Opened 13 years ago
Closed 11 years ago
tegra-079 problem tracking
Categories
(Infrastructure & Operations Graveyard :: CIDuty, task, P3)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: philor, Unassigned)
Details
(Whiteboard: [badslave?][buildduty])
https://tbpl.mozilla.org/php/getParsedLog.php?id=9653928&tree=Mozilla-Inbound
hg clone http://hg.mozilla.org/build/tools tools
in dir /builds/tegra-079/test/. (timeout 1320 secs)
watching logfiles {}
argv: ['hg', 'clone', 'http://hg.mozilla.org/build/tools', 'tools']
environment:
PATH=/opt/local/bin:/opt/local/sbin:/opt/local/Library/Frameworks/Python.framework/Versions/2.6/bin:/usr/bin:/bin:/usr/sbin:/sbin:/usr/local/bin:/usr/X11/bin
PWD=/builds/tegra-079/test
SUT_IP=10.250.49.67
SUT_NAME=tegra-079
__CF_USER_TEXT_ENCODING=0x1F6:0:0
closing stdin
using PTY: False
'import site' failed; use -v for traceback
Traceback (most recent call last):
File "/opt/local/bin/hg", line 38, in <module>
mercurial.dispatch.run()
File "/opt/local/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/site-packages/mercurial/dispatch.py", line 16, in run
sys.exit(dispatch(sys.argv[1:]))
File "/opt/local/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/site-packages/mercurial/dispatch.py", line 21, in dispatch
u = uimod.ui()
File "/opt/local/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/site-packages/mercurial/ui.py", line 35, in __init__
for f in util.rcpath():
File "/opt/local/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/site-packages/mercurial/util.py", line 1346, in rcpath
_rcpath = os_rcpath()
File "/opt/local/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/site-packages/mercurial/util.py", line 1321, in os_rcpath
path.extend(user_rcpath())
File "/opt/local/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/site-packages/mercurial/posix.py", line 53, in user_rcpath
return [os.path.expanduser('~/.hgrc')]
File "/opt/local/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/posixpath.py", line 259, in expanduser
userhome = pwd.getpwuid(os.getuid()).pw_dir
KeyError: 'getpwuid(): uid not found: 502'
program finished with exit code 1
for the last 25 or so runs.
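The traceback ends in os.path.expanduser, which on this Python 2.6 falls back to pwd.getpwuid(os.getuid()) when HOME is not set; HOME does not appear in the environment logged above. A minimal diagnostic sketch, not part of the original report, that shows which of the two lookups would succeed on a given machine (the function name home_lookup_status is hypothetical):

```python
import os
import pwd


def home_lookup_status(environ=None, uid=None, getpwuid=pwd.getpwuid):
    """Describe how '~' would be resolved for this process.

    environ, uid and getpwuid can be injected for testing; by default the
    real process environment, real uid and the system passwd database are used.
    """
    environ = os.environ if environ is None else environ
    uid = os.getuid() if uid is None else uid
    if "HOME" in environ:
        # expanduser uses HOME directly and never consults the passwd database.
        return "HOME set: %s" % environ["HOME"]
    try:
        # Same fallback the traceback shows: look the uid up in passwd.
        return "passwd entry: %s" % getpwuid(uid).pw_dir
    except KeyError:
        # This is the failing case from the log (uid 502 had no entry).
        return "no HOME and no passwd entry for uid %d" % uid


if __name__ == "__main__":
    print(home_lookup_status())
```

Run as the buildbot user on the foopy, the third message would correspond to the KeyError in the log.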
Comment 1•13 years ago
stop_cp has been run.
Updated•13 years ago
Assignee: nobody → server-ops-releng
Component: Release Engineering → Server Operations: RelEng
QA Contact: release → arich
Summary: Please disable tegra-079 and enroll it in tegra recovery → recover tegra-079
Comment 2•13 years ago
This was probably my fault, as I stopped it by mistake at about 2200 yesterday during
http://buildbot-master19.build.mtv1.mozilla.com:8201/builders/Android%20Tegra%20250%20try%20opt%20test%20robocop/builds/352
After starting it again it did one good build and has been red since. It might just need bear/hwine to check for leftovers from the interruption.
Comment 3•13 years ago
The environment is good - the tegra needs to be checked to make sure the sdcard and/or its network link are good.
Please continue with the "recovery".
Updated•13 years ago
Assignee: server-ops-releng → mlarrain
Updated•13 years ago
colo-trip: --- → mtv1
Comment 4•13 years ago
reimaged
Assignee: mlarrain → nobody
Component: Server Operations: RelEng → Release Engineering
QA Contact: arich → release
Updated•13 years ago
Priority: -- → P3
Comment 5•13 years ago
back in production
Alias: tegra-079
Status: NEW → RESOLVED
Closed: 13 years ago
Component: Release Engineering → Release Engineering: Machine Management
QA Contact: release → armenzg
Resolution: --- → FIXED
Summary: recover tegra-079 → tegra-079 problem tracking
Comment 6•13 years ago
Last Job 3 days, 10:00:33 ago
Updated•13 years ago
Status: REOPENED → RESOLVED
Closed: 13 years ago → 13 years ago
Resolution: --- → FIXED
Comment 7•12 years ago
Last job 3 days, 14:15:52 ago
error.flg [Remote Device Error: Unable to properly remove /mnt/sdcard/tests]
remotely reformatted sdcard
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 8•12 years ago
Had to manually reformat its sdcard today, do a PDU reboot, and clear the error flag. It's back in production though.
Status: REOPENED → RESOLVED
Closed: 13 years ago → 12 years ago
Resolution: --- → FIXED
Comment 9•12 years ago
error.flg [Remote Device Error: Unable to properly remove /mnt/sdcard/tests]
back in production
Comment 10•12 years ago
Last heard from on Oct-10; clientproxy was dead on the foopy, so I started it.
Comment 11•12 years ago
Needs a new sdcard
Updated•12 years ago
Status: REOPENED → RESOLVED
Closed: 12 years ago → 12 years ago
Resolution: --- → FIXED
Comment 12•12 years ago
No jobs taken on this device for >= 7 weeks
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 13•12 years ago
(mass change: filter on tegraCallek02reboot2013)
I just rebooted this device, hoping that many of the ones I'm doing tonight come back automatically. I'll check back tomorrow to see whether it did; if it did not, I'll triage the next step manually on a per-device basis.
---
Command I used (with a manual patch to the fabric script to allow this command):
(fabric)[jwood@dev-master01 fabric]$ python manage_foopies.py -j15 -f devices.json `for i in 021 032 036 039 046 048 061 064 066 067 071 074 079 081 082 083 084 088 093 104 106 108 115 116 118 129 152 154 164 168 169 174 179 182 184 187 189 200 207 217 223 228 234 248 255 264 270 277 285 290 294 295 297 298 300 302 304 305 306 307 308 309 310 311 312 314 315 316 319 320 321 322 323 324 325 326 328 329 330 331 332 333 335 336 337 338 339 340 341 342 343 345 346 347 348 349 350 354 355 356 358 359 360 361 362 363 364 365 367 368 369; do echo '-D' tegra-$i; done` reboot_tegra
The command does the reboots one at a time from the foopy each device is connected to, with one ssh connection per foopy.
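For reference, the backtick loop in the command above only expands the list of device numbers into repeated -D tegra-NNN arguments. A rough Python equivalent of that expansion, as a sketch (the helper name device_args is hypothetical and is not part of manage_foopies.py):

```python
def device_args(numbers, prefix="tegra-"):
    """Expand device numbers into ['-D', 'tegra-021', '-D', 'tegra-032', ...]."""
    args = []
    for number in numbers:
        # Numbers stay strings so the leading zeros are preserved.
        args.extend(["-D", prefix + number])
    return args


if __name__ == "__main__":
    # A short subset of the numbers used in the mass reboot above.
    print(" ".join(device_args(["021", "032", "079"])))
    # prints: -D tegra-021 -D tegra-032 -D tegra-079
```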
Comment 14•12 years ago
had to cycle clientproxy to bring this back
Updated•12 years ago
Status: REOPENED → RESOLVED
Closed: 12 years ago → 12 years ago
Resolution: --- → FIXED
Updated•12 years ago
Assignee
Updated•11 years ago
Product: mozilla.org → Release Engineering
Reporter
Comment 15•11 years ago
Hasn't taken a job for 7 days.
Status: RESOLVED → REOPENED
QA Contact: armenzg → bugspam.Callek
Resolution: FIXED → ---
Reporter
Comment 16•11 years ago
Disabled in slavealloc to stop the stream of pointless reboots.
Comment 17•11 years ago
SD card replaced, tegra flashed and reimaged.
vle@vle-10516 ~ $ telnet tegra-079.tegra.releng.scl3.mozilla.com 20701
Trying 10.26.85.56...
Connected to tegra-079.tegra.releng.scl3.mozilla.com.
Escape character is '^]'.
$>^]
telnet> q
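The telnet session above only confirms that the device accepts a TCP connection on port 20701. A minimal sketch of the same check in Python, assuming nothing about the protocol spoken on that port (the function name port_accepts_connections is hypothetical):

```python
import socket


def port_accepts_connections(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        conn = socket.create_connection((host, port), timeout=timeout)
    except (socket.error, socket.timeout):
        # Covers refused connections, DNS failures and timeouts.
        return False
    conn.close()
    return True
```

Calling port_accepts_connections("tegra-079.tegra.releng.scl3.mozilla.com", 20701) from a host on the same network would mirror the manual check; it does not verify that the device can actually run jobs.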
Reporter
Comment 18•11 years ago
Reenabled.
Status: REOPENED → RESOLVED
Closed: 12 years ago → 11 years ago
Resolution: --- → FIXED
Updated•7 years ago
Product: Release Engineering → Infrastructure & Operations
Updated•5 years ago
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard