Closed Bug 631542 Opened 15 years ago Closed 14 years ago

Verify output from hg pull & update, and retry clone if required

Categories

(Release Engineering :: General, defect, P3)

x86
Windows XP
defect

Tracking

(Not tracked)

VERIFIED FIXED

People

(Reporter: Pike, Assigned: coop)

References

()

Details

(Whiteboard: [l10n][automation])

Attachments

(2 files)

win32-slave05 took a handful of nightlies down with it, see http://l10n.mozilla.org/~axel/nightlies/?date=2011-02-04
Assignee: nobody → coop
Status: NEW → ASSIGNED
Priority: -- → P2
The root cause here was a hg clone command of m-c that timed out. Because of the way we do our clones (http://hg.mozilla.org/build/buildbotcustom/file/default/process/factory.py#l2623), the checkout dir was created but not (yet) fully populated. Subsequent attempts *should* fail when trying to pull+update, but it looks like hg isn't returning an error code, or we're ignoring it. We're certainly ignoring the error message in the logs (from buildbot): # hg -R mozilla-central pull abort: repository default not found! 0 files updated, 0 files merged, 0 files removed, 0 files unresolved # hg -R mozilla-central up -C abort: unknown revision 'default'! This step, get_enUS_src, is marked as passing. :( I've clobbered the build dir for this slave which is a quick workaround for this specific failure, but I think this warrants a code fix.
Summary: bad slave in windows central nightly repacks, win32-slave05 → Verify output from hg pull & update, and retry clone if required
Whiteboard: [l10n] → [l10n][automation]
If the try-only landing of bug 589885 continues to go well it would probably be pretty easy to start using hgtool.py for this part of the l10n repack. Additionally, bug 608005 would fix this.
The dupe bug 632418 added some extra detail. The tl;dr is (1) the shell command used to run hg commands ignores exit values and (2) sometimes hg returns success inappropriately anyway.
Status: ASSIGNED → NEW
Priority: P2 → P3
Another example: sh -c if [ -d mozilla-central ]; then hg -R mozilla-central pull ;hg -R mozilla-central up -C ;else hg clone http://hg.mozilla.org/mozilla-central mozilla-central ; fi && hg -R mozilla-central update -C -r default pulling from http://hg.mozilla.org/mozilla-central searching for changes abort: abandoned transaction found - run hg recover! 0 files updated, 0 files merged, 0 files removed, 0 files unresolved 0 files updated, 0 files merged, 0 files removed, 0 files unresolved program finished with exit code 0
Clobbered win32-slave13 on 1.9.2 and m-c for hg timeouts that lead to more busted l10n builds.
Attachment #527009 - Flags: review?(armenzg)
Comment on attachment 527009 [details] [diff] [review] Use hgtool.py for l10n clones Looks good. I like it that we now have a lock :)
Attachment #527009 - Flags: review?(armenzg) → review+
Needs a merge to production-0.8 and then a reconfig.
Flags: needs-reconfig?
(In reply to comment #10) > Needs a merge to production-0.8 and then a reconfig. jhford got this merged and deployed this afternoon. Looking forward to a greener l10n nightlies dashboard.
Status: NEW → RESOLVED
Closed: 14 years ago
Flags: needs-reconfig?
Resolution: --- → FIXED
Looks like Python on Windows doesn't like the Unix style paths: python /e/builds/moz2_slave/cen-w32-l10n-ntly/tools/buildfarm/utils/hgtool.py --rev=default http://hg.mozilla.org/mozilla-central mozilla-central in dir e:\builds\moz2_slave\cen-w32-l10n-ntly\build (timeout 1800 secs) watching logfiles {} argv: ['python', '/e/builds/moz2_slave/cen-w32-l10n-ntly/tools/buildfarm/utils/hgtool.py', u'--rev=default', 'http://hg.mozilla.org/mozilla-central', 'mozilla-central'] environment: .......... python: can't open file '/e/builds/moz2_slave/cen-w32-l10n-ntly/tools/buildfarm/utils/hgtool.py': [Errno 2] No such file or directory program finished with exit code 2
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
The same command works on Try for Windows, but it ends up as: python e:/builds/moz2_slave/try-w32/tools/buildfarm/utils/hgtool.py ... ...rather than: python /e/builds/moz2_slave/cen-w32-l10n-ntly/tools/buildfarm/utils/hgtool.py ... That happens despite having the toolsdir build prop set to |/e/builds/moz2_slave/*| in both cases. Must be something in the env I'm missing that's causing the drivepath to be set wrong for l10n. Will investigate.
This is how we handle it for MercurialBuildFactory.
Attachment #527937 - Flags: review?(rail)
Comment on attachment 527937 [details] [diff] [review] Set toolsdir differently on Windows Looks good to me. You may also want to retrigger win32 nightlies after landing, merging to production-0.8 and reconfiguring the masters.
Attachment #527937 - Flags: review?(rail) → review+
A Windows l10n build is cloning successfully right now.
Status: REOPENED → RESOLVED
Closed: 14 years ago14 years ago
Resolution: --- → FIXED
Most of the win32 l10n repacks are green today. Yay!
Status: RESOLVED → VERIFIED
Product: mozilla.org → Release Engineering
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: