858621 - Run jit-tests from test package

Reporter

Description

•

11 years ago

We should run the jit-tests from the test package instead of during "make check".

Dan Minor [:dminor]

Assignee

Updated

•

11 years ago

Depends on: 903620

Nobody; OK to take it and work on it

Updated

•

11 years ago

Product: mozilla.org → Release Engineering

Dan Minor [:dminor]

Assignee

Updated

•

11 years ago

Depends on: 906525

Dan Minor [:dminor]

Assignee

Updated

•

11 years ago

Assignee: nobody → dminor

Dan Minor [:dminor]

Assignee

Updated

•

11 years ago

Depends on: 908906

Dan Minor [:dminor]

Assignee

Comment 1

•

11 years ago

I have completed the changes to the desktop scripts and have started working on the remote versions.

Status: NEW → ASSIGNED

Dan Minor [:dminor]

Assignee

Updated

•

11 years ago

Depends on: 912004

Dan Minor [:dminor]

Assignee

Comment 2

•

11 years ago

Attached patch Patch to add jit-tests support to mozharness scripts and configs. (obsolete) — Details — Splinter Review

This is working locally and I have an Ash run here: https://tbpl.mozilla.org/?tree=Ash&rev=6489d9abf96c.

Attachment #799689 - Flags: review?(aki)

Aki Sasaki (not active)

Comment 3

•

11 years ago

Comment on attachment 799689 [details] [diff] [review]
Patch to add jit-tests support to mozharness scripts and configs.

I'm guessing your test run on ash shows that this doesn't break anything else, but doesn't actually run the jit-tests, correct?
Should be good enough to land.


>+    def query_jsshell_url(self):
>+        if self.jsshell_url:
>+            return self.jsshell_url
>+        if not self.installer_url:
>+            self.fatal("Can't figure out jsshell without an installer_url!")
>+
>+        last_slash = self.installer_url.rfind('/')
>+        base_url = self.installer_url[:last_slash]
>+
>+        for suffix in INSTALLER_SUFFIXES:
>+            if self.installer_url.endswith(suffix):
>+                no_suffix = self.installer_url[:-len(suffix)]
>+                last_dot = no_suffix.rfind('.')
>+                platform = no_suffix[last_dot + 1:]
>+
>+                self.jsshell_url = base_url + '/jsshell-' + platform + '.zip'
>+                return self.jsshell_url
>+        else:
>+            self.fatal("Can't figure out symbols_url from installer_url %s!" % self.installer_url)

I think the fatal() message is wrong.
Can you add a docstring to this method explaining what this is for?

>@@ -352,7 +369,10 @@ class PandaTest(TestingMixin, MercurialScript, VirtualenvMixin, MozpoolMixin, Bu
>         #applies to mochitest, reftest, jsreftest
>         # TestingMixin._download_and_extract_symbols() will set
>         # self.symbols_path when downloading/extracting.
>-        hostnumber = int(self.mozpool_device.split('-')[1])
>+        hostnumber = 0
>+        mozpool_device_list = self.mozpool_device.split('-')
>+        if len(mozpool_device_list) == 2:
>+            hostnumber = int(mozpool_device_list[1])

I'm guessing you hit an error that led to this change... what happened?


I only have comment/log message nits, so I'm r+ing.
However, I'm flagging :kmoir for feedback/FYI, especially on the panda changes.

Attachment #799689 - Flags: feedback?(kmoir)

Dan Minor [:dminor]

Assignee

Comment 4

•

11 years ago

Thanks for the review. Yes, the Ash run just shows that this should avoid breakage when it lands.

The hostnumber change helps when running from my local system. The mozpool_device in production has a number in its name, but I use the ip address of my panda when testing locally, which breaks this part of the code. I can leave this change out, but it will help me (and possibly others) when testing locally in the future.

Aki Sasaki (not active)

Comment 5

•

11 years ago

Comment on attachment 799689 [details] [diff] [review]
Patch to add jit-tests support to mozharness scripts and configs.

Hm, I think I failed to set the flag.

Attachment #799689 - Flags: review?(aki) → review+

Kim Moir [:kmoir] ET

Comment 6

•

11 years ago

Comment on attachment 799689 [details] [diff] [review]
Patch to add jit-tests support to mozharness scripts and configs.

lgtm

Attachment #799689 - Flags: feedback?(kmoir) → feedback+

Dan Minor [:dminor]

Assignee

Comment 7

•

11 years ago

Attached patch Patch to add jit-tests support to mozharness scripts and configs. — Details — Splinter Review

With review comments addressed.

Attachment #799689 - Attachment is obsolete: true

Dan Minor [:dminor]

Assignee

Updated

•

11 years ago

Blocks: 912997

Aki Sasaki (not active)

Comment 8

•

11 years ago

Comment on attachment 800148 [details] [diff] [review]
Patch to add jit-tests support to mozharness scripts and configs.

Interdiff looks good.

Attachment #800148 - Flags: review+

Aki Sasaki (not active)

Comment 9

•

11 years ago

Comment on attachment 800148 [details] [diff] [review]
Patch to add jit-tests support to mozharness scripts and configs.

https://hg.mozilla.org/build/mozharness/rev/bd35e3e2b331

Attachment #800148 - Flags: checked-in+

bhearsum@mozilla.com (:bhearsum)

Comment 10

•

11 years ago

in production

Aki Sasaki (not active)

Comment 11

•

11 years ago

dminor: I'm getting this warning from pyflakes:

mozharness/mozilla/testing/testbase.py:96: redefinition of unused 'platform' from line 10

Looks like we should use a different name, since platform is an imported python built-in module ?  Do you have a preference? pf ?

Flags: needinfo?(dminor)

Dan Minor [:dminor]

Assignee

Comment 12

•

11 years ago

We use 'pltfrm' elsewhere in the same file, so maybe that. 'pf' would be fine too.

Flags: needinfo?(dminor)

Christian Holler (:decoder)

Updated

•

11 years ago

Depends on: 919145

Dan Minor [:dminor]

Assignee

Updated

•

11 years ago

Depends on: 929472

Dan Minor [:dminor]

Assignee

Comment 13

•

11 years ago

Now that Bug 929569 and Bug 929125 are resolved, it Looks like the latest runs on Cedar are down to intermittent failures.

I think the next steps are either to disable the intermittent tests, or to enable jit-tests on try so we can debug them.
Terrence, any preference?

Depends on: 929569, 929125, 916742, 918934, 917309

Flags: needinfo?(terrence)

Terrence Cole [:terrence]

Comment 14

•

11 years ago

Lets enable them hidden on Try so that these failing tests have non-zero visibility. This should be good incentive for us to take a harder look at the recent spate of intermittent failures we've been experiencing.

Flags: needinfo?(terrence)

Dan Minor [:dminor]

Assignee

Updated

•

11 years ago

Depends on: 931874

Dan Minor [:dminor]

Assignee

Updated

•

11 years ago

Depends on: 939274

bhearsum@mozilla.com (:bhearsum)

Updated

•

11 years ago

Blocks: 927061

Dan Minor [:dminor]

Assignee

Updated

•

10 years ago

Depends on: 959155

Dan Minor [:dminor]

Assignee

Updated

•

10 years ago

Depends on: 959156

Dan Minor [:dminor]

Assignee

Updated

•

10 years ago

Depends on: 959158

Dan Minor [:dminor]

Assignee

Updated

•

10 years ago

Depends on: 959164

Dan Minor [:dminor]

Assignee

Updated

•

10 years ago

Depends on: 961212

Dan Minor [:dminor]

Assignee

Updated

•

10 years ago

Depends on: 967467

Mike Hommey [:glandium]

Comment 15

•

10 years ago

jit-tests are currently running during make check. They already have those various issues that are "blocking" this bug. Why are we waiting? The net benefit is that this would free build slaves for other jobs, allowing us to use less build slaves, which in turn would make builds faster because the more slaves we have the slower the builds are (see http://glandium.org/blog/?p=3170)

So can we please deploy this now and stop doing jit-tests during make check?

:Ms2ger (he/him; ⌚ UTC+1/+2)

Comment 16

•

10 years ago

No? https://tbpl.mozilla.org/?tree=Cedar&jobname=jit shows they're not green.

Dan Minor [:dminor]

Assignee

Comment 17

•

10 years ago

(In reply to Mike Hommey [:glandium] from comment #15)
> jit-tests are currently running during make check. They already have those
> various issues that are "blocking" this bug. Why are we waiting? The net
> benefit is that this would free build slaves for other jobs, allowing us to
> use less build slaves, which in turn would make builds faster because the
> more slaves we have the slower the builds are (see
> http://glandium.org/blog/?p=3170)
> 
> So can we please deploy this now and stop doing jit-tests during make check?

I've been actively working on fixing a few consistent failures over the past couple of weeks and I have one left to fix for the desktop builds at which point we can remove the jit-tests from make check. If I don't get anywhere today, I'm sure we can disable it and move forward.

Since the android jit-tests are not currently part of 'make check' they can be greened up as a separate activity.

(not currently active) Ted Mielczarek

Reporter

Comment 18

•

10 years ago

Dan has hit (and filed and fixed) a lot of issues running the jit-tests on the test slaves vs. the build slaves. Unsurprisingly, these tests have some issues about the environment they're running in.

Dan Minor [:dminor]

Assignee

Updated

•

10 years ago

Depends on: 968797

Gregory Szorc [:gps]

Comment 19

•

10 years ago

If it's possible to prioritize, I believe Windows is more important than other platforms.

Windows EC2 instances cost more than Linux.

JIT tests run slower on Windows than *NIX platforms because of new process overhead.

In EC2, it's likely costing 3-5x more to run JIT tests on Windows than on Linux.

Mike Hommey [:glandium]

Comment 20

•

10 years ago

We're not running anything windows on ec2.

Phil Ringnalda (:philor)

Comment 21

•

10 years ago

Well, we are, but it's only on the Date twig.

Chris AtLee [:catlee]

Comment 22

•

10 years ago

we're only running windows tests on ec2 on date at this point

Phil Ringnalda (:philor)

Updated

•

10 years ago

Depends on: 969986

Phil Ringnalda (:philor)

Updated

•

10 years ago

Depends on: 969988

Phil Ringnalda (:philor)

Updated

•

10 years ago

Depends on: 969991

Dan Minor [:dminor]

Assignee

Updated

•

10 years ago

Depends on: 973900

(dormant account)

Comment 23

•

10 years ago

(In reply to Dan Minor [:dminor] from comment #17)
> (In reply to Mike Hommey [:glandium] from comment #15)
> > jit-tests are currently running during make check. They already have those
> > various issues that are "blocking" this bug. Why are we waiting? The net
> > benefit is that this would free build slaves for other jobs, allowing us to
> > use less build slaves, which in turn would make builds faster because the
> > more slaves we have the slower the builds are (see
> > http://glandium.org/blog/?p=3170)
> > 
> > So can we please deploy this now and stop doing jit-tests during make check?
> 
> I've been actively working on fixing a few consistent failures over the past
> couple of weeks and I have one left to fix for the desktop builds at which
> point we can remove the jit-tests from make check. If I don't get anywhere
> today, I'm sure we can disable it and move forward.
> 
> Since the android jit-tests are not currently part of 'make check' they can
> be greened up as a separate activity.

'Today' was a few weeks ago. What's the eta now?

Dan Minor [:dminor]

Assignee

Comment 24

•

10 years ago

(In reply to Taras Glek (:taras) from comment #23)
> 'Today' was a few weeks ago. What's the eta now?

That bug was fixed, but as usual, a new test machine only failure has appeared, see bug 906525.
I filed bug 973900 to schedule these. There are some blockers there as well.

(dormant account)

Comment 25

•

10 years ago

(In reply to Dan Minor [:dminor] from comment #24)
> (In reply to Taras Glek (:taras) from comment #23)
> > 'Today' was a few weeks ago. What's the eta now?
> 
> That bug was fixed, but as usual, a new test machine only failure has
> appeared, see bug 906525.
> I filed bug 973900 to schedule these. There are some blockers there as well.

I'd like to see an ETA here.

Mark Côté [:mcote]

Comment 26

•

10 years ago

We had to pull dminor off this temporarily because of emergency b2g certification work. Most of this is now in releng's court, aside from bug 906525, which shu is looking at right now. dminor is needinfoing releng where required. I don't think we can give an ETA right now due to this being worked on by several groups, but I'll monitor it.

Dan Minor [:dminor]

Assignee

Comment 27

•

10 years ago

Filed bug 988522 to disable the failing tests until bug 906525 is resolved, and kmoir is looking at 973900 to do the scheduling.

We'll probably want to run these in parallel with the make check for a few days to make sure no serious problems emerge before removing this from make check, so we can probably get that is wrapped up in two weeks.

Dan Minor [:dminor]

Assignee

Updated

•

10 years ago

Depends on: 988532

Dan Minor [:dminor]

Assignee

Updated

•

10 years ago

No longer depends on: 906525

Ed Morley [:emorley]

Comment 28

•

10 years ago

Fair few new intermittents:
https://tbpl.mozilla.org/php/getParsedLog.php?id=37141602&tree=Mozilla-Central
https://tbpl.mozilla.org/php/getParsedLog.php?id=37143884&tree=Mozilla-Central
https://tbpl.mozilla.org/php/getParsedLog.php?id=37145413&tree=Mozilla-Inbound

Ed Morley [:emorley]

Comment 29

•

10 years ago

Another intermittent:
https://tbpl.mozilla.org/php/getParsedLog.php?id=37168546&tree=Mozilla-Inbound

Phil Ringnalda (:philor)

Comment 30

•

10 years ago

All four of those intermittents are timeouts on Windows; far as I know, everyone but me has been ignoring timeouts in jittests on Windows while run on the build since I started the chain at bug 970063, and I don't see any reason to believe these are any different than those. Other than the obvious difference, you haven't yet learned to ignore them unless and until I decide to file them ;)

Ed Morley [:emorley]

Comment 31

•

10 years ago

Until now I've been ignoring check-test failures on the most part; I was attempting to 'do the right thing' here ;-)

Dan Minor [:dminor]

Assignee

Updated

•

10 years ago

Status: ASSIGNED → RESOLVED

Closed: 10 years ago

Resolution: --- → FIXED

Nobody; OK to take it and work on it

Updated

•

6 years ago

Component: General Automation → General

Patch to add jit-tests support to mozharness scripts and configs. 11 years ago Dan Minor [:dminor] 16.00 KB, patch	mozilla : review+ kmoir : feedback+	Details \| Diff \| Splinter Review
Patch to add jit-tests support to mozharness scripts and configs. 11 years ago Dan Minor [:dminor] 16.13 KB, patch	mozilla : review+ mozilla : checked-in+	Details \| Diff \| Splinter Review