Closed Bug 938664 Opened 11 years ago Closed 9 years ago

Talos unit tests are broken

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: wlach, Assigned: jmaher)

Details

Attachments

(4 files, 1 obsolete file)

Log of talos unit tests failing 11 years ago William Lachance (:wlach) 3.98 KB, text/plain		Details
fix the test_ps.py for talos (1.0) 11 years ago Joel Maher ( :jmaher ) (UTC -8) 2.11 KB, patch	wlach : review-	Details \| Diff \| Splinter Review
remove test_ps.py from talos (2.0) 11 years ago Joel Maher ( :jmaher ) (UTC -8) 2.25 KB, patch	wlach : review+	Details \| Diff \| Splinter Review
Error log after patch was applied 11 years ago William Lachance (:wlach) 1.37 KB, text/plain		Details
fix test to do str(e) since we don't have e.msg anymore (1.0) 11 years ago Joel Maher ( :jmaher ) (UTC -8) 878 bytes, patch	wlach : review+	Details \| Diff \| Splinter Review

William Lachance (:wlach)

Reporter

Description

•

11 years ago

Attached file Log of talos unit tests failing — Details

If I install talos locally, then run INSTALL.py, then call runtests.py with the installed python, the unit tests hang here:

test_cli (test_PerfConfigurator.PerfConfiguratorUnitTest) ...  - outputName = /tmp/tmpXd2K5R.yaml
ok
test_errors (test_PerfConfigurator.PerfConfiguratorUnitTest) ...  - outputName = /tmp/tmpKFde6G.yaml

If I press CTRL-C, they continue on for a bit, before failing on an undefined import.

(see attached file for full log)

In addition to fixing tis issue, we should (1) make running unit tests a mandatory check before committing code and (2) run them continuously.

Joel Maher ( :jmaher ) (UTC -8)

Assignee

Comment 1

•

11 years ago

test_ps.py is the main culprit here- a lot of this code was removed when switching to mozprocess- let me update to use mozprocess calls.

Assignee: nobody → jmaher

Joel Maher ( :jmaher ) (UTC -8)

Assignee

Comment 2

•

11 years ago

Attached patch fix the test_ps.py for talos (1.0) (obsolete) — Details — Splinter Review

this really just removes the test case which tested functions that were replaced by mozprocess.

Attachment #8333816 - Flags: review?(wlachance)

William Lachance (:wlach)

Reporter

Comment 3

•

11 years ago

Comment on attachment 8333816 [details] [diff] [review]
fix the test_ps.py for talos (1.0)

Review of attachment 8333816 [details] [diff] [review]:
-----------------------------------------------------------------

At this point it looks more like you're just retesting mozprocess's ps function, which I assume we already do inside mozbase itself. I would suggest just killing test_ps.py altogether.

Attachment #8333816 - Flags: review?(wlachance) → review-

Joel Maher ( :jmaher ) (UTC -8)

Assignee

Comment 4

•

11 years ago

Attached patch remove test_ps.py from talos (2.0) — Details — Splinter Review

ok, removing it is easy.

Attachment #8333816 - Attachment is obsolete: true

Attachment #8333958 - Flags: review?(wlachance)

William Lachance (:wlach)

Reporter

Comment 5

•

11 years ago

Comment on attachment 8333958 [details] [diff] [review]
remove test_ps.py from talos (2.0)

Great. Please commit.

Do the unit tests now pass for you with no issue? As mentioned above, I saw some hangs before with the PerfConfigurator tests...

Attachment #8333958 - Flags: review?(wlachance) → review+

Joel Maher ( :jmaher ) (UTC -8)

Assignee

Comment 6

•

11 years ago

it takes about 30 seconds to complete that test, but it passes as ok

Joel Maher ( :jmaher ) (UTC -8)

Assignee

Comment 7

•

11 years ago

https://hg.mozilla.org/build/talos/rev/ad927d1aa867

Status: NEW → RESOLVED

Closed: 11 years ago

Resolution: --- → FIXED

William Lachance (:wlach)

Reporter

Comment 8

•

11 years ago

I am still seeing errors.

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

William Lachance (:wlach)

Reporter

Comment 9

•

11 years ago

Attached file Error log after patch was applied — Details

Joel Maher ( :jmaher ) (UTC -8)

Assignee

Comment 10

•

11 years ago

Attached patch fix test to do str(e) since we don't have e.msg anymore (1.0) — Details — Splinter Review

fix an error that shows up when running in the virtualenv.  There is still an issue with test_PerfConfigurator.py as we attempt to do a remote perf configurator on device 0.0.0.0 and this yields an exception (which we are testing for).  On some machines this doesn't timeout and causes a hang/error.

Attachment #8335366 - Flags: review?(wlachance)

William Lachance (:wlach)

Reporter

Comment 11

•

11 years ago

Comment on attachment 8335366 [details] [diff] [review]
fix test to do str(e) since we don't have e.msg anymore (1.0)

Hmm, I didn't think this was necessary, but apparently it is:

>>> try:
...     raise Exception("this is a string")
... except Exception, e:
...     print "this is" in e
...     print "this is not" in e
... 
False
False
>>> try:
...     raise Exception("this is a string")
... except Exception, e:
...     print "this is" in str(e)
...     print "this is not" in str(e)
... 
True
False

Attachment #8335366 - Flags: review?(wlachance) → review+

Joel Maher ( :jmaher ) (UTC -8)

Assignee

Comment 12

•

11 years ago

https://hg.mozilla.org/build/talos/rev/610eda814385

William Lachance (:wlach)

Reporter

Comment 13

•

11 years ago

(In reply to Joel Maher (:jmaher) from comment #10)
> Created attachment 8335366 [details] [diff] [review]
> fix test to do str(e) since we don't have e.msg anymore (1.0)
> 
> fix an error that shows up when running in the virtualenv.  There is still
> an issue with test_PerfConfigurator.py as we attempt to do a remote perf
> configurator on device 0.0.0.0 and this yields an exception (which we are
> testing for).  On some machines this doesn't timeout and causes a hang/error.

Ok, did some investigation. I think it's not so much that it isn't timing out, but that the timeout takes FOREVER, since we retry a whole bunch of times trying to get the device root on initialization (5 times, over an increasing number of seconds). 

There's also a problem with test_install where the virtualenv command returns non-zero because virtualenv_support isn't installed. I think this is a new requirement in virtualenv, and since we're just downloading the copy of virtualenv from github master, we're seeing this problem.

I would propose that as an interim measure we disable these tests, then file followup bugs to re-enable them at some point in the future. Broken tests are worse than no tests.

Joel Maher ( :jmaher ) (UTC -8)

Assignee

Comment 14

•

11 years ago

there is no easy way to reduce the retries or timeouts- unless we instrument the code in depth, I am fine removing this one specific scenario from test_PerfConfigurator.py

Julien Pagès (:parkouss)

Comment 15

•

9 years ago

Is this bug still open ? I would like to work on that if it is the case.

(In reply to William Lachance (:wlach) from comment #0)

> In addition to fixing tis issue, we should (1) make running unit tests a
> mandatory check before committing code and (2) run them continuously.

How can we achieve this with Talos ?

Joel Maher ( :jmaher ) (UTC -8)

Assignee

Comment 16

•

9 years ago

this bug is still open.  We should remove the tests for remote and ensure everything else is running smoothly.

A couple of thoughts:
1) run tests on github with tox, coveralls, etc.
2) either move talos to github, or mirror it there where a new checkin on hg goes to github and runs tests
3) if we moved talos into mozilla-central, what type of issues might this cause?

Personally I would lean more towards github so we can submit a PR and get unittests, etc. run on it right away before even reviewing it.  There are a lot of other good reasons to move talos into the tree though, so maybe mirror to github until we solve the in-tree move, etc.

William Lachance (:wlach)

Reporter

Comment 17

•

9 years ago

I agree 100%, if we can get a github<->hg mirror going I think a github repo is the way to go. That way we can get all this going, and the only thing we'd have to change would be the location of the hg repo in talos.json.

IIRC Hal was the person to talk to about these things. Hal, how difficult would this be to set up? We already have a build-talos repository on github which mirrors from http://hg.mozilla.org/build/talos -- how hard would it be to create a *new* hg repository which mirrors from a new github repository? (we'd probably want to keep http://hg.mozilla.org/build/talos around as-is for a while for backwards compatibility with older versions of the tree, etc)

Flags: needinfo?(hwine)

William Lachance (:wlach)

Reporter

Comment 18

•

9 years ago

Looks like releng is doing something similar to this proposal with some of their repositories: http://coop.deadsquid.com/2015/03/better-releng-patch-contribution-workflow/

Hal Wine [:hwine] use NI!

Comment 19

•

9 years ago

(In reply to William Lachance (:wlach) from comment #18)
> Looks like releng is doing something similar to this proposal with some of
> their repositories:
> http://coop.deadsquid.com/2015/03/better-releng-patch-contribution-workflow/

So, releng is going through some of that pain because they started on hg. Our criteria of where the RoR depends on how the code is deployed, and where the code is deployed, in the CI environment. Code that is auto deployed to build systems must have RoR on *.mozilla.org

If this code is only for test, what's the value of an hg.mozilla.org repo for the new code?

So, yes, it can be mirrored from github to hg.mozilla.org, but I would recommend that only if you have a real need. If so, file a request per https://wiki.mozilla.org/ReleaseEngineering/RepositoryCreationRequest#mirroring

Flags: needinfo?(hwine)

Joel Maher ( :jmaher ) (UTC -8)

Assignee

Comment 20

•

9 years ago

In production, we use hg.mozilla.org/build/talos to run all talos tests.  We clone from that repository for all talos runs.

The goal is to run tests as we do on github.  For the simpler solution, we could make github the canonical source, then mirror to the existing hg repo.

that seems like the best approach overall.

William Lachance (:wlach)

Reporter

Comment 21

•

9 years ago

(In reply to Hal Wine [:hwine] (use needinfo) from comment #19)
> (In reply to William Lachance (:wlach) from comment #18

> If this code is only for test, what's the value of an hg.mozilla.org repo
> for the new code?

The only value is that we don't have to change the build automation, which assumes that talos will be cloned from an hg repository. It's not worth changing that if we're planning to move the whole thing into the tree at some point.

Julien Pagès (:parkouss)

Comment 22

•

9 years ago

The unit tests are passing now. But we should find a way to automate the unit tests runs.

Julien Pagès (:parkouss)

Comment 23

•

9 years ago

I'm closing this, because running unit tests with:

python setup.py test

run fine now. Please reopen the bug if I missed something.

Status: REOPENED → RESOLVED

Closed: 11 years ago → 9 years ago

Resolution: --- → FIXED

You need to log in before you can comment on or make changes to this bug.