1097278 - B2G mochitest-3 runs are occasional marked as failing with no obvious indication as to why

Reporter

Description

•

10 years ago

https://treeherder.mozilla.org/ui/logviewer.html#?job_id=814772&repo=b2g-inbound

Either our log parser is missing something or we're not outputting enough information in the logs to properly flag these.

Jonathan Griffin (:jgriffin)

Comment 1

•

10 years ago

For some reason, the suite is aborting early, about half way through, and we don't report the test summary (number of passed/failed tests).  It isn't at all clear why this is happening.  It could be a hard crash that isn't leaving any crash report.  Any other theories, Ahal?

Flags: needinfo?(ahalberstadt)

Andrew Halberstadt [:ahal]

Comment 2

•

10 years ago

Sigh, this could be bug 1093296 (which was filed as an offshoot from bug 965705).

Flags: needinfo?(ahalberstadt)

Jonathan Griffin (:jgriffin)

Comment 3

•

10 years ago

It could be, although these runs aren't timing out, and it looks like the parent process would have to terminate early to cause the suite to end this way.

We could update the mozharness script to at least flag this condition more explicitly, although that won't help it be resolved.

Jonathan Griffin (:jgriffin)

Comment 4

•

10 years ago

The last test that ran in this failing case was /tests/dom/canvas/test/webgl-mochitest/test_hidden_alpha.html; if that's consistent with these cases, we could consider disabling that test.  Meanwhile, I'll make a mozharness patch to flag this when it happens, which should at least make this more sheriffable.

Jonathan Griffin (:jgriffin)

Comment 5

•

10 years ago

Hmm, I'm actually not sure how best to handle this in the world of structured logging.  The problem is that the JS harness is dying without executing http://dxr.mozilla.org/mozilla-central/source/testing/mochitest/tests/SimpleTest/TestRunner.js#634, and we don't have a good way to detect that sans a crash report.  The 'SUITE-END' event we check for is emitted by the Python harness, not the JS side.  We could update http://hg.mozilla.org/build/mozharness/file/d8d1a4283056/mozharness/mozilla/structuredlog.py#l58 to look for this explicitly...is there a better way?

Flags: needinfo?(cmanchester)

Chris Manchester (limited bugmail, email directly)

Comment 6

•

10 years ago

Looks like bug 1048855 and bug 1043428. The plumbing for it landed last week, but we aren't using structuredlog.py for mochitests yet.

If the goal is to print a message that's sherrifable in the short term I would add something to http://hg.mozilla.org/build/mozharness/file/d8d1a4283056/mozharness/mozilla/testing/unittest.py#l167 that logs something about no summary found or no checks run at the error level, and I’ll put something compatible in structuredlog.py before turning it on for mochitests. This actually highlights that we'll have to emit suite_end or some other final token from the js side that we check for to detect failures like these before doing that.

Flags: needinfo?(cmanchester)

Comment hidden (Legacy TBPL/Treeherder Robot)

submit_timestamp: 2014-11-12T09:16:42
log: https://treeherder.mozilla.org/ui/logviewer.html#?repo=b2g-inbound&job_id=823933
repository: b2g-inbound
who: rvandermeulen[at]mozilla[dot]com
machine: tst-linux64-spot-085
buildname: b2g_emulator_vm b2g-inbound opt test mochitest-3
revision: edb8a02ed7b9

Return code: 1

Comment hidden (Legacy TBPL/Treeherder Robot)

submit_timestamp: 2014-11-12T10:34:56
log: https://treeherder.mozilla.org/ui/logviewer.html#?repo=mozilla-inbound&job_id=3819243
repository: mozilla-inbound
who: rvandermeulen[at]mozilla[dot]com
machine: tst-linux64-spot-276
buildname: b2g_emulator_vm mozilla-inbound opt test mochitest-3
revision: 36147782c86e

Return code: 1

Jonathan Griffin (:jgriffin)

Comment 9

•

10 years ago

Attached patch Log a warning when no test summary is found, — Details — Splinter Review

Attachment #8521780 - Flags: review?(cmanchester)

Jonathan Griffin (:jgriffin)

Updated

•

10 years ago

Assignee: nobody → jgriffin

Chris Manchester (limited bugmail, email directly)

Comment 10

•

10 years ago

Comment on attachment 8521780 [details] [diff] [review]
Log a warning when no test summary is found,

Review of attachment 8521780 [details] [diff] [review]:
-----------------------------------------------------------------

::: mozharness/mozilla/testing/unittest.py
@@ +171,5 @@
> +
> +        # Account for the possibility that no test summary was output.
> +        if self.pass_count <= 0 and self.fail_count <= 0 and \
> +            (self.known_fail_count is None or self.known_fail_count <= 0):
> +            self.warning('No tests run or test summary not found')

Does this need to be "error" to show up in treeherder?

Attachment #8521780 - Flags: review?(cmanchester) → review+

Jonathan Griffin (:jgriffin)

Comment 11

•

10 years ago

(In reply to Chris Manchester [:chmanchester] from comment #10)
> 
> Does this need to be "error" to show up in treeherder?

Hmm, I think so.  Updated and pushed: https://hg.mozilla.org/build/mozharness/rev/bea2df1c0276

Comment hidden (Legacy TBPL/Treeherder Robot)

submit_timestamp: 2014-11-13T09:23:59
log: https://treeherder.mozilla.org/ui/logviewer.html#?repo=mozilla-inbound&job_id=3861890
repository: mozilla-inbound
who: rvandermeulen[at]mozilla[dot]com
machine: tst-linux64-spot-919
buildname: b2g_emulator_vm mozilla-inbound opt test mochitest-3
revision: 44d6420a3cc7

Return code: 1

Comment hidden (Legacy TBPL/Treeherder Robot)

submit_timestamp: 2014-11-13T22:36:05
log: https://treeherder.mozilla.org/ui/logviewer.html#?repo=mozilla-inbound&job_id=3889851
repository: mozilla-inbound
who: philringnalda[at]gmail[dot]com
machine: tst-linux64-spot-837
buildname: b2g_emulator_vm mozilla-inbound opt test mochitest-3
revision: cc92e864c679

Comment hidden (Legacy TBPL/Treeherder Robot)

submit_timestamp: 2014-11-13T22:37:05
log: https://treeherder.mozilla.org/ui/logviewer.html#?repo=b2g-inbound&job_id=839650
repository: b2g-inbound
who: philringnalda[at]gmail[dot]com
machine: tst-linux64-spot-365
buildname: b2g_emulator_vm b2g-inbound opt test mochitest-3
revision: 69bda3e20579

No tests run or test summary not found
Return code: 1

Jonathan Griffin (:jgriffin)

Comment 15

•

10 years ago

(In reply to TBPL Robot from comment #14)
> submit_timestamp: 2014-11-13T22:37:05
> log:
> https://treeherder.mozilla.org/ui/logviewer.html#?repo=b2g-
> inbound&job_id=839650
> repository: b2g-inbound
> who: philringnalda[at]gmail[dot]com
> machine: tst-linux64-spot-365
> buildname: b2g_emulator_vm b2g-inbound opt test mochitest-3
> revision: 69bda3e20579
> 
> No tests run or test summary not found
> Return code: 1

So the mozharness patch is in place and is working.  That doesn't, of course, fix the underlying problem, which at best guess is a B2G crash that produces no crash report.  Do you want to continue tracking that here?

Flags: needinfo?(ryanvm)

Jonathan Griffin (:jgriffin)

Updated

•

10 years ago

Assignee: jgriffin → nobody

Ryan VanderMeulen [:RyanVM]

Reporter

Comment 16

•

10 years ago

If we close this bug out, I think we should rename it to something along the lines of what ended up landing here. As-filed, I think it makes sense to just leave it open for further starring otherwise. That said, I'm not sure the harness is really leaving us in a satisfactory place yet? I'm not sure what information we're able to give an interested developer for debugging this that we couldn't before.

Flags: needinfo?(ryanvm)

Jonathan Griffin (:jgriffin)

Comment 17

•

10 years ago

Unfortunately, it isn't clear what additional information the harness could provide.  We rely on crash reports being written to the profile to handle crashes, but for some reason that apparently isn't happening here.

We could possibly make the harness smarter about knowing when the JS side terminated prematurely, so at least we could get the test name in the error message.

Chris Manchester (limited bugmail, email directly)

Updated

•

10 years ago

Blocks: 1071227

Andrew Halberstadt [:ahal]

Comment 18

•

7 years ago

Mass resolving of B2G mochitest bugs.

Status: NEW → RESOLVED

Closed: 7 years ago

Resolution: --- → INVALID