Closed Bug 1139911 Opened 9 years ago Closed 9 years ago

make-check test-failures not turning builds orange

Categories

(Release Engineering :: Applications: MozharnessCore, defect)

defect
Not set
blocker

Tracking

(firefox44 fixed)

RESOLVED FIXED
Tracking Status
firefox44 --- fixed

People

(Reporter: ahal, Assigned: ahal)

References

Details

Attachments

(1 file)

I pushed a change to try that broke a mozbase test, but it only turned about half of the builds orange, though the failure shows up in the logs for all of the builds. I suspect we're doing the right thing in certain build scripts, but not others.

https://treeherder.mozilla.org/#/jobs?repo=try&revision=60a457e81c05&filter-searchStr=build

Note that all the green builds have the test failure in the logs. I guessed this was a mozharness issue, but I could be wrong. Please move if it's not!
It was mozharness - for some unclear reason we're still doing non-mozharness builds on mozilla-beta, so a permanent test failure on mozilla-aurora in mozharness builds is completely green, and suddenly becomes orange on the merge to mozilla-beta.
Lack of test coverage is basically a development blocker....  We shouldn't be checking in things if we don't know whether they're correct.
Severity: normal → blocker
Flags: needinfo?(jgriffin)
Component: Mozharness → Mozbase
Product: Release Engineering → Testing
QA Contact: jlund
This is a bug in the mozharness script that runs the builds, not mozbase, so I'm putting this back in releng. Ahal, can you take a quick look? Are we just not parsing results like we should be here?
Component: Mozbase → Mozharness
Flags: needinfo?(jgriffin) → needinfo?(ahalberstadt)
Product: Testing → Release Engineering
QA Contact: jlund
Sorry, I filed and forgot. No excuse for not looking into this right away.

My theory is that the parsing is working fine, but the build script needs to set TBPL_FAILURE as the worst level instead of TBPL_WARNING. There are other steps that also return TBPL_WARNING on fail with comments like "turn the job orange". So maybe this was regressed in a treeherder change that stopped turning jobs with a TBPL_WARNING status orange.

Here's a try run to test that theory out:
https://treeherder.mozilla.org/#/jobs?repo=try&revision=6ce3352ff175
Flags: needinfo?(ahalberstadt)
Assignee: nobody → ahalberstadt
Status: NEW → ASSIGNED
I thought that was happening in the parent OutputParser, but evidently I was wrong. Yes, it looks like that's the problem. Maybe they should be using the same output parser as the other desktop unittests, but that's a can of warms I'm not too keen on opening just yet.
Yeah, leave that can closed, since the other desktop unittests have the flow "harnesses summarize pass/fail/todo themselves -> parser looks for the predigested summary and acts accordingly, setting the job status to nonsuccess either if tests failed or if the summary doesn't exist" so opening it up still requires that someone write a parser that will decide whether or not make check failed tests :)
See also bug 1215755 comment 3
Summary: mozbase failure not turning builds orange → make-check test-failures not turning builds orange
Hooray, the builds fail:
https://treeherder.mozilla.org/#/jobs?repo=try&revision=c2df0cf0235c

I'm quite confused about how this worked before, or what regressed this.

Now I guess we actually have to fix everything before I can land this. I'll file new bugs for individual issues and make them block this. Here's a new try run with every platform to see if there are any platform specific issues:
https://treeherder.mozilla.org/#/jobs?repo=try&revision=98902d03a742
Bug 1139911 - Set proper buildbot status on make-check errors and test failures, r=chmanchester
Depends on: 1215756
Depends on: 1215755
> Hooray, the builds fail:

Yes, that's what happens when test coverage gets turned off: test failures creep in.  :(
I actually wasn't being sarcastic :). I knew they were expected to fail, so was happy that my patch turned the jobs orange like it was supposed to.

But yes, not happy that test failures crept in. Luckily it looks like bug 1215755 (which you already fixed) and bug 1215756 (which we're currently investigating and will disable if the fix isn't trivial) are the only failures that crept in, on Linux at least. So hopefully I'll be able to get this landed ASAP.
Comment on attachment 8676313 [details]
MozReview Request: Bug 1139911 - Set proper buildbot status on make-check errors and test failures, r=chmanchester

Bug 1139911 - Set proper buildbot status on make-check errors and test failures, r=chmanchester
Attachment #8676313 - Flags: review?(cmanchester)
> so was happy that my patch turned the jobs orange like it was supposed to.

Ah, makes sense.  Thank you very much for dealing with this, by the way!
Comment on attachment 8676313 [details]
MozReview Request: Bug 1139911 - Set proper buildbot status on make-check errors and test failures, r=chmanchester

https://reviewboard.mozilla.org/r/22653/#review20155

I'm not familiar with this particular file, but this looks like the right fix to me.

::: testing/mozharness/mozharness/mozilla/building/buildbase.py:197
(Diff revision 1)
>              else:
>                  self.fail_count += 1

I think self.fail_count += 1 should be outside of this else.
Attachment #8676313 - Flags: review?(cmanchester) → review+
Here's a try run with the mozprocess tests fixed:
https://treeherder.mozilla.org/#/jobs?repo=try&revision=6f96eede62f8

Still might be a problem on osx, though one of the failures is due to the log being too big.. I don't think that would have been hidden by the output parser before :/
Depends on: 1217011
Actually here's a try run with all the mozprocess issues fixed, and showing green despite the too large log files. Just waiting for bug 1215756 to land, and this should be good to go.

(The log file issue is likely bug 1217051 btw, but it doesn't appear to block this).
No longer depends on: 1217011
Comment on attachment 8676313 [details]
MozReview Request: Bug 1139911 - Set proper buildbot status on make-check errors and test failures, r=chmanchester

Bug 1139911 - Set proper buildbot status on make-check errors and test failures, r=chmanchester
Green! The log size issue was also fixed by ted.
https://treeherder.mozilla.org/#/jobs?repo=try&revision=8f0073d35f0b
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: