make it easier to see the full list of test-unexpected-fail messages for failures



Tree Management
Intermittent Failures View
a year ago
a month ago


(Reporter: jmaher, Assigned: sclements)




(1 attachment)

currently we summarize orangefactor data in bugs, so we click through to see more data in orangefactor.  Once there we have to click through to individual log files, there we can see what is really going on.

I would like to make this less effort while investigating bugs, either something inside of orangefactor, or a tool hooked into mach.
Do you mean including example log failure lines in the bugzilla comment, or making that information visible in the OrangeFactor UI without having to click through?

At whatever point OrangeFactor v2 is built on/into Treeherder these would be much easier to implement, however in the meantime I'd suggest either:
a) the OrangeFactor UI could use the job_id property in each ES record to fetch the error summary from Treeherder (bonus: the job_id exists for existing data many months back)
b) the payload sent to Elasticsearch by Treeherder could include the error summary (albeit which lines? first line, all the lines? at most 5 lines?) Bonus: fewer requests to Treeherder and faster when loading the OrangeFactor UI.

For example on:

8 Feb 2017, 12:10
OS X 10.10
t-yosemite-r7-0308 has a job_id of 75476137.

For approach (a), OrangeFactor would fetch the error summary using either of:

For approach (b), the payload sent to ES would be adjusted here:
...and then exposed in the OrangeFactor API here:

For both (a) and (b), the results would be made use of here:
What I meant, in which triggered Joel filing this, was indeed that I'd like to see the TEST-UNEXPECTED-FAIL lines in an easier-to-find way.

This would have made it much easier to determine that while orangefactor thinks bug 1285461 has happened 54 times, it's actually happened 2-3 times, since:
 * 2 of the stars were clearly correct
 * 1 log is unavailable
 * 49 of them were mis-stars that were actually bug 1159532 (in the same file, and that bug's failure suggests the two bugs)
 * 1 was a different failure in the same file
 * 1 was a different failure in a different file.

It would have been great if I'd been able to determine that without clicking through to 54 logs.  And when the underlying data are sometimes that bad, I feel like I do, in fact, have to do so.

It's also important because the range of the TEST-UNEXPECTED-FAIL messages can help make it clear what the actual problem is.  For example, the fact that every time in bug 1159532 was exactly 8s (when the times are usually not round) was what made me realize what the problem was.

This can also make it clear if what the maintainer of the code/tests would expect to be reported as a separate bug is actually being starred by sheriffs as the same bug, something that's basically unobservable today (since starring stopped making bugzilla comments).
:gbrown, this seems to be in the same general category of your test-info work, would you be interested in hacking on this?
Flags: needinfo?(gbrown)
Sure, I'll take it. I don't have a clear vision for this, and I have some higher priorities right now....might take me a while to get around to it / don't mind if someone wants to steal it. ;)
Assignee: nobody → gbrown
Flags: needinfo?(gbrown)
I've never made any progress here.

sclements - Any interest?
Assignee: gbrown → nobody
Flags: needinfo?(sclements313)

Comment 6

2 months ago
Sure, I'll look into it.
Flags: needinfo?(sclements313)


2 months ago
Assignee: nobody → sclements313
Component: OrangeFactor → Intermittent Failures View


2 months ago
Attachment #8983592 - Flags: review?(emorley)
Attachment #8983592 - Flags: review?(cdawson)


2 months ago
Attachment #8983592 - Flags: review?(cdawson) → review+
Comment on attachment 8983592 [details] [review]
Link to GitHub pull-request:

Deferring this review to George, since I think he'll have some ideas as to how to tweak the Django ORM parts.
Attachment #8983592 - Flags: review?(emorley) → review?(ghickman)


2 months ago
Attachment #8983592 - Flags: review?(ghickman) → review+

Comment 9

a month ago
Commit pushed to master at
Bug 1339937 - IFV show unexpected fails (#3620)

modify failuresByBug api to include test-unexpected-fail lines per job; modify bugdetails UI to include failure counts and tooltip with lines
Last Resolved: a month ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.