Closed Bug 1213025 Opened 9 years ago Closed 5 years ago

Make a Test Failure Dashboard

Categories

(Testing Graveyard :: ActiveData, defect)

Type: defect
Priority: Not set
Severity: normal

Tracking

(Not tracked)

RESOLVED INVALID

People

(Reporter: ekyle, Assigned: ekyle)

Details

Attachments

(1 file)

Make a dashboard, using ActiveData, detailing failures down to the individual test.

Here is a super rough initial version, just to give a taste.  Click on a test failure.

http://activedata.allizom.org/tools/failures2.html
Attached image 2015-10-08 16-06-51.png
The revisions are simply in push_date order; the y-axis is the test duration.
I hope you drink coffee; you should go get one. This page is really slow: it downloads thousands of errors from the past day. Too many errors in one day and you will crash your browser.
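For the curious, here is a minimal sketch, in Python, of the kind of query the page issues against the public /query endpoint. The endpoint and the general query shape are real ActiveData, but the exact field names (result.ok, result.duration, repo.push.date, build.revision12) are my assumptions, not copied from the dashboard code:

    import json
    import urllib.request

    QUERY_URL = "http://activedata.allizom.org/query"

    # Pull recent failures: test name, duration, and revision,
    # sorted by push date (the x-axis order described above).
    query = {
        "from": "unittest",
        "select": ["build.revision12", "repo.push.date",
                   "result.test", "result.duration"],
        "where": {"eq": {"result.ok": False}},  # failures only
        "sort": "repo.push.date",
        "limit": 10000,   # thousands of rows; this is the slow part
        "format": "list",
    }

    request = urllib.request.Request(
        QUERY_URL,
        data=json.dumps(query).encode("utf8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        failures = json.loads(response.read())["data"]

The large limit is exactly why the page is slow: all those rows must cross the network before anything is drawn.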
I had some time to work on this. It is in debug mode, so it is faster. The new location will be my people page [1]. The code [2] is now separate from the ActiveData project.

Since it is in debug mode, only a sample of the failures from today is shown right now. If you exclude some categories and reload, you will get another sample. Given the size of the `unittest` table, and the small number of machines we are limited to, we must either build a cache for the full error set or optimize ActiveData to handle the query.

[1] http://people.mozilla.org/~klahnakoski/testfailures/failures.html
[2] https://github.com/klahnakoski/TestFailures
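As a rough illustration of the cache idea mentioned above: a small script that periodically pulls the full error set and writes it to a static file the dashboard can load, instead of querying ActiveData live. The file name, interval, and fetch_failures() stub are all hypothetical:

    import json
    import time

    CACHE_FILE = "failures_cache.json"
    REFRESH_SECONDS = 60 * 60  # rebuild once an hour

    def fetch_failures():
        # stand-in for the full ActiveData query; the real script would
        # return the complete error set, not a sample
        return []

    def rebuild_cache():
        failures = fetch_failures()
        with open(CACHE_FILE, "w") as f:
            json.dump({"updated": time.time(), "data": failures}, f)

    while True:
        rebuild_cache()
        time.sleep(REFRESH_SECONDS)

This trades freshness for speed: the browser downloads one pre-built file instead of hammering the `unittest` table on every page load.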
To make this useful, it must point out the most egregious failures (e.g., tests that fail a high percentage of the time) or recent increases in intermittents. Neither of these is hard to detect; the hard part is sorting the approximately one million combinations, most of which are uninteresting, and doing it fast.

A million aggregates is too large for memory or the network, so we require a container to hold and query them. I believe the solution is a materialized view over the whole set, with a script keeping that view up to date. Implementing materialized views is too much work for this objective, but defining the API for materialized views, and faking the implementation for this use case, should be in scope.
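To make that concrete, here is one possible shape for such an API, with a fake implementation that simply recomputes the aggregates on demand. Every name here is illustrative; none of it is taken from the ActiveData code:

    from abc import ABC, abstractmethod

    class MaterializedView(ABC):
        """Pre-aggregated failure counts, kept up to date by a script."""

        @abstractmethod
        def refresh(self):
            """Bring the view up to date with the source table."""

        @abstractmethod
        def query(self, where=None, limit=100):
            """Query the aggregates without touching the source table."""

    class FakeMaterializedView(MaterializedView):
        """Fake implementation: recompute everything on each refresh.
        Too slow for the general case, good enough for this use case."""

        def __init__(self, source_rows):
            self.source_rows = source_rows  # e.g. cached unittest records
            self.aggregates = {}

        def refresh(self):
            self.aggregates = {}
            for row in self.source_rows:
                agg = self.aggregates.setdefault(
                    row["test"], {"total": 0, "failures": 0}
                )
                agg["total"] += 1
                agg["failures"] += 0 if row["ok"] else 1

        def query(self, where=None, limit=100):
            items = [
                {"test": test, **agg}
                for test, agg in self.aggregates.items()
                if where is None or where(test)
            ]
            # most egregious first: highest failure rate
            items.sort(key=lambda x: x["failures"] / x["total"], reverse=True)
            return items[:limit]

The point is that the dashboard only ever talks to query(), so a real materialized view can replace the fake later without touching the UI.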
Maybe prioritizing all test failures by "interestingness" can be pushed outside the scope of this bug. If we have a simple text search, then we can view any test over time; see the sketch below. We can push the problem of highlighting "interesting" failures to the regression-detection module.

A store of alerts could then be used in this dashboard at a later time.
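Here is a minimal sketch of that text search, run over the cached aggregates rather than inside ActiveData (so I do not have to guess at the query language's string operators); the aggregates and cached_aggregates names are hypothetical:

    def search_tests(aggregates, text, limit=50):
        """Filter aggregates to tests whose path contains the search
        text, keeping whatever ordering the aggregates already have."""
        matches = [a for a in aggregates
                   if text.lower() in a["test"].lower()]
        return matches[:limit]

    # e.g. view every devtools test over time:
    # rows = search_tests(cached_aggregates, "devtools/")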
Status: NEW → RESOLVED
Closed: 5 years ago
Resolution: --- → INVALID
Product: Testing → Testing Graveyard