Closed Bug 1575213 Opened 6 years ago Closed 5 years ago

Create a code coverage optimizer

Tracking

(Not tracked)

Status:

RESOLVED WONTFIX

People

(Reporter: ahal, Assigned: ahal)

References

Details

Andrew Halberstadt [:ahal]

Assignee

Description

•

6 years ago

•

Edited

Now that we have the ability to run "shadow schedulers", we should create an optimizer that tries to use code coverage information.

Marco has set up a database that exists as an artifact here.

This is a sqlite database that has a table called file_to_chunk which is a mapping of "source file" -> "test task". Our optimizer can download this, check if any of the modified files can be mapped to test tasks, and remove any tasks that aren't in the list.

Much of the logic we'll need is already implemented in the try coverage selector. Once we fix this bug, we should refactor that selector to import the optimizer from taskgraph and use that.

Andrew Halberstadt [:ahal]

Assignee

Comment 1

•

6 years ago

I'll upload my WIP patch soon, but there are a few problems:

The DB doesn't have sufficient platform information. The schema is files,platform,chunk and each row looks something like this:
js/src/vm/TraceLogging.cpp, windows, jittest-1proc-5

The only two platforms in the DB are windows and linux. But there are tons of different configurations under these. Are they debug? Asan? QR? The chunking is going to be different under each configuration.

It looks like only compiled test suites and marionette based suites exist in the DB. I don't see any mochitest/reftest/wpt references anywhere. This isn't really a blocker to getting the shadow-scheduler landed, but it will severely limit the usefulness of this optimizer.
Known issue, but since the DB only updates once a day, tests may get shuffled to new chunks in the interim. We could A) use a more stable chunking algorithm than chunk-by-runtime, and/or B) figure out how to schedule tasks by test path more reliably in taskgraph itself. Definitely think we should do B) but that could be a follow-up to this bug.

Joel Maher ( :jmaher ) (UTC -8)

Comment 2

•

6 years ago

to answer the question, we only have code coverage data for windows/linux- it is the ccov config, which is basically debug, but slightly different.

We should have clear mappings for wpt, mochitest, xpcshell (maybe reftest). I don't think we have it for the compiled tests and marionette, that seems to be opposite of what I would expect.

Andrew Halberstadt [:ahal]

Assignee

Comment 3

•

6 years ago

Ok, if it's the ccov config this data won't be very useful to us then. As the chunks from there will be different than the chunks on the configs we'd like to run. We're basically back to the same old problem we always run up against: how to schedule by test path on autoland (without breaking backfill).

Marco Castelluccio [:marco]

Comment 4

•

6 years ago

(In reply to Andrew Halberstadt [:ahal] from comment #1)

I'll upload my WIP patch soon, but there are a few problems:

The DB doesn't have sufficient platform information. The schema is files,platform,chunk and each row looks something like this:
js/src/vm/TraceLogging.cpp, windows, jittest-1proc-5

The only two platforms in the DB are windows and linux. But there are tons of different configurations under these. Are they debug? Asan? QR? The chunking is going to be different under each configuration.

We need to make a call here. Do we run on all configurations? Do we run only under one configuration? Do we use something like SETA to decide the configuration?
This applies in general to any coverage-based scheduler (at chunk level, or manifest level, or test level), as the different configurations are often redundant.

It looks like only compiled test suites and marionette based suites exist in the DB. I don't see any mochitest/reftest/wpt references anywhere. This isn't really a blocker to getting the shadow-scheduler landed, but it will severely limit the usefulness of this optimizer.

I thought I fixed this, but I guess I didn't :)

Known issue, but since the DB only updates once a day, tests may get shuffled to new chunks in the interim. We could A) use a more stable chunking algorithm than chunk-by-runtime, and/or B) figure out how to schedule tasks by test path more reliably in taskgraph itself. Definitely think we should do B) but that could be a follow-up to this bug.

Ok, if it's the ccov config this data won't be very useful to us then. As the chunks from there will be different than the chunks on the configs we'd like to run. We're basically back to the same old problem we always run up against: how to schedule by test path on autoland (without breaking backfill).

These two are ultimately the same root problem.

As an interim solution, could we schedule all chunks containing the tests that are in the ccov chunk?
That is:

Get list of covered chunks from file_to_chunk;
Get list of tests from covered chunks using chunk_to_test;
Build list of chunks to run from the list of tests.

Is 3 feasible? We'd possibly be running more tests than necessary, but at least we'd have a working solution.

Marco Castelluccio [:marco]

Updated

•

6 years ago

Depends on: 1583353

Marco Castelluccio [:marco]

Updated

•

6 years ago

Comment 5

•

5 years ago

I'm going to WONTFIX this for now. A lot has changed since this bug and I'm not sure this is still relevant given bugbug. I think it's more likely we integrate code coverage into the existing bugbug optimizer than we stand up something separate.

Status: ASSIGNED → RESOLVED

Closed: 5 years ago

Resolution: --- → WONTFIX

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Create a code coverage optimizer

Categories

(Firefox Build System :: Task Configuration, task)

Tracking

(Not tracked)

People

(Reporter: ahal, Assigned: ahal)

References

Details

Crash Data

Security

(public)

User Story

Description

Comment 1

Comment 2

Comment 3

Comment 4

Updated

Updated

Comment 5