Open Bug 1383515 Opened 8 years ago Updated 3 years ago

web-platform-test expected crashes don't show nearly enough indications that they are expected

Categories

(Testing :: web-platform-tests, defect)

Tracking

(Not tracked)

People

(Reporter: aryx, Unassigned)

Details

Treeherder just displays the job status provided to it by buildbot/taskcluster, which normally rely solely on the exit code of the harness. As such, this isn't a bug in Treeherder, so moving to web-platform-tests.
Component: Treeherder: Data Ingestion → web-platform-tests
Product: Tree Management → Testing
Version: --- → unspecified
Summary: treeherder reports some crashes as passing → Some web-platform-tests contain crashes even though they are green
I mean, this is by design. web-platform-tests just records the status on import and tells you if that changes. Any other setup wouldn't allow us to import tests at all, because we would be blocked on fixing issues (or would just disable the tests, which is worse). We are working on importing in smaller batches, which will also allow creating bugs to inform people of the status when tests are imported and don't pass.
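To illustrate the behaviour described above, here is a minimal sketch of "record the status on import, report only changes". It is not the actual wptrunner code: the function names, the test paths and the status strings are assumptions chosen for illustration, and the exit-code rule simply mirrors the earlier comment that the job status normally follows the harness exit code.

```python
from typing import Dict, List, Tuple


def unexpected_results(
    expected: Dict[str, str],
    actual: Dict[str, str],
    default: str = "PASS",
) -> List[Tuple[str, str, str]]:
    """Return (test, expected_status, actual_status) for every test whose
    actual status differs from the status recorded at import time.

    Tests without a recorded status are assumed to be expected to pass.
    """
    changed = []
    for test, status in sorted(actual.items()):
        want = expected.get(test, default)
        if status != want:
            changed.append((test, want, status))
    return changed


def harness_exit_code(expected: Dict[str, str], actual: Dict[str, str]) -> int:
    """Hypothetical rule: exit non-zero only when some result is unexpected."""
    return 1 if unexpected_results(expected, actual) else 0


# Hypothetical data: one test was recorded as crashing when it was imported.
recorded = {"/webrtc/example.html": "CRASH"}

# The crash still happens, but it matches the recorded status, so the run is green.
same_as_import = {"/webrtc/example.html": "CRASH", "/dom/other.html": "PASS"}

# The test now fails in a different way, so the run goes orange.
changed_run = {"/webrtc/example.html": "TIMEOUT", "/dom/other.html": "PASS"}
```

Under these assumptions, `harness_exit_code(recorded, same_as_import)` is 0 even though the log contains a crash, while `harness_exit_code(recorded, changed_run)` is 1. That is why an expected crash can sit in the log of a green job, and why it can be the first error in the log of a job that went orange for an unrelated reason.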
OK, but that makes it difficult to see why a test run was set to orange, e.g. when the first error in the log is always the expected one. Fortunately, that's rare at the moment.
Well, no, not particularly rare. Consider, for instance, that Mac Stylo wpt10 has been permaorange since last Thursday. Bug 1386604 landed a bunch of webrtc tests that we totally fail, but we fail them differently on Mac Stylo than we fail them elsewhere, and the annotations failed to reflect that. Yet that webrtc permaorange has been starred as the completely expected crash, bug 1387684, ever since. And after noticing that, I discovered yet another one (which I've since lost track of) where every instance, 32 in the last week, was actually another unfiled failure.
Summary: Some web-platform-tests contain crashes even though they are green → web-platform-test expected crashes don't show nearly enough indications that they are expected
Severity: critical → S2