reftest-analyzer disagrees with TreeHerder logs on differing pixel count for fuzzy failures
Categories
(Testing :: Reftest, defect, P2)
Tracking
(firefox-esr68 unaffected, firefox75 wontfix, firefox76 wontfix, firefox77 wontfix, firefox78 fixed)
Tracking | Status | |
---|---|---|
firefox-esr68 | --- | unaffected |
firefox75 | --- | wontfix |
firefox76 | --- | wontfix |
firefox77 | --- | wontfix |
firefox78 | --- | fixed |
People
(Reporter: dholbert, Assigned: kats)
References
(Regression)
Details
(Keywords: regression)
Attachments
(1 file)
STR:
(1) View this reftest log: https://treeherder.mozilla.org/logviewer.html#/jobs?job_id=299100920&repo=mozilla-beta&lineNumber=4432 and note how many pixels are reported as differing.
(2) Click "open analyzer" (or visit direct link)
(3) Click the "outline.html == [etc]` text for the reftest failure, to show the mismatching image view.
(4) Scroll down.
ACTUAL RESULTS:
The reftest log says: max difference: 7, number of differing pixels: 214
...whereas reftest-analyzer says: Maximum difference per channel 7, 140 pixels differ
EXPECTED RESULTS:
They should agree on how many pixels differ.
(It looks like this numerical disagreement is unexpected, per bug 1614904 comment 2 - 3)
Reporter | ||
Comment 1•4 years ago
•
|
||
I saw this exact issue for 4 logs that were all about different occurrences of this same sort of failure in this particular layout/reftests/svg/outline.html
reftest:
https://treeherder.mozilla.org/logviewer.html#?job_id=299100920&repo=mozilla-beta [this is the log from comment 0]
https://treeherder.mozilla.org/logviewer.html#?job_id=298861634&repo=autoland
https://treeherder.mozilla.org/logviewer.html#?job_id=298559973&repo=autoland
https://treeherder.mozilla.org/logviewer.html#?job_id=298417320&repo=mozilla-central
Reporter | ||
Updated•4 years ago
|
Assignee | ||
Comment 2•4 years ago
|
||
Interesting. When I load the reftest analyzer I get 7,214 as expected, using FF 75.0 build 20200331175109 on macOS with webrender enabled. On latest Nightly with WR enabled, same machine, I get 7,139 as the difference.
Comment 3•4 years ago
|
||
The only difference I'm aware of is that the reftest-analyzer ignores the alpha channel, assuming there aren't any transparent pixels. But changing that locally didn't make any difference in one of the problematic cases.
Assignee | ||
Comment 4•4 years ago
|
||
Do you have cycles to take this bug and investigate further? If not maybe we can disable this feature for now. I'm not sure it's worth displaying those numbers if they can't be relied on.
Assignee | ||
Comment 5•4 years ago
|
||
I'll probably disable the feature later this week if I don't hear back, assuming you won't have time in the near future to investigate.
Assignee | ||
Comment 6•4 years ago
|
||
Updated•4 years ago
|
Assignee | ||
Updated•4 years ago
|
Pushed by kgupta@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/594d2cecc8cf Disable showing pixel difference numbers in the reftest analyzer. r=dholbert
Comment 8•4 years ago
|
||
Kats, one issue is that WPT reftest don't show the pixel count in the log, so this is the only way to get information from them. And the fuzzy match probably matches the WPT version. So this count is useful if you're looking at WPT reftests instead of regular reftests.
Comment 9•4 years ago
|
||
wpt does show it in the logs e.g. [1], although perhaps it's harder to find. And on the backend the wpt reftests are using the same code as the reftest harness reftests, so there shouldn't be a difference there. I didn't yet get a chance to investigate whether the problem is that the algorithm in the reftest analyzer is somehow different from the one that the harnesses use or if the problem is that we're outputting images that aren't exactly matching the internal canvas in some cases (or something else). The algorithm for comparison seems simple enough, and the numbers in the reftest analyzer are right in the cases that I've used so it's possible that there's something more subtle going on and this is only a symptom.
Comment 11•4 years ago
|
||
bugherder |
Updated•4 years ago
|
Description
•