Closed Bug 1616326 Opened 4 years ago Closed 2 years ago

skewed image masks create junk on their edges

Tracking

()

Status:

RESOLVED DUPLICATE of bug 1671784

Project Flags:

Webcompat Priority

People

(Reporter: Gankra, Unassigned)

References

Details

(Keywords: regression)

Attachments

(4 files, 1 obsolete file)

bad-render.png 4 years ago Aria Beingessner [:Gankra] 15.56 KB, image/png		Details
mask-transformed-skew.yaml 4 years ago Aria Beingessner [:Gankra] 642 bytes, application/x-yaml		Details
Bug 1616326 - Use a scissor rect instead of discard for image masks. r?gw 4 years ago Aria Beingessner [:Gankra] 47 bytes, text/x-phabricator-request		Details \| Review
skewtest_p400.png 4 years ago Bert Peers [:bpeers] 25.12 KB, image/png		Details
skewtest_vm.png 4 years ago Bert Peers [:bpeers] 41.15 KB, image/png		Details

Aria Beingessner [:Gankra]

Reporter

Description

•

4 years ago

Attached image bad-render.png — Details

Noticed this while looking into failures from enabling Bug 1555356. It looks like we're incorrectly sampling the mask along its boundary, and showing what should be masked content.

Aria Beingessner [:Gankra]

Reporter

Comment 1

•

4 years ago

Attached file mask-transformed-skew.yaml — Details

simple yaml that demonstrates the issue (when dropped in wrench/reftests/mask/)

Aria Beingessner [:Gankra]

Reporter

Comment 2

•

4 years ago

Hey emilio, it looks like this is caused by https://github.com/servo/webrender/pull/3220

Specifically, https://searchfox.org/mozilla-central/source/gfx/wr/webrender/res/cs_clip_image.glsl#62-65

Do you recall what problem that discard was solving?

Flags: needinfo?(emilio)

Lee Salzman [:lsalzman]

Updated

•

4 years ago

Keywords: regression

Priority: -- → P3

See Also: → https://github.com/servo/webrender/pull/3220

Aria Beingessner [:Gankra]

Reporter

Comment 3

•

4 years ago

for reference, the failing reftest was this (because my change made the entire thing active): https://hg.mozilla.org/mozilla-central/raw-file/tip/layout/tools/reftest/reftest-analyzer.xhtml#logurl=https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/AwYlD_c1QA6gTPqeWRDqJQ/runs/0/artifacts/public/logs/live_backing.log&only_show_unexpected=1

also, kicked off a try build without that discard to see what happens: https://treeherder.mozilla.org/#/jobs?repo=try&revision=4c5e3c8ce621bb1e27b035cf047d2a009e20b116

Emilio Cobos Álvarez (:emilio)

Comment 4

•

4 years ago

I don't recall off-hand. Probably was tryng to avoid incorrectly clipping pixels at the boundary. I suspect it was necessary at the time, but if it doesn't break existing tests added there and such it seems fine to remove to me.

Flags: needinfo?(emilio)

Aria Beingessner [:Gankra]

Reporter

Updated

•

4 years ago

Assignee: nobody → a.beingessner

Aria Beingessner [:Gankra]

Reporter

Comment 5

•

4 years ago

Attached file Bug 1616326 - Use a scissor rect instead of discard for image masks. r?gw (obsolete) — Details

It's unclear what this was accomplishing, but it
prevents us from correctly processing the pixels
on the edge of the mask, causing masked content to
peek through. No tests seem to rely on this
discarding behaviour.

Also added a reftest that's fairly fuzzy but should
suffice as a canary for a regression here.

Pulsebot

Comment 6

•

4 years ago

Pushed by abeingessner@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/ed0609ec00ba
Don't discard mask pixels. r=kvark

Alexandru Michis [:malexandru]

Comment 7

•

4 years ago

Backed out changeset ed0609ec00ba for causing wrench bustages.

Backout link: https://hg.mozilla.org/integration/autoland/rev/4eb218fb5806b04c040ff4bfdd7eb2fce4b8d6cb

Push with failures: https://treeherder.mozilla.org/#/jobs?repo=autoland&group_state=expanded&resultStatus=testfailed%2Cbusted%2Cexception%2Crunnable&tochange=4eb218fb5806b04c040ff4bfdd7eb2fce4b8d6cb&fromchange=ed0609ec00baad84416ee89012c7138907216989&selectedJob=289911955

Failure log: https://treeherder.mozilla.org/logviewer.html#/jobs?job_id=289911955&repo=autoland&lineNumber=18929

Flags: needinfo?(a.beingessner)

Aria Beingessner [:Gankra]

Reporter

Comment 8

•

4 years ago

ah, wonderful "qr" in try-fuzzy gets everything except wrench. whelp, time to investigate

Flags: needinfo?(a.beingessner)

BugBot [:suhaib / :marco/ :calixte]

Comment 9

•

4 years ago

There's a r+ patch which didn't land and no activity in this bug for 2 weeks.
:Gankra, could you have a look please?
For more information, please visit auto_nag documentation.

Flags: needinfo?(a.beingessner)

Aria Beingessner [:Gankra]

Reporter

Comment 10

•

4 years ago

Still working on this, much more involved than we hoped.

Flags: needinfo?(a.beingessner)

Phabricator Automation

Updated

•

4 years ago

Attachment #9127593 - Attachment description: Bug 1616326 - Don't discard mask pixels. r?kvark → Bug 1616326 - Use a scissor rect instead of discard for image masks. r?gw

Aria Beingessner [:Gankra]

Reporter

Comment 11

•

4 years ago

Ok so here's the situation with this bug:

The image mask shader (cs_clip_image.glsl) has a difficult task: it takes in an image
(instanced for each tile if the input image was tiled) that is in local space as
well as a target rectangle in screen space (subrect / actual_rect).

This is a problem because we may have two rectangles in different spaces that we need to render the intersection of. In particular, the subrect is always axis-aligned, but the local rect may not be. The current implementation approaches the problem by having the vertex shader generate geometry for the subrect while using discard in the fragment shader to mask out the shape of the input image (tile). The bug is ultimately that this doesn't properly perform AA on the boundaries of the image (tile), so we can end up rendering garbage there when the edges are non-trivial (when the image is not axis aligned).

An interesting side-effect of this approach is that the image-mask shader does not initialize the pixels of the subrect that aren't touched by the image (tile). This is actually necessary for tiling to work, because each tile instance is unaware of the other tiles, so no instance has the information necessary for us to initialize the pixels outside our own tile. It is conjectured that this is "fine" because anything that uses this image-mask will naturally clip itself to the image's rect, and therefore not source the out-of-bounds pixels (which contain whatever garbage was in the texture beforehand).

I was working on changing our approach so that the vertex shader generates geometry for the local image (tile), and use a scissor rect to clip to the subrect.

The benefit of this is that we could use the same edge AA logic we use for other prims, and the GPU should theoretically be able to do less work because it can clearly see the geometry and avoid fragment shading, instead of relying on discard

The downside of this is that changing the scissor rect breaks batching. This wouldn't hurt the tiled case, as each tile shares the same subrect, but it would hurt batching of other alpha tasks together. A potential optimization we can perform is to identify when the image rect is axis-aligned, in which case we can avoid using a scissor and instead make the vertices represent the intersection of the subrect and the image rect.

My patch is incomplete in a few regards:

I was still just testing out the approach, so my implementation is a bit messy
I hadn't integrated my approach with the rest of the cs_clip_shared family of shaders, but I'm not sure
that's actually necessary/desirable. cs_clip_image is the only one that handles tiling, so it's genuinely a bit different.
I hadn't exactly finished figuring out how to properly apply the prim_transform and clip_transform; there is some suggestion that this is legacy over-engineered for things gecko won't ever emit.
I hadn't yet piped through the logic for the scissor rect, which I had been intending to ask gw to handle (since he seemed to think he could do it relatively easily, while it would have taken me a while to learn all the relevant code)
I hadn't implemented the axis-aligned optimization, although I was intending to only seriously consider that as a follow-up if the theoretical batching issues show up for real.

Aria Beingessner [:Gankra]

Reporter

Updated

•

4 years ago

Blocks: 1555356

Aria Beingessner [:Gankra]

Reporter

Comment 12

•

4 years ago

needs reassignment

Assignee: a.beingessner → nobody

Flags: needinfo?(jbonisteel)

Jessie [:jbonisteel] pls NI

Updated

•

4 years ago

Blocks: gfx-triage

Flags: needinfo?(jbonisteel)

Jeff Muizelaar [:jrmuizel]

Updated

•

4 years ago

No longer blocks: gfx-triage

Bert Peers [:bpeers]

Comment 13

•

4 years ago

•

Edited

Attached image skewtest_p400.png — Details

I'm a bit confused as to the status of this bug. I cannot repro the problem (attached Quadro P400, same with HD630 and GTX1060), ~~and then I noticed the patch has landed, which I suppose explains it?~~ nope it was backed out.

Is this bug platform specific? Did we fix it some other way? Am I just lucky :)

Sorry if I miss something obvious, I'm still catching up.

(Edit2: was going to test in a VM on Ubuntu but running into build issues)

Flags: needinfo?(jbonisteel)

Jessie [:jbonisteel] pls NI

Comment 14

•

4 years ago

Checking with Alexis to see if we can get clarity on the status

Bert Peers [:bpeers]

Comment 15

•

4 years ago

Attached image skewtest_vm.png — Details

Alexis still sees the problem on Mac, but it's more subtle than before.

I tested in a VM, no repro (attached - same in release build). That's all the platforms I can test. Since Mac is not officially a target for WR, I'll rebase 1555356 and try it and see if it's landable.
It's possible we just get lucky in wrench with the driver clearing allocated rendertargets to a value that happens to work.

bpeers@ubuntu:~/src/gecko/gfx/wr/wrench$ cargo run -- -p1.5 show ~/skew.yaml
    Finished dev [unoptimized + debuginfo] target(s) in 0.11s
     Running `/home/bpeers/src/gecko/gfx/wr/target/debug/wrench -p1.5 show /home/bpeers/skew.yaml`
OpenGL version 3.3 (Core Profile) Mesa 19.2.8, SVGA3D; build: RELEASE;  LLVM;
hidpi factor: 1.5 (native 1)

Flags: needinfo?(jbonisteel)

Phabricator Automation

Updated

•

4 years ago

Attachment #9127593 - Attachment is obsolete: true

Dennis Schubert [:denschub]

Updated

•

2 years ago

Webcompat Priority: --- → P3

See Also: → https://github.com/webcompat/web-bugs/issues/98306

Glenn Watson [:gw]

Updated

•

2 years ago

Status: NEW → RESOLVED

Closed: 2 years ago

Resolution: --- → DUPLICATE

You need to log in before you can comment on or make changes to this bug.