Closed Bug 705559 Opened 13 years ago Closed 13 years ago

Speed up canvas-to-canvas image drawing with Azure/D2D

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla11

People

(Reporter: bzbarsky, Assigned: roc)

References

(
URL
)

Details

(Whiteboard: [inbound])

Attachments

(5 files, 2 obsolete files)

Part 1: nsCanvasRenderingContext2DAzure cleanups 13 years ago Robert O'Callahan (:roc) (email my personal email if necessary) 6.29 KB, patch	jrmuizel : review+	Details \| Diff \| Splinter Review
performance test 13 years ago Robert O'Callahan (:roc) (email my personal email if necessary) 973 bytes, text/html		Details
Part 2: DrawTargetD2D fix 13 years ago Robert O'Callahan (:roc) (email my personal email if necessary) 12.99 KB, patch		Details \| Diff \| Splinter Review
Part 3: use mTransformDirty 13 years ago Robert O'Callahan (:roc) (email my personal email if necessary) 3.49 KB, patch	bas.schouten : review+	Details \| Diff \| Splinter Review
Part 4: speed up AddDependencyOnSource 13 years ago Robert O'Callahan (:roc) (email my personal email if necessary) 842 bytes, patch	bas.schouten : review+	Details \| Diff \| Splinter Review
Part 2 v2 13 years ago Robert O'Callahan (:roc) (email my personal email if necessary) 12.99 KB, patch		Details \| Diff \| Splinter Review
part 2 v3 13 years ago Robert O'Callahan (:roc) (email my personal email if necessary) 13.07 KB, patch	bas.schouten : review+	Details \| Diff \| Splinter Review

Boris Zbarsky [:bzbarsky]

Reporter

Description

•

13 years ago

Because it relies on having an imgIRequest, as far as I can tell.  Or at least the place where we fill the cache is conditioned on that.

This may be what causes the performance slowdown at http://stackoverflow.com/questions/8286491/performance-issue-with-off-screen-canvas-in-firefox

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 1

•

13 years ago

Yes, the canvas image cache has never worked with canvas sources.

Boris Zbarsky [:bzbarsky]

Reporter

Updated

•

13 years ago

URL: http://www.crystalgears.com/isoengine...

Boris Zbarsky [:bzbarsky]

Reporter

Comment 2

•

13 years ago

roc points out that we don't really need the image cache here.  All we really need to do is fast-path the case when |canvas| is not null and |!mCanvasElement || canvas->OwnerDoc() == mCanvasElement->GetOwnerDoc()|.  And probably mCanvasElement != canvas, so we don't have to worry about the SFE_WANT_NEW_SURFACE bit.

At that point we should be taking the "simple" path through the <canvas> part of nsLayoutUtils::SurfaceFromElement, but even that has all sorts of complications with multiple contexts, etc.  Is there a sane common case here that would be fast to check for and cover the vast majority of cases?

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 3

•

13 years ago

I'm not sure if this is meant to be filed as Mac-only. I'm going to steal it for Windows on the assumption that that's what the stackoverflow issue was about.

OS: Mac OS X → Windows 7

Summary: Canvas image cache doesn't work when passing a <canvas> element to drawImage → Speed up canvas-to-canvas image drawing with Azure/D2D

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 4

•

13 years ago

On Windows, the immediate performance problems here are unrelated to the security checks. The problem is poor Azure-D2D performance. In particular, DrawTargetD2D->Snapshot followed by DrawSurface of the snapshot allocates a new snapshot with new GPU resources every time, even if the underlying DrawTarget state hasn't changed since the last snapshot.

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 5

•

13 years ago

Attached patch Part 1: nsCanvasRenderingContext2DAzure cleanups — Details — Splinter Review

Some small code cleanup to eliminate unnecessary branches etc. Might make some tiny performance improvements.

Assignee: nobody → roc

Attachment #577556 - Flags: review?

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Updated

•

13 years ago

Attachment #577556 - Flags: review? → review?(jmuizelaar)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 6

•

13 years ago

Attached file performance test — Details

BTW this is the canvas-to-canvas drawing performance test I'm using. The goal here is to get the canvas number close to or better than the image number --- and both of them as low as possible :-).

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 7

•

13 years ago

Attached patch Part 2: DrawTargetD2D fix (obsolete) — Details — Splinter Review

I don't understand why mSnapshots is currently a vector. We clear it when the target changes, so all the snapshot objects in the vector must represent the same surface contents. So let's just have a single snapshot object and let Snapshot() return the existing snapshot if there is one. That makes all the difference in this test; the mBitmap only needs to be created once, and everything else is fast.

Well, almost everything. mDependentTargets and mDependingOnTargets get big. So let's use hash-sets for those.

We need to keep the last mSnapshot alive otherwise it can be destroyed when the DrawImage call has finished using it. So make DrawTargetD2D hold a ref to it, and avoid cycles by having the snapshot surface store a non-addrefed pointer to its DrawTargetD2D.

Attachment #577564 - Flags: review?(bas.schouten)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 8

•

13 years ago

Attached patch Part 3: use mTransformDirty — Details — Splinter Review

This flag was designed to allow us to avoid SetTransform calls. I don't know why we aren't doing that. This fixes that and speeds up both the image and canvas cases of my test by about 10% (since we never need to change the transform in this test).

Attachment #577565 - Flags: review?(bas.schouten)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 9

•

13 years ago

With this patch, I get image times of about 1150ms and canvas times of about 1400ms. That's for 200K draws, so not too bad. Profiling shows that much of the remaining difference is mostly in the AddDependencyOnSource call in DrawTargetD2D, which is now about 10% of the profile. CanvasUtils::DoDrawImageSecurityCheck is still only about a few % of the profile (hard to tell exactly how much because inlining or something else is throwing off xperf).

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 10

•

13 years ago

Attached patch Part 4: speed up AddDependencyOnSource — Details — Splinter Review

Simple stuff, but it drops the canvas time to around 1250ms.

Probably good enough for now.

Attachment #577578 - Flags: review?(bas.schouten)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 11

•

13 years ago

Attached patch Part 2 v2 (obsolete) — Details — Splinter Review

Previous version failed to initialize mDrawTarget in SourceSurfaceD2DTarget.

Attachment #577564 - Attachment is obsolete: true

Attachment #577564 - Flags: review?(bas.schouten)

Attachment #577580 - Flags: review?(bas.schouten)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 12

•

13 years ago

Attached patch part 2 v3 — Details — Splinter Review

Refresh patch correctly

Attachment #577580 - Attachment is obsolete: true

Attachment #577580 - Flags: review?(bas.schouten)

Attachment #577581 - Flags: review?(bas.schouten)

Boris Zbarsky [:bzbarsky]

Reporter

Comment 13

•

13 years ago

> I'm not sure if this is meant to be filed as Mac-only.

That's just the usual bugzilla stupidity.

Jeff Muizelaar [:jrmuizel]

Comment 14

•

13 years ago

Comment on attachment 577556 [details] [diff] [review]
Part 1: nsCanvasRenderingContext2DAzure cleanups

Can you describe what's changing here in a bit more detail? Should this have any functionality changes?

Attachment #577556 - Flags: review?(jmuizelaar)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 15

•

13 years ago

It's just refactoring.

We skip the canvas->CountContexts() == 1 checks because canvases currently can only have 0 or 1 contexts, so checking the canvas->GetContextAtIndex(0) result for null is equivalent.

We reorder some code so that two "if (canvas)" checks get fused into one.

The self-copy test gets moved inside the "if (canvas)" block after we've gotten srcCanvas, the rendering context, so we can simplify the self-copy test to "if (srcCanvas == this)".

nsLayoutUtils::SurfaceFromElementResult doesn't need to set imgsurf, we can use the surface in 'res' directly. Then imgsurf is only needed for the CanvasImageCache lookup, so we move it into there. It doesn't need to addref, since if non-null, it matches an entry in the cache which is holding a reference.

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Updated

•

13 years ago

Attachment #577556 - Flags: review?(jmuizelaar)

Boris Zbarsky [:bzbarsky]

Reporter

Updated

•

13 years ago

Blocks: 707143

Jeff Muizelaar [:jrmuizel]

Updated

•

13 years ago

Attachment #577556 - Flags: review?(jmuizelaar) → review+

Bas Schouten (:bas.schouten)

Comment 16

•

13 years ago

Comment on attachment 577565 [details] [diff] [review]
Part 3: use mTransformDirty

Review of attachment 577565 [details] [diff] [review]:
-----------------------------------------------------------------

I have a patch like this on the graphics project branch. Note that when I last tested this, this made FishIE a little bit slower actually. I never investigated why (presumably this branch is poorly predicted in some cases?)

Attachment #577565 - Flags: review?(bas.schouten) → review+

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 17

•

13 years ago

(In reply to Bas Schouten (:bas) from comment #16)
> I have a patch like this on the graphics project branch. Note that when I
> last tested this, this made FishIE a little bit slower actually. I never
> investigated why (presumably this branch is poorly predicted in some cases?)

I find that hard to believe, but OK!

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 18

•

13 years ago

https://tbpl.mozilla.org/?tree=Try&rev=953ed8559efc

Bas Schouten (:bas.schouten)

Comment 19

•

13 years ago

Comment on attachment 577581 [details] [diff] [review]
part 2 v3

Review of attachment 577581 [details] [diff] [review]:
-----------------------------------------------------------------

Let's add a documentation in 2D.h that the returned snapshot object can be a cached object and is not necessarily uniquely owned by the user.

::: gfx/2d/DrawTargetD2D.cpp
@@ +191,5 @@
>  
> +  if (mSnapshot && !mSnapshot->hasOneRef()) {
> +    // If we hold the only reference, just let it die. Otherwise we have to
> +    // detach it.
> +    mSnapshot->DrawTargetWillChange();

Let's just set mSnapshot->mDrawTarget = NULL; Copying mTexture is a waste since we know it'll never change and can happily be kept alive by the snapshot.

We could actually just do mSnapshot->MarkIndependent() here unconditionally I think, it'll clear our reference to the snapshot and the snapshot's reference to the DrawTarget.

@@ +217,5 @@
> +  if (!mSnapshot) {
> +    mSnapshot = new SourceSurfaceD2DTarget();
> +    mSnapshot->mFormat = mFormat;
> +    mSnapshot->mTexture = mTexture;
> +    mSnapshot->mDrawTarget = this;

Let's just move these initializations into the constructor while we're at it.

Attachment #577581 - Flags: review?(bas.schouten) → review+

Bas Schouten (:bas.schouten)

Comment 20

•

13 years ago

Comment on attachment 577578 [details] [diff] [review]
Part 4: speed up AddDependencyOnSource

Review of attachment 577578 [details] [diff] [review]:
-----------------------------------------------------------------

Hrm, I'm fine with this patch, but doesn't this just -add- one operation in the !mDependingOnTargets.count(aSource->mDrawTarget) case (it does a count, and two inserts), and remove one operation in the mDependingOnTargets.count(aSource->mDrawTarget) case (i.e. it only does a count, instead of two inserts). Insert should not do anything if an entry is already found in the hash set?

Attachment #577578 - Flags: review?(bas.schouten) → review+

Bas Schouten (:bas.schouten)

Comment 21

•

13 years ago

(In reply to Robert O'Callahan (:roc) (Mozilla Corporation) from comment #17)
> (In reply to Bas Schouten (:bas) from comment #16)
> > I have a patch like this on the graphics project branch. Note that when I
> > last tested this, this made FishIE a little bit slower actually. I never
> > investigated why (presumably this branch is poorly predicted in some cases?)
> 
> I find that hard to believe, but OK!

Yeah, so do I. It confused me quite a bit since we introduced this based on earlier profiles! Anyway, since I was optimizing for FishIE/FishBowl and friends which change transforms often, I stopped using it because of the minor perf improvement.

I started using it again on the graphics project branch due to hitting cases where the transform didn't change a lot and it made us faster. And haven't seen a noticable slowdown on that branch in FishIE and friends. So we should be fine.

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 22

•

13 years ago

(In reply to Bas Schouten (:bas) from comment #19)
> Let's just set mSnapshot->mDrawTarget = NULL; Copying mTexture is a waste
> since we know it'll never change and can happily be kept alive by the
> snapshot.
> 
> We could actually just do mSnapshot->MarkIndependent() here unconditionally
> I think, it'll clear our reference to the snapshot and the snapshot's
> reference to the DrawTarget.

Good point.

> @@ +217,5 @@
> > +  if (!mSnapshot) {
> > +    mSnapshot = new SourceSurfaceD2DTarget();
> > +    mSnapshot->mFormat = mFormat;
> > +    mSnapshot->mTexture = mTexture;
> > +    mSnapshot->mDrawTarget = this;
> 
> Let's just move these initializations into the constructor while we're at it.

OK.

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 23

•

13 years ago

(In reply to Bas Schouten (:bas) from comment #20)
> Hrm, I'm fine with this patch, but doesn't this just -add- one operation in
> the !mDependingOnTargets.count(aSource->mDrawTarget) case (it does a count,
> and two inserts), and remove one operation in the
> mDependingOnTargets.count(aSource->mDrawTarget) case (i.e. it only does a
> count, instead of two inserts). Insert should not do anything if an entry is
> already found in the hash set?

Yes, but it makes a huge difference in practice. From profiles it looks to me like insert allocates memory even if the element is already in the set.

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 24

•

13 years ago

Some test failures on try that I need to look into.

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 25

•

13 years ago

> We could actually just do mSnapshot->MarkIndependent() here unconditionally I
> think, it'll clear our reference to the snapshot and the snapshot's reference to
> the DrawTarget.

Yeah, that works. I'm using a death-grip RefPtr to keep the snapshot object alive while we call MarkIndependent. Don't want MarkIndependent's setting of mDrawTarget->mSnapshot = NULL to cause the snapshot to die while we're in one of its methods.

The test failures were caused by DrawTargetD2D's destructor calling the mSnapshot RefPtr's destructor, which unrefs the SourceSurfaceD2DTarget and destroys that; ~SourceSurfaceD2DTarget runs MarkIndependent, which goes back to the DrawTargetD2D and sets it's mSnapshot to NULL, unreffing the SourceSurfaceD2DTarget *again*. Fixed this by observing that the SourceSurfaceD2DTarget destructor actually doesn't need to do anything: *something* must be clearing DrawTargetD2D::mSnapshot or the snapshot object wouldn't be destroyed, so ~SourceSurfaceD2DTarget doesn't need to do it again.

https://tbpl.mozilla.org/?tree=Try&rev=8fb88a4885eb

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 26

•

13 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/58146fa12d76
https://hg.mozilla.org/integration/mozilla-inbound/rev/9a100a5d2796
https://hg.mozilla.org/integration/mozilla-inbound/rev/c635867cc091
https://hg.mozilla.org/integration/mozilla-inbound/rev/40a13154e0ba

Whiteboard: [inbound]

Ed Morley [:emorley]

Comment 27

•

13 years ago

https://hg.mozilla.org/mozilla-central/rev/58146fa12d76
https://hg.mozilla.org/mozilla-central/rev/9a100a5d2796
https://hg.mozilla.org/mozilla-central/rev/c635867cc091
https://hg.mozilla.org/mozilla-central/rev/40a13154e0ba

Status: NEW → RESOLVED

Closed: 13 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla11

You need to log in before you can comment on or make changes to this bug.