Closed Bug 776957 Opened 13 years ago Closed 13 years ago

OMTC: Remove unnecessary frame copies with canvas and layers acceleration

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla17

People

(Reporter: nical, Assigned: nical)

References

Details

Attachments

(3 files, 1 obsolete file)

Use OpenDescriptorForDirectTexturing with CanvasLayerOGL when available. 13 years ago Nicolas Silva [:nical] 1.38 KB, patch	cjones : review-	Details \| Diff \| Splinter Review
Use OpenDescriptorForDirectTexturing with CanvasLayerOGL when available. 13 years ago Nicolas Silva [:nical] 1.66 KB, patch	cjones : review-	Details \| Diff \| Splinter Review
Fixes 13 years ago Chris Jones [:cjones] inactive; ni?/f?/r? if you need me 1.40 KB, patch		Details \| Diff \| Splinter Review
Revert UA changes 13 years ago Chris Jones [:cjones] inactive; ni?/f?/r? if you need me 4.13 KB, patch		Details \| Diff \| Splinter Review

Nicolas Silva [:nical]

Assignee

Description

•

13 years ago

When we draw into a canvas, we draw into "normal" memory (as opposed to shared), then during a layer transaction we copy the frame into shared memory and do a swap operation synchronously. on the compositor side we upload the image to a gl texture. This involves many image buffers and many copies, so we need to reduce the memory and cpu overhead here. One limitation that we have is that our drawing APIs don't separate the image data from the rendering state. This means that it is very hard for us to use a DrawTarget and swap its image data while keeping drawing states. So our solution will most likely need to involve having the canvas context always render in the same buffer. If we use Layers acceleration, we have to upload the canvas's image to gpu memory on the compositor side, so we don't actually need to keep the CanvasSurface (which is a shared image) around after we have done the upload. We can have the canvas context render into shmem, then during a synchronous transaction we do the upload right away on the compositor side so that on the content side we can still use the same shared buffer in our DrawTarget. The best would maybe to be able to do upload on the content side and share the texture between content and compositor processes.

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 1

•

13 years ago

(In reply to Nicolas Silva [:nical] from comment #0) > One limitation that we have is that our drawing APIs don't separate the > image data from the rendering state. This means that it is very hard for us > to use a DrawTarget and swap its image data while keeping drawing states. An Azure DrawTarget has very little drawing state, just the transform and clip, so we could implement buffer swapping pretty easily (either in nsCanvasRenderingContext2DAzure, or as a new Azure API) --- if we have to. If we swap buffers then we still need to copy at least the changed area of the canvas buffer between front and back (and in most performance-intensive demos the changed area is basically the whole canvas), so there's still often going to be one full memcpy of the buffer every frame. > If we use Layers acceleration, we have to upload the canvas's image to gpu > memory on the compositor side, so we don't actually need to keep the > CanvasSurface (which is a shared image) around after we have done the upload. > We can have the canvas context render into shmem, then during a synchronous > transaction we do the upload right away on the compositor side so that on > the content side we can still use the same shared buffer in our DrawTarget. That seems like a reasonable approach. For the case where the canvas is not changing (which there's actually a fair bit of on the Web), and something like gralloc is available, we could potentially use less memory by having a single gralloc buffer shared by the canvas context and the compositor. So it might be more efficient to have a model where once the canvas buffer has been passed to the compositor, it's treated as read-only by the content side until the content needs to modify the canvas again, at which point we copy the compositor's version of the canvas into a new buffer and use that. This model potentially allows for asynchronous transactions, although there would need to be some buffer recycling so reallocating the buffer for a new frame is cheap. Either that model or the sync-transaction-with-upload you suggested would be pretty good, so I don't have a strong opinion on which one you implement.

Nicolas Silva [:nical]

Assignee

Updated

•

13 years ago

Component: Graphics → GFX: Color Management

Nicolas Silva [:nical]

Assignee

Comment 2

•

13 years ago

(In reply to Robert O'Callahan (:roc) (Mozilla Corporation) from comment #1) > (In reply to Nicolas Silva [:nical] from comment #0) > > One limitation that we have is that our drawing APIs don't separate the > > image data from the rendering state. This means that it is very hard for us > > to use a DrawTarget and swap its image data while keeping drawing states. > > An Azure DrawTarget has very little drawing state, just the transform and > clip, so we could implement buffer swapping pretty easily (either in > nsCanvasRenderingContext2DAzure, or as a new Azure API) --- if we have to. The hard part is that if we want to do this, we need to manualy keep track of every clip and transformation, because some of the backends (CoreGraphcis for instance) don't provide a way to get the clips. So Copying the state from a target to another is not easy and some backends don't provide a way to swap the image buffer while keeping the drawing states.

Nicolas Silva [:nical]

Assignee

Comment 3

•

13 years ago

I'm confused, I didn't mean to change the component to color management

Component: GFX: Color Management → Graphics

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 4

•

13 years ago

(In reply to Nicolas Silva [:nical] from comment #2) > The hard part is that if we want to do this, we need to manualy keep track > of every clip and transformation, because some of the backends (CoreGraphcis > for instance) don't provide a way to get the clips. So Copying the state > from a target to another is not easy and some backends don't provide a way > to swap the image buffer while keeping the drawing states. nsCanvasRenderingContext2DAzure already tracks all the clips, so we could support buffer-swapping at that level.

Nicolas Silva [:nical]

Assignee

Comment 5

•

13 years ago

Attached patch Use OpenDescriptorForDirectTexturing with CanvasLayerOGL when available. (obsolete) — Details — Splinter Review

This is a simple enhancement that makes us take advantage of gralloc buffers or other types of "direct texturing" memory when they are used with canvas and gl layers. There are still canvas frame copies that we want to remove.

Assignee: nobody → nsilva

Attachment #648018 - Flags: review?(jones.chris.g)

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Comment 6

•

13 years ago

Comment on attachment 648018 [details] [diff] [review] Use OpenDescriptorForDirectTexturing with CanvasLayerOGL when available. >diff --git a/gfx/layers/opengl/CanvasLayerOGL.cpp b/gfx/layers/opengl/CanvasLayerOGL.cpp >+ if (nsRefPtr<TextureImage> texImage = >+ ShadowLayerManager::OpenDescriptorForDirectTexturing( >+ gl(), aNewFront.get_SurfaceDescriptor(), LOCAL_GL_CLAMP_TO_EDGE)) { >+ >+ mTexImage = texImage; >+ >+ *aNewBack = aNewFront; >+ Following along from ShadowThebesLayerOGL, inside here we want to - destroy the current front buffer if |aNewFront| is different size/content-type - save a reference to |aNewFront| - return the old front buffer in |*aNewBack| r- because this breaks the concurrency model between content / compositor for the gralloc buffers.

Attachment #648018 - Flags: review?(jones.chris.g) → review-

Nicolas Silva [:nical]

Assignee

Comment 7

•

13 years ago

Attached patch Use OpenDescriptorForDirectTexturing with CanvasLayerOGL when available. — Details — Splinter Review

try push looks good https://tbpl.mozilla.org/?tree=Try&rev=f1b67dde45e1

Attachment #648018 - Attachment is obsolete: true

Attachment #650555 - Flags: review?(jones.chris.g)

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Updated

•

13 years ago

Attachment #650555 - Flags: review?(jones.chris.g) → review+

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Comment 8

•

13 years ago

Comment on attachment 650555 [details] [diff] [review] Use OpenDescriptorForDirectTexturing with CanvasLayerOGL when available. Crashes on device I/Gecko ( 363): [Parent 363] ###!!! ABORT: unknown union type: file /home/cjones/mozilla/new-b2g/objdir-gecko/ipc/ipdl/PLayersParent.cpp, line 771 F/libc ( 363): Fatal signal 11 (SIGSEGV) at 0x00000000 (code=1)

Attachment #650555 - Flags: review+ → review-

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Comment 9

•

13 years ago

Attached patch Fixes — Details — Splinter Review

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Comment 10

•

13 years ago

With this patch, we go from ~35fps on TowerJelly to ~50fps, but the fps are a bit of a lie in both cases. Too bad the user agent string changed and we're not getting honest content anymore (16x more pixels than screen size). Will look into backing out UA change and see what real improvement is.

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Comment 11

•

13 years ago

Attached patch Revert UA changes — Details — Splinter Review

Surprisingly nontrivial ...

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Comment 12

•

13 years ago

Definite framerate improvement, but not much qualitative improvement. Measured using the wonky old fps counter, so I might have been seeing a lot of GCs that went undetected.

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Comment 13

•

13 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/92ca5b751abf

Ryan VanderMeulen [:RyanVM]

Comment 14

•

13 years ago

https://hg.mozilla.org/mozilla-central/rev/92ca5b751abf

Status: NEW → RESOLVED

Closed: 13 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla17

You need to log in before you can comment on or make changes to this bug.

Bugzilla

OMTC: Remove unnecessary frame copies with canvas and layers acceleration

Categories

(Core :: Graphics, defect)

Tracking

()

People

(Reporter: nical, Assigned: nical)

References

Details

Crash Data

Security

(public)

User Story

Attachments

(3 files, 1 obsolete file)

Description

Comment 1

Updated

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Updated

Comment 8

Comment 9

Comment 10

Comment 11

Comment 12

Comment 13

Comment 14

Attachment

General

Description

File Name

Content Type