1669239 - Schedule the next tick to paint in PuppetWidget::Paint instead of painting directly

Assignee

Description

•

4 years ago

The WillPaintWindow call is problematic for FirstContentfulPaint
algorithm because it causes paints that outside of ticks, which would
revoke the viewer flush for the next tick, hence no firstContentful
paint entries are fired.

The proposed solution here is instead of doing the paint directly in
PuppetWidget::Paint, we ask the next tick to do it.

Sean Feng [:sefeng]

Assignee

Comment 1

•

4 years ago

Attached file Bug 1669239 - Call Tick to paint in PuppetWidget::Paint instead of using WillPaintWindow — Details

The WillPaintWindow call is problematic for FirstContentfulPaint
algorithm because it causes paints that outside of ticks, which would
revoke the viewer flush for the next tick, hence no firstContentful
paint entries are fired.

However, this WillPaintWindow call is also useful because it allows
the parent to aware that the layers are updated, so that the parent
can fire corresponding events based on the status. If we don't do it,
parent may end up not knowing that the layers are updated, so some
tests may stall because of the missing events.

The proposed solution here is instead of doing the paint directly in
PuppetWidget::Paint, we ask the next tick to do it.

Phabricator Automation

Updated

•

4 years ago

Attachment #9179621 - Attachment description: Bug 1669239 - Schedule the next tick to paint in PuppetWidget::Paint instead of painting directly → Bug 1669239 - Call Tick to paint in PuppetWidget::Paint instead of using WillPaintWindow

Pulsebot

Comment 2

•

4 years ago

Pushed by sefeng@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/604e2561311f
Call Tick to paint in PuppetWidget::Paint instead of using WillPaintWindow r=emilio,mattwoodrow

Alexandru Michis [:malexandru]

Comment 3

•

4 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/604e2561311f

Status: NEW → RESOLVED

Closed: 4 years ago

status-firefox83: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → 83 Branch

Sebastian Hengst [:aryx] (needinfo me if it's about an intermittent or backout)

Updated

•

4 years ago

Regressions: 1671488

Emilio Cobos Álvarez (:emilio)

Updated

•

4 years ago

Regressions: 1671592

Darkspirit

Updated

•

4 years ago

Regressions: 1672139

Alexandru Ionescu (needinfo me) [:alexandrui]

Comment 4

•

4 years ago

Sean, there's an improvement that seems to be caused by your patch, but around this revision it's also a build failure that prevents me from figuring out for certain what's the paych that caused the improvement. Does the improvement below makes sense to be caused by this patch?
== Change summary for alert #27381 (as of Mon, 26 Oct 2020 21:12:38 GMT) ==

Improvements:

Ratio	Suite	Test	Platform	Options	Absolute values (old vs new)
4%	tresize		macosx1014-64-shippable	e10s stylo	13.02 -> 12.52

For up to date results, see: https://treeherder.mozilla.org/perfherder/alerts?id=27381

Flags: needinfo?(sefeng)

Sean Feng [:sefeng]

Assignee

Comment 5

•

4 years ago

Yeah, the test basically measures the time for window.resizeTo by using MozAfterPaint event, which is indeed something that this patch can impact. Ahthough I am a bit suprised of the improvement, but I think it's possible.

Flags: needinfo?(sefeng)

Alexandru Ionescu (needinfo me) [:alexandrui]

Comment 6

•

4 years ago

Actually, if I look more closely, there's a slight improvement in results and the graph became bimodal.

Alexandru Ionescu (needinfo me) [:alexandrui]

Comment 7

•

4 years ago

So I tend ore to say that the improvement might be a regression in terms of graph stability. What do you say? Does this make sense to you?

Flags: needinfo?(sefeng)

Sean Feng [:sefeng]

Assignee

Comment 8

•

4 years ago

Yeah, it makes sense to call it an regression in terms of graph stability. This makes sense since the patch started to schedule a paint at where it was painting directly.

I am not worrying about it since the overall performance is better. Would it be okay for you?

Flags: needinfo?(sefeng)

Alexandru Ionescu (needinfo me) [:alexandrui]

Comment 9

•

4 years ago

Sorry for replying this late. I am a bit worried as the noise is one of the main factors that are preventing us to identify the regression cause.:sefen

Flags: needinfo?(sefeng)

Alexandru Ionescu (needinfo me) [:alexandrui]

Comment 10

•

4 years ago

Wait, did this test stopped running on autoland since more than a month ago?

Sean Feng [:sefeng]

Assignee

Comment 11

•

4 years ago

This patch indeed makes the paint from sync to async, so it makes sense when the tests become nosier. I am not sure if there are things that we can do to improve the graph stability.

Alexandru, the links seem to be broken so I can't check the current status.

Flags: needinfo?(sefeng)

Alexandru Ionescu (needinfo me) [:alexandrui]

Comment 12

•

4 years ago

https://treeherder.mozilla.org/perfherder/graphs?highlightAlerts=1&series=autoland,2056535,1,1&timerange=5184000&zoom=1602225032387,1604469402591,11.638130248281701,14.13227760840817

Flags: needinfo?(sefeng)

Sean Feng [:sefeng]

Assignee

Comment 13

•

4 years ago

I think I get what you meant. Are you worrying that we might not be able to identify future regressions due to the test becomes nosier? I wish we can still identify regressions for nosier benchmarks....

Honestly, I don't know if there are things that we can do to improve it. What do we usually do for cases like this?

Flags: needinfo?(sefeng) → needinfo?(aionescu)

Alexandru Ionescu (needinfo me) [:alexandrui]

Comment 14

•

4 years ago

The performance team is trying to stick the sheriffed tests as stable as possible. A stable test means also a healthy one. I'm not sure what you can do, cases like this don't happen very often and there's no procedure for them. I would look for way to reduce noise, maybe a better approach for the code.
Can you explain is a simple way what is this patch doing, please? My gut feeling is that there's a tradeoff (well, it's always a tradeoff, haha).

Flags: needinfo?(aionescu) → needinfo?(sefeng)

Sean Feng [:sefeng]

Assignee

Comment 15

•

4 years ago

So PuppetWidget::Invalidate was a method that was being used for tab switches and tab resizes. It would make a paint and this paint was a prerequisite for a bunch of painting related events to fire. These events were used in many places, I believe this particular test uses MozAfterPaint.

The issue with this approach was that this paint was executed outside of the Tick, so it messed up first-contentful-paint timing because the spec required paints to happen within the Tick.

So instead of doing paints directly from PuppetWidget::Invalidate, this patch changed it to schedule paints to run at the next Tick by scheduling a Tick.

Since this patch actually scheduled paints to run later than before, it's not clear to me how the performance got improved. I wonder if it's caused by the scheduled paint actually makes MozAfterPaint to be fired earlier because we started to schedule a tick earlier.

Does it sound good?

Flags: needinfo?(sefeng)

Markus Stange [:mstange]

Updated

•

3 years ago

Regressions: 1689156