Popup windows from toolbar draws incorrectly
Categories
(Core :: Graphics: WebRender, defect)
Tracking
()
People
(Reporter: aosmond, Assigned: aosmond)
References
(Regression)
Details
(Keywords: regression)
Attachments
(6 files, 1 obsolete file)
This happens to me on Ubuntu 20.04 with my AMD Juniper card, but seemingly not on Ubuntu 18.04 with my Haswell GPU. When the GPU process was disabled, this started happening. I'll see if I can isolate this further.
STR:
- Explicitly disable GPU process if on older builds.
- Go to a random website. I used https://soylentnews.org/.
- Click on the Firefox Account button in the toolbar. Almost always happens right away, but I click a few times just in case I don't see it.
- Observe issue.
Assignee | ||
Updated•4 years ago
|
Comment 1•4 years ago
|
||
If this only happens with WR this might have the same cause as bug 1650246
Assignee | ||
Comment 2•4 years ago
|
||
mozregression --good 2020-01-01 --bad 2020-07-30 --pref gfx.webrender.all:true layers.gpu-process.enabled:false
It aborted before it could finish but I got:
Comment 3•4 years ago
|
||
This is the GLX appearance of bug 1650246. So I assume disabling the GPU process only made it more likely to occur.
(Jan Andre Ikenmeyer [:darkspirit] from bug 1650246 comment 0)
With GLX, the top-left quarter of these menus is sometimes cut off (transparent), but with EGL they seem to be incorrectly sized until you hover them.
Assignee | ||
Comment 4•4 years ago
|
||
If it makes it easier to reproduce, I'll take it. I can make it happen very easily, although it is not my dev machine, which makes it slightly less convenient for manually rebuilding.... I am currently eyeing bug 1612440 as a candidate.
Assignee | ||
Comment 5•4 years ago
|
||
Of course, if it is just a race as nical speculates in the other bug, that might not be the most useful regression bug ever :).
Assignee | ||
Comment 6•4 years ago
|
||
I'm not quite sure if they are one and the same..... if I run with LIBGL_ALWAYS_SOFTWARE=1, I see the symptoms observed in that bug, partial drawing of the popup, and at the wrong origin, which fits with nicals assessment.
In my case, it looks like it is pulling something random of the texture cache to fill in the upper left hand gap. It often looks different when I reproduce.
Comment 7•4 years ago
•
|
||
Gnome XWayland, Debian Testing, 2560x1440@60Hz, Intel HD Graphics 630 (KBL GT2)
Make sure that no input is focused, no caret is blinking, no text is highlighted and close the "Welcome to Nightly" message. Then click like crazy on the account menu button.
mozregression --good 2020-05-04 --bad 2020-05-07 --pref gfx.webrender.all:true layers.gpu-process.enabled:false
9:34.75 INFO: Last good revision: 74d50028caec9d5856a173c98a6172700f1ccc29
9:34.75 INFO: First bad revision: 6a43f985307516ec1ae2d413a39cd6d813560b8b
9:34.75 INFO: Pushlog:
https://hg.mozilla.org/integration/autoland/pushloghtml?fromchange=74d50028caec9d5856a173c98a6172700f1ccc29&tochange=6a43f985307516ec1ae2d413a39cd6d813560b8b
6a43f985307516ec1ae2d413a39cd6d813560b8b sotaro — Bug 1574746 - Remove AllowWebRenderForThisWindow() r=nical
This is just the commit which enabled WebRender for search results, autoscroll icon, toolbar menus like the account menu (oop webext panels used WebRender already before).
Comment 8•4 years ago
|
||
(In reply to Andrew Osmond [:aosmond] from comment #6)
In my case, it looks like it is pulling something random of the texture cache to fill in the upper left hand gap. It often looks different when I reproduce.
If I open the account panel while something is moving or blinking, the "transparent" top-left quarter can be partly red and looks squished(?). It's more like a corrupted screenshot of the area behind it. Your screenshot has these red nuances, too.
Assignee | ||
Comment 9•4 years ago
|
||
That bug seems much more relevant. Let's see if I can reproduce that. Thanks darkspirit! I don't need to click like crazy but maybe I'm just lucky :). It almost always appears on the first click, and if not the first, then the second.
Comment 10•4 years ago
|
||
"Enabling WebRender for this widget type" must not be the cause. We have to find different STR to get to the bottom of it. ^^
bug 1650246 has an older sister that I saw on KDE long before: bug 1502519
Updated•4 years ago
|
Comment 11•4 years ago
|
||
Oh. I found out that it can also occur with bookmark, library and main menus as well.
Attached screenshot, build from May:
mozregression --launch 2020-05-04 --pref gfx.webrender.all:true layers.gpu-process.enabled:false -a https://keithclark.co.uk/labs/css-fps/nojs/
While testing, the library menu could either
a) be fine,
b) be transparent with red corruption as in attached screenshot, or
c) cause all rendering to hang for a second, then show up, but be totally transparent on the left - almost like attached screenshot, but without the red glitch.
Comment 12•4 years ago
•
|
||
But in previous comment, the library menu is apparently Basic, not WebRender. gfx.webrender.debug.new-frame-indicator does not show up.
Edit: No it's OpenGL, layers.acceleration.draw-fps does show up.
Comment 13•4 years ago
|
||
mozregression --launch 2020-05-04 --pref gfx.webrender.all:true layers.gpu-process.enabled:false -a https://keithclark.co.uk/labs/css-fps/nojs/
While in one session it can be easier to reproduce, in others it's impossible.
Now I have sucessfully raped the About Nightly window (a WebRender window) on Gnome XWayland, by repeatedly maximizing/unmaximizing and changing window size, which is bug 1502519, and proven that it's not restricted to KDE. These red glitches look similar to the ones at library and account menu.
Comment 11 to comment 13 may be offtopic, I don't know, I just wanted to mention it.
Comment 14•4 years ago
|
||
Comment 13 is still reproducible with latest Nightly: Updated bug 1502519 with screencast.
Updated•4 years ago
|
Comment 16•4 years ago
|
||
@aosmond: reminder :)
Updated•4 years ago
|
Comment 18•4 years ago
|
||
Also happens with extension popup windows. Second video from me: https://youtu.be/_I1MsTzthiU
Assignee | ||
Comment 19•4 years ago
|
||
We are deferring Linux MVP another release; I haven't been able to prioritize this work but we can't ship without it fixed.
Comment 22•4 years ago
|
||
This one looks like WR specific, I can't reproduce it on GL backend. It's also reproducible in debug builds so it does not look like a race condition or some timing issue. I can reproduce it on GLX backend only, EGL backend is OK.
Comment 23•4 years ago
|
||
I used qapitrace and traced the textures/buffers used here. It looks like more deep problem with popup rendering via. WebRender. What we see on the screens here is just backbuffer (perhaps ATI does not clear it).
The issue is that the popup is drawn in two or more steps when the part of the popup is drawn in first phase, then we swap buffers and draw rest of the popup and sometimes a part of the popup is missing and/or we see backbuffer content from the previous step.
This is not a problem with source textures - source texture with the popup content is created correctly.
The corruption happens at draw_frame/end_frame time when the popup texture is rendered to backbuffer - for instance draw_frame/pass5/framebuffer/Composite clears and update only part of the back buffer.
I'm afraid that's all I can help here as the rust stuff is beyond my scope.
Comment 24•4 years ago
|
||
Backbuffer content after first frame of popup rendering on radeon drivers, taken from apitrace.
Updated•4 years ago
|
Comment 26•4 years ago
|
||
(In reply to Martin Stránský [:stransky] from comment #22)
This one looks like WR specific, I can't reproduce it on GL backend. It's also reproducible in debug builds so it does not look like a race condition or some timing issue. I can reproduce it on GLX backend only, EGL backend is OK.
- EGL became less reproducible, but with llvmpipe (bug 1650246 comment 13) I can still reproduce attachment 9161212 [details] (bug 1650246 comment 3).
- GLX/OpenGL: attachment 9167186 [details] from comment 11 actually shows an OpenGL panel (just checked with layers.acceleration.draw-fps)
Updated•4 years ago
|
Assignee | ||
Comment 27•4 years ago
|
||
Given we see it most often with GLX, I am exploring using GLX_EXT_buffer_age to decide whether we need to do a full render or not. It is possible there are multiple related problems since we already use EGL_EXT_buffer_age today... for the problem I reproduce regularly, I only see it with GLX.
Assignee | ||
Comment 28•4 years ago
|
||
When swapping buffers with a GLX context, the back buffer may no longer
be valid. We can determine this by checking the buffer age via the
GLX_EXT_buffer_age extension, which is similar to the EGL_EXT_buffer_age
we use with EGL. If it is 0, then we must redraw the entire frame.
Updated•4 years ago
|
Assignee | ||
Comment 29•4 years ago
|
||
Upon further examination and discussions with jnicol/kvark, I believe:
- The EGL implementation on Linux would also suffer from problems if the buffer age was 2+ (might explain why we see it less often there).
- Since partial present is disabled on Linux, it is supposed to be always doing a full redraw, but clearly it isn't.
So I have a few bugs to fix, which involves:
- Splitting out partial present support such that we can let WR know what areas it needs to redraw, even if the compositor itself doesn't support a set of damage rects when it does the swap
- Ensure that if we don't support the above with buffer age, then we always do a full redraw.
Updated•4 years ago
|
Assignee | ||
Comment 30•4 years ago
|
||
Similar to bug 1280653, it appears that GLX invalidates the back buffer
while we are drawing. The only indication we get of this are resize and
configure events from X. We suppress the configure event for popups for
various reasons, so this patch explicitly generates a forced recomposite
of the frame. It does it immediately so that most of the time it should
beat the presentation of the buffer and avoid displaying of the bad
frame to the user; popups generally are not complicated and should have
plenty of budget to perform the second composite.
Assignee | ||
Updated•4 years ago
|
Comment 31•4 years ago
|
||
Gnome XWayland, Debian Testing, Intel HD Graphics 630 (KBL GT2), Mesa 20.1.9
If WebRender is enabled, I can reproduce this bug (top left quarter of the library menu is transparent) by clicking on the library button, then hovering the main menu button:
mozregression --launch 2020-10-19 --pre gfx.webrender.all:true -a about:blank
The bug does not occur if the GPU process is enabled.
With try build from comment 30 I am still able to reproduce it with WebRender (top left quarter is transparent) and OpenGL (top left quarter is colorfully corrupted after playing a video on YouTube):
mozregression --repo try --launch dd9b61a3d09f8f3dbaf7a885b2815e1696332d6e --pref gfx.webrender.force-disabled:true layers.acceleration.force-enabled:true -a about:blank
Comment 32•4 years ago
|
||
Comment 33•4 years ago
|
||
bugherder |
Comment 34•4 years ago
|
||
Since the status are different for nightly and release, what's the status for beta?
For more information, please visit auto_nag documentation.
Updated•4 years ago
|
Comment 35•4 years ago
|
||
I just tested the autoland build (changeset 99dcd4e9ee56) with mozregression and I still see this bug with the default settings (WebRender enabled and GPU process disabled), e.g. when I click on the library button. Am I missing something here?
Assignee | ||
Comment 36•4 years ago
|
||
I'm investigating the reports of this still happening. Managed to reproduce. It seems timely related, so if the forced composite comes in a bit later, it seems to avoid the problem.
Updated•4 years ago
|
Updated•4 years ago
|
Comment 37•4 years ago
|
||
With Firefox 85 build 20201118215158 and EGL the top left corner transparency issue doesn't happen, but this issue still happens as before: https://youtu.be/jghvl0R31LQ
Comment 38•4 years ago
|
||
Comment 39•4 years ago
|
||
We are tracking this issue now in bug 1676164, please check if this is what you can observe.
Comment 40•4 years ago
|
||
I could reproduce this issue using old Nightly from 2020-07-30 using Ubuntu 18.04 with AMD HD 5450 gpu and gfx.webrender.all
true
and layers.gpu-process.enabled
false
. I also verified that using beta Firefox 84.0b3 this does not reproduce, although there still is a glimpse of bad rendering but only for a fraction of a second if clicking fast enough on toolbar tools but it recovers very fast. Is this acceptable in your opinion?
Assignee | ||
Comment 41•4 years ago
|
||
(In reply to Bogdan Maris [:bogdan_maris], Release Desktop QA from comment #40)
I could reproduce this issue using old Nightly from 2020-07-30 using Ubuntu 18.04 with AMD HD 5450 gpu and
gfx.webrender.all
true
andlayers.gpu-process.enabled
false
. I also verified that using beta Firefox 84.0b3 this does not reproduce, although there still is a glimpse of bad rendering but only for a fraction of a second if clicking fast enough on toolbar tools but it recovers very fast. Is this acceptable in your opinion?
Unfortunately yes. Ideally we would get rid of that flash too, but as this is a race condition with X (see bug 1280653 for a more indepth analysis with the OpenGL compositor), it might not be trivial to do so.
Comment 42•4 years ago
|
||
(In reply to Andrew Osmond [:aosmond] from comment #41)
(In reply to Bogdan Maris [:bogdan_maris], Release Desktop QA from comment #40)
I could reproduce this issue using old Nightly from 2020-07-30 using Ubuntu 18.04 with AMD HD 5450 gpu and
gfx.webrender.all
true
andlayers.gpu-process.enabled
false
. I also verified that using beta Firefox 84.0b3 this does not reproduce, although there still is a glimpse of bad rendering but only for a fraction of a second if clicking fast enough on toolbar tools but it recovers very fast. Is this acceptable in your opinion?Unfortunately yes. Ideally we would get rid of that flash too, but as this is a race condition with X (see bug 1280653 for a more indepth analysis with the OpenGL compositor), it might not be trivial to do so.
Got it, I'm going to close this bug as verified for now.
Updated•3 years ago
|
Description
•