GPU Process hang and crash when browsing Imgur.com in Nightly with Radeon
Categories
(Core :: Graphics, defect, P1)
Tracking
()
Tracking | Status | |
---|---|---|
firefox-esr91 | --- | unaffected |
firefox100 | --- | unaffected |
firefox101 | + | disabled |
firefox102 | + | fixed |
People
(Reporter: lh.bennett, Assigned: sotaro)
References
(Blocks 2 open bugs, Regression, )
Details
(Keywords: crash, hang, regression)
Attachments
(1 file)
12.79 KB,
text/plain
|
Details |
User Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Firefox/102.0
Steps to reproduce:
This is probably filed somewhere, even though I can't find exactly what bug it is.
Over the last couple of weeks, browsing on Imgur.com can cause a GPU hang and crash.
Actual results:
Visit Imgur.com and browse, even click on different image/video pages. Occasionally the GPU Process will hard lock, CPU usage will climb to about 30% and will stay there. Controls will no longer be functional. I can get it to auto recover if I resize the window. Otherwise, it will stay that way for as long as it's active.
Even worse, sometimes it can take the driver instance with it, causing the OS to terminate the graphics driver runtime and recover.
Expected results:
The GPU process shouldn't hard lock and prevent nearly all interaction with the window.
Reporter | ||
Comment 1•3 years ago
|
||
Reporter | ||
Updated•3 years ago
|
Comment 2•3 years ago
|
||
The Bugbug bot thinks this bug should belong to the 'Core::Graphics' component, and is moving the bug to that component. Please correct in case you think the bot is wrong.
Comment 3•3 years ago
|
||
Are you able to capture a profile using the "Graphics" setting in the profiler? See https://profiler.firefox.com/. Thanks!
Comment 4•3 years ago
|
||
Also, are there any crash reports under about:crashes
for the GPU process?
Comment 5•3 years ago
|
||
Also, just a thought, bug 1763280 landed recently which affected video performance and your driver is affected. If you flip media.wmf.no-copy-nv12-textures-force-enabled
to true, does it make a difference?
Reporter | ||
Comment 6•3 years ago
|
||
Breakpad does not catch the hang of GPU Process. I suspect It's because of the recovery of the instead of a full termination.
I will try those suggestions and report back.
Reporter | ||
Comment 7•3 years ago
|
||
I got lucky today, not only did Breakpad catch a crash this time, but it took out the driver, forcing a restart which killed the profiler. Tried again and got another profile.
https://share.firefox.dev/39SaSxL
https://crash-stats.mozilla.org/report/index/dea967c2-21b3-4340-b0cc-e349d0220509
Reporter | ||
Comment 8•3 years ago
|
||
Just to note, I cannot reproduce this crash with pref 'media.wmf.no-copy-nv12-textures-force-enabled' set to true.
Comment 9•3 years ago
|
||
(In reply to Leman Bennett [Omega] from comment #8)
Just to note, I cannot reproduce this crash with pref 'media.wmf.no-copy-nv12-textures-force-enabled' set to true.
Awesome, thanks.
Comment 10•3 years ago
|
||
(In reply to Leman Bennett [Omega] from comment #7)
I got lucky today, not only did Breakpad catch a crash this time, but it took out the driver, forcing a restart which killed the profiler. Tried again and got another profile.
https://share.firefox.dev/39SaSxL
https://crash-stats.mozilla.org/report/index/dea967c2-21b3-4340-b0cc-e349d0220509
Unfortunately that crash is in the content process, and the profiler recording only captured the tail end of the graphics threads. Probably not enough information. Would you mind retrying? Thanks!
Updated•3 years ago
|
Updated•3 years ago
|
Comment 11•3 years ago
|
||
Set release status flags based on info from the regressing bug 1763280
Reporter | ||
Comment 12•3 years ago
|
||
(In reply to Andrew Osmond [:aosmond] (he/him) from comment #10)
(In reply to Leman Bennett [Omega] from comment #7)
I got lucky today, not only did Breakpad catch a crash this time, but it took out the driver, forcing a restart which killed the profiler. Tried again and got another profile.
https://share.firefox.dev/39SaSxL
https://crash-stats.mozilla.org/report/index/dea967c2-21b3-4340-b0cc-e349d0220509
Unfortunately that crash is in the content process, and the profiler recording only captured the tail end of the graphics threads. Probably not enough information. Would you mind retrying? Thanks!
I see what's going on. Unfortunately, I need some advice to get a capture. The browser has to be responsive in order to use the profiler. When the GPU Process locks to an unresponsive 30% CPU, I cannot interact with the browser at all. Pressing CTRL+SHIFT+2 does nothing until I kill the GPU Process, which erases that part of the capture. I tried using WinDBG, but it did not go well. I also tried waiting it out, but the profiler exits with an OOM error after the GPU Process recovers.
Also, I kept a version of GPU-Z active just to see if the GPU is affected, and it stays completely nominal. UVD clocks will spike on occasion, but that's due to the videos loading. Without any activity, the clocks are flat.
Assignee | ||
Comment 13•3 years ago
|
||
Bug 1763280 seems not to related to the regression. Bug 1763280 enables zero video frame copy only on Intel GPU. Reporter's GPU was AMD Radeon.
Assignee | ||
Comment 14•3 years ago
|
||
(In reply to Leman Bennett [Omega] from comment #8)
Just to note, I cannot reproduce this crash with pref 'media.wmf.no-copy-nv12-textures-force-enabled' set to true.
If it addressed the problem. Bug 1766282 and Bug 1767212 might address the problem. Pref is going to be changed to 'media.wmf.zero-copy-nv12-textures-force-enabled'.
Assignee | ||
Updated•3 years ago
|
Assignee | ||
Comment 15•3 years ago
|
||
(In reply to Sotaro Ikeda [:sotaro] from comment #13)
Bug 1763280 seems not to related to the regression. Bug 1763280 enables zero video frame copy only on Intel GPU. Reporter's GPU was AMD Radeon.
I wonder if Bug 1758601 could be an actual regression bug.
Comment 16•3 years ago
|
||
The bug has a release status flag that shows some version of Firefox is affected, thus it will be considered confirmed.
Comment 17•3 years ago
|
||
Given that Fx101 goes to RC in a week, is there something we should be considering preffing off on Beta before then?
Comment 18•3 years ago
|
||
Changing the priority to P1 as the bug is tracked by a release manager for the current beta.
See Triage for Bugzilla for more information.
If you disagree, please discuss with a release manager.
Assignee | ||
Comment 19•3 years ago
|
||
(In reply to Ryan VanderMeulen [:RyanVM] from comment #17)
Given that Fx101 goes to RC in a week, is there something we should be considering preffing off on Beta before then?
Video overlay is enabled only on nightly on non-Intel GPU on current m-c. It does not go to beta.
Do we need to preffing it off also in this case?
Comment 20•3 years ago
|
||
(In reply to Sotaro Ikeda [:sotaro] from comment #19)
Video overlay is enabled only on nightly on non-Intel GPU on current m-c. It does not go to beta.
If it's disabled by default for Beta, we're good for now. Can we please make sure this bug is set blocking whatever bug is tracking letting non-Intel GPUs ride the trains?
Assignee | ||
Comment 21•3 years ago
|
||
(In reply to Ryan VanderMeulen [:RyanVM] from comment #20)
(In reply to Sotaro Ikeda [:sotaro] from comment #19)
Video overlay is enabled only on nightly on non-Intel GPU on current m-c. It does not go to beta.
If it's disabled by default for Beta, we're good for now. Can we please make sure this bug is set blocking whatever bug is tracking letting non-Intel GPUs ride the trains?
Created bug 1769643 for it.
Comment 22•3 years ago
|
||
The severity field for this bug is set to S3. However, the bug is marked as tracked for firefox102 (nightly) and tracked for firefox101 (beta).
:bhood, could you consider increasing the severity of this tracked bug?
For more information, please visit auto_nag documentation.
Updated•3 years ago
|
Updated•3 years ago
|
Assignee | ||
Comment 23•3 years ago
|
||
:Omega, can you check if the problem is addressed on latest nightly? Bug 1767212 might address the problem.
Reporter | ||
Comment 24•3 years ago
|
||
(In reply to Sotaro Ikeda [:sotaro] from comment #23)
:Omega, can you check if the problem is addressed on latest nightly? Bug 1767212 might address the problem.
It's been a driver upgrade and a day trying to recreate the hang/crash. But so far, I have not been able to reproduce the bug.
Comment 25•2 years ago
|
||
Set release status flags based on info from the regressing bug 1763280
Comment 26•2 years ago
|
||
Leman, does this behavior persist, or would you considered it resolved?
Reporter | ||
Comment 27•2 years ago
|
||
(In reply to Bob Hood from comment #26)
Leman, does this behavior persist, or would you considered it resolved?
I would consider it resolved. I haven't seen the issue come back.
Updated•2 years ago
|
Updated•2 years ago
|
Description
•