Closed Bug 1681671 Opened 4 years ago Closed 4 years ago

Crash in [@ __memcpy_avx_unaligned | webrender::renderer::Renderer::draw_frame]

Categories

(Core :: Graphics: WebRender, defect, P3)

Desktop
Linux
defect

Tracking

()

RESOLVED FIXED
85 Branch
Tracking Status
firefox-esr78 --- unaffected
firefox83 --- unaffected
firefox84 --- wontfix
firefox85 --- fixed

People

(Reporter: aryx, Assigned: aosmond)

References

(Blocks 1 open bug, Regression)

Details

(Keywords: crash, regression)

Crash Data

Attachments

(1 file)

Bug 1632698 landed earlier this week to fix similar signatures. Andrew, please also take a look at this one (build ID 20201209213504).

Maybe Fission related. (DOMFissionEnabled=1)

Crash report: https://crash-stats.mozilla.org/report/index/356a3a4e-17fa-4d69-a1ce-f3a410201210

Reason: SIGSEGV /SEGV_MAPERR

Top 10 frames of crashing thread:

0 libc.so.6 __memcpy_avx_unaligned 
1 libxul.so webrender::renderer::Renderer::draw_frame gfx/wr/webrender/src/renderer.rs:5915
2 libxul.so webrender::renderer::Renderer::render_impl gfx/wr/webrender/src/renderer.rs:3488
3 libxul.so webrender::renderer::Renderer::render gfx/wr/webrender/src/renderer.rs:3244
4 libxul.so wr_renderer_render gfx/webrender_bindings/src/bindings.rs:614
5 libxul.so mozilla::wr::RendererOGL::UpdateAndRender gfx/webrender_bindings/RendererOGL.cpp:186
6 libxul.so mozilla::wr::RenderThread::UpdateAndRender gfx/webrender_bindings/RenderThread.cpp:492
7 libxul.so mozilla::wr::RenderThread::HandleFrameOneDoc gfx/webrender_bindings/RenderThread.cpp:326
8 libxul.so mozilla::detail::RunnableMethodImpl<mozilla::wr::RenderThread*, void  xpcom/threads/nsThreadUtils.h:1201
9 libxul.so base::MessagePumpDefault::Run ipc/chromium/src/base/message_pump_default.cc:35
Flags: needinfo?(aosmond)

There are a few similar signatures floating around, we see it in beta too without fission. I'm in the process of fixing some telemetry/crash report annotations which may shed light on whether an NVIDIA or innocent device reset happened before the crash.

Blocks: wr-nv-linux
Severity: -- → S3
OS: Unspecified → Linux
Priority: -- → P3
Hardware: Unspecified → Desktop

These signatures began in build 20201108093650:

https://hg.mozilla.org/mozilla-central/pushloghtml?fromchange=e40cc6272439&tochange=d366215bf9af5dd06c27f8b36d875553838d2fe0

But that was a Sunday daily, probably fairly low uptake. Earlier builds would include:

Bug 1675453 seems of particular note, since it enabled VAAPI by default. We should probably hook hardware video decoding into the gfx blocklist. This would allow us to run a test on nightly to see if it is indeed related without disabling HW acceleration for all users.

Reviewing

https://crash-stats.mozilla.org/report/index/6b16cc93-2d3d-49c4-8626-932720201212#tab-metadata

I can see DeviceResetReason is 10, which corresponds to the NVIDIA video memory reset as the last known reset. So it appears our current solution is inadequate for handling it.

Flags: needinfo?(aosmond)

I think we might need to do:

https://searchfox.org/mozilla-central/rev/95cf843de977805a3951f9137f5ff1930599d94e/gfx/webrender_bindings/RenderThread.cpp#867

for the NVIDIA video memory resets, and implement RenderDMABUFTextureHost::ClearCachedResources.

Assignee: nobody → aosmond
Status: NEW → ASSIGNED
Keywords: regression
Regressed by: 1675453
Has Regression Range: --- → yes

This should also fix issues with device resets for non-NVIDIA users as well who experience a full device reset who have DMABuf support turned on.

Pushed by aosmond@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/f78f7ee4f267 Ensure we clear cached DMABuf textures during an NVIDIA video memory device reset. r=sotaro
Status: ASSIGNED → RESOLVED
Closed: 4 years ago
Resolution: --- → FIXED
Target Milestone: --- → 85 Branch
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: