Closed Bug 1749324 Opened 2 years ago Closed 2 years ago

nvidia-vaapi-driver: Crash in [@ __GI___sched_get_priority_max]

Categories

(Core :: Security: Process Sandboxing, defect, P3)

Unspecified
Linux
defect

Tracking

()

RESOLVED WORKSFORME
Tracking Status
firefox98 --- disabled

People

(Reporter: gsvelto, Assigned: jld)

References

(Blocks 1 open bug, Regression)

Details

(Keywords: crash, regression)

Crash Data

Crash report: https://crash-stats.mozilla.org/report/index/be4ac4b5-0245-4763-a024-54e990220110

Reason: SIGSYS / 0x00000001

Top 10 frames of crashing thread:

0 libc.so.6 __GI___sched_get_priority_max 
1 libcuda.so.1 <.text ELF section in libcuda.so.495.46> 
2 libcuda.so.1 <.text ELF section in libcuda.so.495.46> 
3 libcuda.so.1 cudbgApiInit 
4 libcuda.so.1 <.init ELF section in libcuda.so.495.46> 
5 ld-linux-x86-64.so.2 call_init 
6 ld-linux-x86-64.so.2 _dl_init 
7 libc.so.6 __GI__dl_catch_exception 
8 ld-linux-x86-64.so.2 dl_open_worker 
9 libc.so.6 __GI__dl_catch_exception 

These are obviously calls to sched_get_priority_max() (syscall 142). They seem to come from deep within the CUDA libraries (cringe) being pulled in by the VA-API code.
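
For anyone not familiar with how these show up as crashes: the RDD process runs under a seccomp-bpf filter, and a syscall the policy doesn't allow gets turned into SIGSYS. A minimal standalone sketch (not Firefox's actual sandbox code, and with the architecture check omitted) that reproduces the same failure mode:

/* sandbox_sigsys.c - install a tiny seccomp-bpf filter that traps
 * sched_get_priority_max(), then call it; the process dies with SIGSYS,
 * matching the crash signature above. */
#include <stddef.h>
#include <stdio.h>
#include <sched.h>
#include <sys/prctl.h>
#include <sys/syscall.h>
#include <linux/filter.h>
#include <linux/seccomp.h>

int main(void)
{
    struct sock_filter filter[] = {
        /* Load the syscall number. */
        BPF_STMT(BPF_LD | BPF_W | BPF_ABS, offsetof(struct seccomp_data, nr)),
        /* Trap (SIGSYS) on sched_get_priority_max, allow everything else. */
        BPF_JUMP(BPF_JMP | BPF_JEQ | BPF_K, __NR_sched_get_priority_max, 0, 1),
        BPF_STMT(BPF_RET | BPF_K, SECCOMP_RET_TRAP),
        BPF_STMT(BPF_RET | BPF_K, SECCOMP_RET_ALLOW),
    };
    struct sock_fprog prog = {
        .len = sizeof(filter) / sizeof(filter[0]),
        .filter = filter,
    };

    prctl(PR_SET_NO_NEW_PRIVS, 1, 0, 0, 0);
    prctl(PR_SET_SECCOMP, SECCOMP_MODE_FILTER, &prog);

    /* Dies here with SIGSYS. */
    printf("%d\n", sched_get_priority_max(SCHED_FIFO));
    return 0;
}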

Assignee: nobody → jld
Priority: -- → P1

Adding a regressed-by since this is spiking now.

Regressed by: 1745225
Has Regression Range: --- → yes

32097 crashes off a single install on just one day O.o
Looks like a prime example of bug 1746232

All crashes occurred with Nvidia.
https://github.com/elFarto/nvidia-vaapi-driver crashes the RDD process when used without the MOZ_DISABLE_RDD_SANDBOX=1 env var.
Duplicate of bug 1748460.

Blocks: 1748460
Type: defect → enhancement
Summary: Crash in [@ __GI___sched_get_priority_max] → nvidia-vaapi-driver: Crash in [@ __GI___sched_get_priority_max]
Type: enhancement → defect

Regressed by: bug 1745225

Here they are trying to add AV1 support: https://github.com/elFarto/nvidia-vaapi-driver/issues/31

I'm going to take a look at changing the library to fail init if it detects that it's running in the sandbox. The only issue I can see with that is that the CUDA library runs before any of our code does, so I'll need to take a close look and make sure we're not linking directly against libcuda.
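
For illustration, one way such a check could look, assuming /proc/self/status is still readable from inside the sandbox (this is just a sketch, not necessarily the check that ends up shipping):

/* Hypothetical sketch of a "refuse to initialize inside the sandbox" check.
 * The Seccomp field of /proc/self/status is 2 when a seccomp-bpf filter
 * (such as Firefox's RDD sandbox) is active. */
#include <stdbool.h>
#include <stdio.h>

static bool running_under_seccomp_filter(void)
{
    FILE *f = fopen("/proc/self/status", "r");
    if (!f)
        return false; /* can't tell, assume unsandboxed */

    char line[256];
    bool filtered = false;
    while (fgets(line, sizeof(line), f)) {
        int mode;
        if (sscanf(line, "Seccomp: %d", &mode) == 1) {
            filtered = (mode == 2); /* 2 == SECCOMP_MODE_FILTER */
            break;
        }
    }
    fclose(f);
    return filtered;
}

The driver's init entry point could then bail out early with an error (so vaInitialize fails cleanly) instead of tripping the sandbox later when libcuda is loaded.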

I'm also somewhat confused as to how va_infoMessage ended up calling dlopen.

I've added a sandbox check in v0.0.3 of the library; I'll be releasing it shortly.

See Also: → 1746232
Severity: S2 → S3
Priority: P1 → --
Priority: -- → P3

Closing because no crashes have been reported for 12 weeks.

Status: NEW → RESOLVED
Closed: 2 years ago
Resolution: --- → WORKSFORME