Closed Bug 1844627 (webgpu-v1-windows-blocklist-intermittent) Opened 1 year ago Closed 1 month ago

`WebGPU is disabled by blocklist` in Windows CI

Categories

(Core :: Graphics: WebGPU, defect, P1)

defect

Tracking

()

RESOLVED FIXED

People

(Reporter: ErichDonGubler, Assigned: ErichDonGubler)

References

(Blocks 2 open bugs)

Details

Attachments

(1 obsolete file)

There appears to be an odd variance in Windows CI environments where WebGPU does not run, and the diagnostic emitted blames our blocklist:

EXCEPTION: WebGPU device failed to initialize with NotSupportedError "WebGPU is disabled by blocklist."; not retrying

When this error is encountered, all WebGPU CTS tests in the job fail. EDIT: Actually, it appears that test runs can recover from this. 🤔⁉️

No longer blocks: 1818918
See Also: → 1818918
Duplicate of this bug: 1828552
Duplicate of this bug: 1827741
Blocks: 1838695
No longer blocks: 1827741, 1828552
Duplicate of this bug: 1851402

The severity field for this bug is set to S3. However, the following bug duplicate has higher severity:

:jimb, could you consider increasing the severity of this bug to S2?

For more information, please visit BugBot documentation.

Flags: needinfo?(jimb)
Blocks: 1859780
No longer blocks: webgpu-v1-cts-blockers

Raising the priority on this to P1: "CTS impact widespread enough that it obscures other issues", based on Matrix discussion with EDG.

Leaving the severity as S3, per WebGPU project policy.

Flags: needinfo?(jimb)
Priority: P2 → P1
Duplicate of this bug: 1898326
Assignee: nobody → egubler
Status: NEW → ASSIGNED

Not sure that I have a strong lead on resolving this yet, but I did want to note an interesting observation: It appears we may legitimately have different graphics adapters. In most cases (where this issue isn't happening), a log message is occasionally printed by glean with the following text:

[WARN  glean_core::error_recording] gfx.adapter.primary.driver_files: Value length 811 exceeds maximum of 100

However, in tests that do run into this issue, there is no such thing, implying that the set of driver files is small enough to fit within the limit.

One way to debug this would be to start with that error message:

EXCEPTION: WebGPU device failed to initialize with NotSupportedError "WebGPU is disabled by blocklist."; not retrying

and then work backwards to find the sufficient conditions for getting that message, and add logging at each point so we can see in detail how we're arriving at that message. For example, it sure would be nice to know which blocklist entry we're hitting.

This would entail repeated submissions to the try server, so it would need to be a background project.

Attachment #9403645 - Attachment is obsolete: true

A patch stack now exists against dependency bug 1899536. Once it's landed, this should be resolved. 🙌🏻

Status: ASSIGNED → RESOLVED
Closed: 1 month ago
Resolution: --- → FIXED
Blocks: webgpu-v1-cts-blockers
No longer blocks: webgpu-triage
Duplicate of this bug: 1873505
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: