Closed Bug 1157764 Opened 10 years ago Closed 10 years ago

Video (presumably accelerated) causes a large increase in TDRs

Categories

(Core :: Audio/Video, defect)

x86
Windows
defect
Not set
normal

Tracking

()

RESOLVED INCOMPLETE
Tracking Status
firefox37 - wontfix
firefox38 + wontfix
firefox38.0.5 + wontfix
firefox39 + wontfix
firefox40 + wontfix
firefox41 + wontfix

People

(Reporter: bas.schouten, Unassigned)

References

Details

Attachments

(3 files)

There's a lot of information that suggests video causes an increase in TDRs on some hardware with some fairly recent drivers (drivers on which D3D11 and D2D seem to be just fine!). We should start figuring out which drivers those are and start blacklisting DXVA on them to see if that improves the situation. Not only are we not great at recovering from TDRs, some other software is quite sensitive to it as well, and I've seen it lead to BSODs too. We should keep causing them down to a minimum. One example of a bug with a user explicitly complaining about TDRs in Video where neither D3D11 or D2D are blacklisted: https://crash-stats.mozilla.com/report/index/5e2e363f-20d8-4195-ae5e-b7fa22150423
Anthony, is this actionable, should we track it?
Flags: needinfo?(ajones)
(In reply to Milan Sreckovic [:milan] from comment #1) > Anthony, is this actionable, should we track it? I don't have any objection to aggressively or even speculatively blacklisting drivers for DXVA. Can you just go ahead and do that or do you want some involvement from me?
Flags: needinfo?(matt.woodrow)
That sounds fine with me too, DXVA isn't *that* important, so we should blacklist anything that looks risky.
Flags: needinfo?(matt.woodrow)
Should we go to the other extreme and whitelist instead?
So for getting data on how much this improves crashiness, the pref we're talking about here is media.windows-media-foundation.use-dxva, is that correct media guys? :)
I'm going to put up a patch to disable DXVA by disabling this pref. According to offline conversations, media+graphics and some user accounts are telling us DXVA is causing a lot of TDRs and we just can't recover from them. Once up, I'm going to ask for the 38 and 39 uplift, and get it done today.
Approval Request Comment [Feature/regressing bug #]: [User impact if declined]: [Describe test coverage new/current, TreeHerder]: [Risks and why]: [String/UUID change made/needed]: A lot of TDR crashes. DXVA seems to be correlated with getting them.
Flags: needinfo?(ajones)
Attachment #8598720 - Flags: review?(bas)
Attachment #8598720 - Flags: approval-mozilla-beta?
Attachment #8598720 - Flags: approval-mozilla-aurora?
Attachment #8598720 - Flags: review?(bas) → review+
Comment on attachment 8598720 [details] [diff] [review] Nightly patch only. Disable dxva by default. r=bschouten [Triage Comment] Should be in 38 beta 9.
Attachment #8598720 - Flags: approval-mozilla-release+
Attachment #8598720 - Flags: approval-mozilla-beta?
Attachment #8598720 - Flags: approval-mozilla-aurora?
Attachment #8598720 - Flags: approval-mozilla-aurora+
Attachment #8598720 - Attachment description: Disable dxva by default. r=bschouten → Nightly patch only. Disable dxva by default. r=bschouten
OS: Unspecified → Windows
Hardware: Unspecified → x86
Attachment #8598741 - Flags: review?(bas) → review+
Attachment #8598742 - Flags: review?(bas) → review+
I don't love this approach, since it seems like a dead end. How are we going to get DXVA enabled again, or is this intended are a temporary measure? A whitelist sounds nice, but I don't see any easy way to populate it. We don't really have the resources to manually stress test all the configurations in use by our users. We also have evidence that disabling DXVA can *increase* the number of crashes, see bug 1146313. I'd much prefer if we just aggressively blacklist driver versions that crash a lot of youtube URLs, and then blacklist layers as well if they continue to crash.
Comment on attachment 8598720 [details] [diff] [review] Nightly patch only. Disable dxva by default. r=bschouten Review of attachment 8598720 [details] [diff] [review]: ----------------------------------------------------------------- I can't let this happen. I'm OK with disabling DXVA drivers/hardware that is suspected of causing a problem, but we simply can't just disable it wholesale.
Attachment #8598720 - Flags: review+ → review-
Why not? Are you saying that the non-DXVA codepath is perceivably worse for users than DXVA?
Asking for tracking as this blocks a tracked graphics bug 1116812.
(In reply to Benjamin Smedberg [:bsmedberg] from comment #13) > Why not? Are you saying that the non-DXVA codepath is perceivably worse for > users than DXVA? Yes. Some machines, notably older ones and those with some Atom CPUs, can't smoothly play videos with resolutions greater than 480p without hardware decoding. Battery drain is pretty bad without GPU decoding too.
Too bad we are not taking this change... bug 1116812 is impacting the 38 release (it is the most important source of crash).
(In reply to Sylvestre Ledru [:sylvestre] from comment #16) > Too bad we are not taking this change... > bug 1116812 is impacting the 38 release (it is the most important source of > crash). It is also possible this is unrelated, and that DXVA off does not reduce the number of driver resets. We don't actually know, so this change was going to be speculative at best. The change is in A/V area, even thought the code is in gfx/, so Anthony makes a call (we don't seem to track a "video" module owner.) See comment 12 and comment 15.
Comment on attachment 8598741 [details] [diff] [review] Aurora patch. Disable dxva by default. r=bschouten Removing Bas' r+, as we got an r- on the original patch.
Attachment #8598741 - Flags: review+
Comment on attachment 8598742 [details] [diff] [review] Beta patch. Disable dxva by default. r=bschouten Removing Bas' r+, as we got an r- on the original patch.
Attachment #8598742 - Flags: review+
Comment on attachment 8598720 [details] [diff] [review] Nightly patch only. Disable dxva by default. r=bschouten Removing sledru's aurora(39) and beta (38) approval, because of an r-
Attachment #8598720 - Flags: approval-mozilla-release+
Attachment #8598720 - Flags: approval-mozilla-aurora+
I still think we should land this on nightly for a couple of days to get data on how much of the TDRs this is responsible for.
(In reply to Bas Schouten (:bas.schouten) from comment #21) > I still think we should land this on nightly for a couple of days to get > data on how much of the TDRs this is responsible for. If you will collect the data from that and make it comparable, sure. I don't trust that I'd be able to make enough heads or tails from the low volume and noise from random checkins that we see on nightly.
I'm OK with testing it in nightly. It will cause a fairly serious regression for people with old machines (less common in nightly) but it is worth the pain.
Don't we keep getting bit by the fact that the beta (and even release) population have hardware/driver combinations that we didn't see on earlier channels?
We decided we're going to run the "experiment" instead. Bas & Smedberg will sort it out.
Tracking for Firefox41. n-i to Bas. Do we have a fix for Firefox41? There hasn't been any activity on the bug in 2-3 weeks.
Flags: needinfo?(bas)
(In reply to Ritu Kothari (:ritu) from comment #26) > Tracking for Firefox41. n-i to Bas. Do we have a fix for Firefox41? There > hasn't been any activity on the bug in 2-3 weeks. I need to talk to Benjamin and get the experiment sorted out, I will do so somewhere this week.
Flags: needinfo?(bas)
Too late for 38.0.5 but happy to take a fix for 39.
Milan is this a priority at this point? It sounds like a pretty bad problem, on the other hand I know we have others we're addressing. Benjamin is this part of the early startup work you were mentioning lately?
Flags: needinfo?(milan)
Flags: needinfo?(benjamin)
I'll let Milan comment.
Flags: needinfo?(benjamin)
I'm starting to think this is not and won't ever be actionable. Whatever experiments we may do, it won't be under this bug. Please reopen if you disagree...
Status: NEW → RESOLVED
Closed: 10 years ago
Flags: needinfo?(milan)
Resolution: --- → INCOMPLETE
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: