Closed Bug 1480242 Opened 7 years ago Closed 7 years ago

Shield study enrollment is lower than it should be

Categories

(Core :: Graphics: WebRender, defect, P4)

63 Branch
defect

Tracking

()

RESOLVED DUPLICATE of bug 1495152

People

(Reporter: jrmuizel, Unassigned)

References

Details

Attachments

(1 file)

For example, of 1621 submissions that are enrolled in the treatment group 1007 end up with d3d11 and only 561 get WebRender
Priority: -- → P3
What does this mean? Are we planning to extend the time of the study? Do we need to tweak and start over?
Attached image power_curve.png β€”
Sorry if this is redundant but, just to state the analytics perspective, the fewer users that qualify, the harder it is to make a robust claim about small changes in the observed metrics; additionally, if the users that are not qualified for the experiment are biased in an unexpected way, that can make the results less generalizable, so it's interesting to understand why the numbers are lower than expected. I've attached the power curve from the pre-experiment power analysis illustrating the fraction change in the median that's observable as a function of the number of users in the treatment arm.
(With respect to remedies, extending the study duration is fine if we don't think the disqualifications are for an interesting reason.)
> jrmuizel: darkspirit: I noticed you marked your comments in bug 1480242 as obsolete. Why? I noticed a mistake. I didn't want to spam. Then I thought you might be filtering for the exact wrQualified and webrender statuses anyway afterwards, and that you might have forgotten it only in comment 0. It was hot, I had headaches and felt unsure, so I wanted to bring it out of sight. (In reply to Jan Andre Ikenmeyer [:darkspirit] from comment #5) > - unqualified Nvidia: dual GPUs: Intel is in use (bp-4c6f365f-0a13-4a47-8db8-292710180802) bp-4c6f365f-0a13-4a47-8db8-292710180802 > {"compositor":"d3d11","gpuProcess":{"status":"available"},"wrQualified":{"status":"blocked"},"webrender":{"status":"opt-in"} > {"prefflip-webrender-v1-1-1474484":{"branch":"treatment","type":"normandy-exp"} This example of "unqualified Nvidia: dual GPUs: Intel is in use" looks wrong to me because webrender has status "opt-in"[1] instead of "blocked"[2], so it might be more an example of >unqualified manually disabled in treatment<. Why would an unqualified user in treatment manually disable gfx.webrender.all.qualified which has no practical function? Is there still an app.normandy.startupExperimentPrefs.gfx.webrender.all.qualified;false pref around? (bug 1479649 comment 2) An example of "unqualified Nvidia: dual GPUs: Intel is in use" (d3d11 in treatment) might be rather bp-644068d6-a700-43b9-b652-cc07d0180807 because webrender is blocked [2]: > {"compositor":"d3d11","gpuProcess":{"status":"available"},"wrQualified":{"status":"blocked"},"webrender":{"status":"blocked"} > {"prefflip-webrender-v1-1-1474484":{"branch":"treatment","type":"normandy-exp"} [1] https://searchfox.org/mozilla-central/rev/51268dcbdff0f6f4a5cff7986df0f616efc5bcfd/gfx/thebes/gfxPlatform.cpp#2619 [2] https://searchfox.org/mozilla-central/rev/51268dcbdff0f6f4a5cff7986df0f616efc5bcfd/gfx/thebes/gfxPlatform.cpp#2640
So I looked some more at this. The majority of wrQualified users in the treatment group that have a d3d11 compositor report "opt-in" for webrender.status. My reading of the code suggests that this should only happen if the webrender pref is not set. Is there perhaps another reason why the pref is not being set as widely as we hoped?
Yesterday I tried to reproduce not being enrolled locally with a sample study from the staging server and was not able to. Michael is going to look at some alternative data to see if he sees any anomalies there.
Flags: needinfo?(mcooper)
Oh, sorry I wasn't clear. I wasn't planning on doing that analysis myself. It is beyond my skills with Spark and Telemetry analysis. I can summarize the data I mentioned though: Normandy emits Telemetry Events [0] about when it enrolls and unenrolls a user. My theory is that because the feature requires a restart before Webrender becomes active, the data between when Normandy enrolls a user and when they restart for the first time is polluting the data. In theory this noise could be removed by figuring out if the ping being analyzed was generated during the same Firefox session as when the user enrolled in the study.
Flags: needinfo?(mcooper)
I can help with that but I'm at onboarding this week so it'll be delayed; please let me know if you'd like me to take a look the week after next.
Blocks: 1485156
No longer blocks: 1485156
Depends on: 1485156
Priority: P3 → P4
Jeff -- is this bug still active or has it been overtaken by the newer study?
Flags: needinfo?(jmuizelaar)
Yeah. We can replace this bug with bug 1495152.
Status: NEW → RESOLVED
Closed: 7 years ago
Flags: needinfo?(jmuizelaar)
Resolution: --- → DUPLICATE
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: