Closed Bug 1574618 Opened 5 years ago Closed 4 years ago

Support for pref-flip study to asses Fenix tracking protection modes

Categories

(Data Science :: Experiment Collaboration, task)

x86_64
Windows 10
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: esmyth, Assigned: ccd)

References

Details

Brief Description of the request (required):

The Fenix team would like to understand how tracking protection modes impact page load performance and how a change in the default mode may impact user engagement. To inform this decision, we intend to conduct an experiment to test the hypothesis that Tracking Protection has perceptibly faster page loads than Enhanced Tracking Protection and that changes to the default mode will measurably impact user engagement.

Business purpose for this request (required):

There are product benefits from both modes. However we need to better understand how the modes affect user behavior, especially whether a change in the default behavior would impact 7-day retention of new users, churn of existing users, and user engagement (e.g., total_uri_count or subsession_length).

Requested timelines for the request or how this fits into roadmaps or critical decisions (required):

Start the experiment on or after September 19

Links to any assets (e.g Start of a PHD, BRD; any document that helps describe the project):

Name of Data Scientist (If Applicable):

Corey Dow-Hygelund

Assignee: nobody → cdowhygelund
Status: NEW → ASSIGNED

Reference material regarding mobile experiments and Mako.

See Also: → 1582828

Sample size analysis for new user retention and churn has been calculated. Sampling 23K clients, which is ~30% of available experiment population, yields:

  • 7-day new user retention:
    • < 5% measurable difference
    • 0.005 Type I Error (false positive)
    • 0.10 Type II Error (false negative)
  • 7-day churn (i.e., 1 - existing user retention):
    • < 10% measurable difference
    • 0.005 Type I Error
    • 0.10 Type II Error

As 10% is quite large, this seems like the smallest viable sample size to satisfy that requirement.

Decreasing the measurable difference in new user retention to 1% increases the necessary sample size to ~70K clients. This is 84% of available experiment population. Therefore, that is close to the maximum sensitivity we can achieve with new user retention in Fenix.

Engagement metrics will be added shortly.

Comparison of navigation timings to visual metrics. Idea would be to utilize a linear combo of timings as a means of getting user-perceived page load metrics.

Engagements metrics have been added. Requesting 85% of population.

Experiment is set to launch October 15th or 31st.

Experiment has been launched. Notebook for data pull and initial analysis.

Report drafted and sent for review.

Status: ASSIGNED → RESOLVED
Closed: 4 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.