Traffic Cop Pinning to Win10 taskbar A/B testing
Categories
(www.mozilla.org :: Analytics, enhancement)
Tracking
(Not tracked)
People
(Reporter: RT, Assigned: craigcook, NeedInfo)
References
Details
Attachments
(1 file)
Reporter | ||
Comment 1•6 years ago
|
||
Comment 2•6 years ago
|
||
Reporter | ||
Comment 3•6 years ago
|
||
Comment 4•6 years ago
|
||
Comment 5•6 years ago
|
||
Reporter | ||
Comment 6•6 years ago
|
||
Comment 7•6 years ago
|
||
Comment 8•6 years ago
|
||
Comment 9•6 years ago
|
||
Comment 10•6 years ago
|
||
Reporter | ||
Comment 11•6 years ago
|
||
Comment 12•6 years ago
|
||
Comment 13•6 years ago
|
||
Assignee | ||
Comment 14•6 years ago
|
||
Comment 15•6 years ago
|
||
Reporter | ||
Comment 16•6 years ago
|
||
Comment 17•6 years ago
|
||
Comment 18•6 years ago
|
||
Comment 19•6 years ago
|
||
Assignee | ||
Comment 20•6 years ago
|
||
Assignee | ||
Comment 21•6 years ago
|
||
Comment 22•6 years ago
|
||
Updated•6 years ago
|
Comment 23•6 years ago
|
||
Comment 24•6 years ago
|
||
Assignee | ||
Comment 25•6 years ago
|
||
Comment 26•6 years ago
|
||
Comment 27•6 years ago
|
||
Comment 28•6 years ago
|
||
Comment 29•6 years ago
|
||
Comment 30•6 years ago
|
||
Comment 31•6 years ago
|
||
Comment 32•6 years ago
|
||
Comment 33•6 years ago
|
||
Comment 34•6 years ago
|
||
Comment 35•6 years ago
|
||
Comment 36•6 years ago
|
||
Reporter | ||
Comment 37•6 years ago
|
||
Comment 38•6 years ago
|
||
Updated•6 years ago
|
Reporter | ||
Comment 39•6 years ago
|
||
For info we're moving forward with a stub-based solution, please see details on https://bugzilla.mozilla.org/show_bug.cgi?id=1493597#c43
Su, can you please validate this looks fine from an analytics standpoint?
Comment 40•6 years ago
|
||
The custom stub installers work in Bug 1493597 is now ready for testing. I spoke to craigcook on slack about re-enabling the Traffic Cop in staging, and also re-enabling it for win32 requests. Plan is to do that on Friday ('zilla std time).
Comment 41•6 years ago
|
||
:rt,
Yes, that solution looks great to me. From an analysis standpoint, this works perfectly.
Follow up question, will this custom stubinstaller be sending install pings like normal stubinstallers do? And if so, where? That data will be useful for the experiment. (I'll also ask in the funnelcake bug).
:craigcook, just to confirm, so we'll be targeting:
- Windows 10
- en-US
users on the website, sampling 9% for 138 stubinstaller, 9% for 139 stubinstaller, and then the stubinstaller will determine:
- if 32 bit OS, install regular Firefox
- if 64 bit OS, install 13X funnelcake
is that a correct understanding?
Thanks!
- Su
Assignee | ||
Comment 42•6 years ago
|
||
(In reply to Su-Young Hong from comment #41)
:craigcook, just to confirm, so we'll be targeting:
- Windows 10
- en-US
users on the website, sampling 9% for 138 stubinstaller, 9% for 139 stubinstaller, and then the stubinstaller will determine:
- if 32 bit OS, install regular Firefox
- if 64 bit OS, install 13X funnelcake
is that a correct understanding?
Correct.
However, I now realize we'll need to make a minor change in our traffic cop setup since we were previously detecting win64 and now we need to remove that condition. I can make that update and we can get it into production first thing Monday. Then we can reactivate the experiment on stage to test and verify, and if all is well we can also activate it in production on Monday.
Comment 43•6 years ago
|
||
thanks Craig!
that works for me.
:rt, does that work for you?
Comment 44•6 years ago
|
||
thank you guys for moving forward on a solution for this!
Reporter | ||
Comment 45•6 years ago
|
||
(In reply to Su-Young Hong from comment #43)
thanks Craig!
that works for me.
:rt, does that work for you?
Good with me! Let's watch closely user acquisition numbers as this roll-out to validate acquisition is fine early on.
Comment 46•6 years ago
|
||
Craig re-enabled traffic cop on stage www.mozilla.org and I've done some testing. Everything seems to be working fine (see below) and I'm happy for us to re-enable on prod too. Is there any reason not to go ahead at this point ?
Test method:
- 64 or 32 bit Windows 10 VMs with matching processors, Google Chrome
- repeat:
- open a new Incognito window (to avoid traffic cop cookie)
- load https://www.allizom.org/en-US/firefox/new/
- if the URL now has a ?f=138 or ?f=139 suffix then
- click on the green 'Download Firefox' button
- run the stub installer
- verify the install using Help > About Firefox
- on 64 bit it'll mention Funnelcake and is a 64 bit build; 139 pins the Firefox icon on the taskbar
- on 32 bit it won't mention Funnelcake and is a 32 bit build
The 9% each selection seems to be working, broadly speaking, as the additional query arg is uncommon, and roughly the same frequency. Couldn't get Windows 7 to give me a funnelcake in ~40 attempts.
Reporter | ||
Comment 47•6 years ago
|
||
I could not get a URL with a ?f=138 or ?f=139 suffix when loading https://www.allizom.org/en-US/firefox/new/ in an incognito Window and pressing refresh about 50 times, both on Chrome and Firefox - could it be because I'm in France? It should still be working given we target en-US regardless of geography but maybe traffic cop got disabled on stage?
I also re-ran Su's query (https://sql.telemetry.mozilla.org/queries/60716/source) which shows a few more DAU of the funelcake as of yesterday.
I'm good with enabling on prod assuming the reason I cannot test is traffic cop got disabled or it did something wrong when testing.
Assignee | ||
Comment 48•6 years ago
|
||
You also need to ensure Do Not Track is OFF. We don't serve the Traffic Cop script to browsers with DNT enabled.
Comment 49•6 years ago
|
||
I found that I had to create a new private window for each request rather than refresh in a single one. Apparently there's a cookie set on the first request which has to be removed somehow (clearing data would also work).
Craig enabled Traffic Cop in prod approximately Tuesday, 15 January 2019 00:00 UTC. I see it working there.
Comment 50•6 years ago
|
||
monitoring FF telemetry and installs here: https://sql.telemetry.mozilla.org/dashboard/funnelcake138-139-monitoring
Comment 51•6 years ago
|
||
How's that going compared to the plan, are we getting sufficient enrolments ? Firefox 65.0 will ship on January 29 and it would be good to know if we want to wrap up distribution before then, or if the release is an opportunity when download traffic is higher (or different type of user, etc). There's a Releng and informal-QA cost to using 65.0 based builds - some checks around installer changes, and that distribution still works OK after bouncer changes.
Reporter | ||
Comment 52•6 years ago
|
||
Our target was 150k profiles enrolled over 14 days
If I read Su's graphs right, between Jan 15th and Jan 20th (6 days) we enrolled 42500 profiles - i.e 7083 daily
If we keep this same pace over the next 9 days (getting us to January 28th as last enrolled day) we'd get another 63700 profiles - i.e 106200 profiles total (30% off of our enrollment target) .
Is that an option to increase the enrollment rate (currently 9%) to avoid releng/QA costs related to using 65.0 based builds?
Comment 53•6 years ago
|
||
Hmm, I read the first graph as cohort size of ~8K each on Jan 20, now ~12k on Jan 21 data, which is a lot further away. Seems like an adjustment to traffic cop would be needed either way.
Comment 54•6 years ago
|
||
+1 romain's comment. We're enrolling (or having less enrollees end up in Telemetry) at a lower rate then we projected.
Currently, we have about 50K total enrollees.
We might want to bump up enrollment (maybe 14% each branch, eyeball projection here) to achieve our desired enrollment size.
- Su
Assignee | ||
Comment 55•6 years ago
|
||
(In reply to Su-Young Hong from comment #54)
We might want to bump up enrollment (maybe 14% each branch, eyeball projection here) to achieve our desired enrollment size.
I can change the sample rate. Do you want to go with 14% or do you need to crunch some numbers to be more precise?
Comment 56•6 years ago
|
||
I just crunched some numbers, we're at about 50K total. Assuming 7K a day, that'll give us another 49K by the 29th. Since we're targeting about 150K, I think we'll need to double our current rate to hit that.
If nobody has any objections, lets go 18% each branch (18% to control, 18% to variant).
- Su
Reporter | ||
Comment 57•6 years ago
|
||
Agreed let's do that quickly so we avoid the releng/QA complexity
Comment 58•6 years ago
|
||
Comment 59•6 years ago
|
||
Assignee | ||
Comment 60•6 years ago
|
||
The rate increase is active in production now. You should probably make a note in your analytics of the date it increased. We can always raise it again if it looks like we still won't hit 150k before Fx65 ships.
Comment 61•6 years ago
|
||
Looks like we've shipped 65.0, but are still distributing 64.0.2 based funnelcake to 18%+18% of requests ? I don't see a recent commit in bedrock anyway.
If it turns out we don't have a large enough cohort we could always verify the 65.0 based builds and re-enable.
Assignee | ||
Comment 62•6 years ago
|
||
(In reply to Nick Thomas [:nthomas] (UTC+13) from comment #61)
Looks like we've shipped 65.0, but are still distributing 64.0.2 based funnelcake to 18%+18% of requests ? I don't see a recent commit in bedrock anyway.
The code is still in bedrock, but controlled by a switch which is currently set to "OFF" so the experiment is suspended. See https://github.com/mozmeao/www-config/commit/31e8df10d47c3e1d5fdd87af67b198430f56caf6
If it turns out we don't have a large enough cohort we could always verify the 65.0 based builds and re-enable.
Su said we had about 136,600 at the time the experiment was halted, so a bit short of the 150k target but maybe it's enough? If you want to produce 65.0 builds, just say the word and we can flip the switch back on. I imagine it wouldn't take long at all to hit 150k total at the 18%/18% sample rate.
Comment 63•6 years ago
|
||
Great to hear suspension already happened thanks.
I will wait to hear if we want to gather more users. The builds were already created by automation, but I'd need to check for any installer changes in 65.0, and update bouncer to point to them.
Comment 64•6 years ago
|
||
Hey, sorry for late comment, but yes, wanted to confirm what Craig said.
To update, it looks like we have about 157,534 enrollees reporting telemetry so we have what we need r.e. population size.
Thanks all for all the hard work on this!
Comment 65•6 years ago
|
||
Experiment concluded with enrollees reporting telemetry. Seems all good, closing.
Description
•