Collect telemetry on the number of site-origins across all tabs for all windows
Categories
(Core :: Performance, enhancement, P2)
Tracking
()
Tracking | Status | |
---|---|---|
firefox73 | --- | fixed |
People
(Reporter: sefeng, Assigned: sefeng)
References
Details
Attachments
(2 files, 3 obsolete files)
47 bytes,
text/x-phabricator-request
|
Details | Review | |
2.15 KB,
text/plain
|
tdsmith
:
data-review+
|
Details |
The proposal is we collect number of site origins at the end of pageload, so this gives us a sense of the average/distribution of number of site origins per load (per tab). Then we can use it to correlate with the number of tabs, to get a sense of how Fission would impact our users in terms of the number of processes etc.
Assignee | ||
Comment 1•5 years ago
|
||
Comment 2•5 years ago
|
||
Hm, I'm not sure I understand what you want to collect. It's sometimes helpful to have the patch up for review before the data collection request because it reduces ambiguity. Is it one of these things:
- How many different origins a single pageload uses
- The count of distinct origins used by all active tabs, accumulated after each pageload
I'm not sure what the significance of a tab or of the number of tabs is. I'm also not sure if Fission processes are bound to tabs or what their lifespan will be. We collect the scalar browser.engagement.max_concurrent_tab_count
, but that only describes the high-water mark for number of tabs within each subsession; is that enough context to understand the measurement you want to collect?
Assignee | ||
Comment 3•5 years ago
|
||
Hi Tim,
So the probe that we want to implement, is the first bullet point you mentioned, "How many different origins a single pageload uses". By correlating with the number of tabs, we can learn how many different site origins across the entire browsing session, which basically means how many processes is going to created across the session.
The number of tabs telemetry I am looking at is the TAB_COUNT
one, I think correlating with this one should give us enough information.
Hope it clears the things up, please let me know your opinion.
Comment 4•5 years ago
|
||
Okay, great. If that's the quantity you collect, I think the documentation for this probe should not mention tabs. The data-review request looks good but the title of the ticket confused me.
I forgot about TAB_COUNT; thanks! That will let you find the modal number of simultaneous tabs for each browser session for each user, which is nicer than the maximum. The product of "average origins per pageload" and "modal simultaneous tabs" sounds like a reasonable guess at the number of simultaneous processes. Is that what you were thinking?
Comment 5•5 years ago
|
||
Assignee | ||
Updated•5 years ago
|
Assignee | ||
Comment 6•5 years ago
|
||
Comment 7•5 years ago
|
||
It looks like this is similar to bug 1441972, which added a (now expired) probe that uses Tab/DocGroups. Was this known? Is the idea to add the new probe because the old one uses things that we're likely going to tear out soon?
Assignee | ||
Comment 8•5 years ago
|
||
I didn't aware bug 1441972, and I don't think we had considered to reuse it.
However I don't think they are the same thing? For example, account.google.com
and service.google.com
are in the same docGroup, but they don't have the same siteOrigin.
Updated•5 years ago
|
Comment 9•5 years ago
|
||
(In reply to Sean Feng [:sefeng] from comment #3)
Hi Tim,
So the probe that we want to implement, is the first bullet point you mentioned, "How many different origins a single pageload uses". By correlating with the number of tabs, we can learn how many different site origins across the entire browsing session, which basically means how many processes is going to created across the session.
But tabs can reuse processes, or iframes in them. And that sharing can happen quite often.
Bug 1441972 should be very close to what we want (or I have misunderstood what we want).
Assignee | ||
Comment 10•5 years ago
|
||
What I said in Comment 4 was wrong, the siteOrigin for account.google.com
and service.google.com
are the same, so now I start to think the docGroup telemetry one probably makes sense.
Unless the number of siteOrigin can reveals something unique, we might should just reuse Bug 1441972.
Randell, what do you think?
(Nika's needinfo queue is blocked, can't NI her)
Comment 11•5 years ago
|
||
Bug 1441972 looks at per-tab only, but it sounds close to what we want (though we'd prefer to know the number of unique docgroups-per-session, which isn't exactly docsgroups-per-tab times tabs, though that's a first approximation).
It's be more work, but if we could count the number of unique docgroups that should be close to the number of processes we'll need
Assignee | ||
Comment 12•5 years ago
|
||
I see, yeah, the plan was to collect number of unique site origins per load, and correlate the it with TAB_COUNT telemetry to get a sense of number of process across the browsing session.
I think we'd want another telemetry to count the number of tabGroups, then we can reuse the docGroup-per-tabGroup telemetry.
What do you think Olli?
Comment 13•5 years ago
|
||
tabgroups are going away, bug 1561715.
But what would docgroup-per-tabgroup (== docgroup-per-browsingcontextgroup) tell us?
We want number of different-site-docgroups, no? Since that should map to the number of processes pretty closely
(data: url might be a special case).
Assignee | ||
Comment 14•5 years ago
|
||
Updated•5 years ago
|
Assignee | ||
Comment 15•5 years ago
|
||
After some discussion, we agreed that the old docGroup-per-tabGroup telemetry wasn't sufficient for our use case, because our high level goal was to get a sense of number of processes across the browser.
Then it felt back to the original plan, which was collecting number of site origins per load, and correlate with TAB_COUNT. However, the was one caveat. Olli pointed out that we can't do the correlation because TABs may share processes, for instance, Tab A has sites a, b, c, and Tab B has sites b, c, there will only be 3 processes.
So we agreed a new approach, which is collecting number of unique site origins across the browser with a minimum 5 minutes interval, then we no longer need to do correlations. And this is what the new patch is for.
I will update the data review collection form and the title of the bug after we updated the performance team OKR. (This bug was one of the KR).
Assignee | ||
Updated•5 years ago
|
Assignee | ||
Comment 16•5 years ago
|
||
Requesting Tim to review the updated data collection form.
Comment 17•5 years ago
|
||
Comment 18•5 years ago
|
||
Comment 19•5 years ago
|
||
Backed out for perma fails on browser_Telemetry_numberOfSiteOrigins.js,
Push with failure: https://treeherder.mozilla.org/#/jobs?repo=autoland&selectedJob=279646088&resultStatus=testfailed%2Cbusted%2Cexception&revision=52af8765cb2153b4218308039cf3387faf41900a
Failure log: https://treeherder.mozilla.org/logviewer.html#/jobs?job_id=279646088&repo=autoland&lineNumber=1788
Backout: https://hg.mozilla.org/integration/autoland/rev/e17926e0c221c5c31c0df3ff4bae23782ba81670
Comment 20•5 years ago
|
||
Comment 21•5 years ago
|
||
bugherder |
Assignee | ||
Updated•5 years ago
|
Comment 22•5 years ago
|
||
Comment 23•5 years ago
|
||
Comment on attachment 9141586 [details]
Bug 1589700 - Collect per tab uniqute site origin telemetry r?nika!,snorp!,dexter!
Revision D71493 was moved to bug 1603185. Setting attachment 9141586 [details] to obsolete.
Description
•