Glean.js automatic page load instrumentation
Categories
(Data Platform and Tools :: Glean: SDK, task, P1)
Tracking
(Not tracked)
People
(Reporter: brosa, Assigned: brosa)
References
Details
Attachments
(2 files, 1 obsolete file)
45 bytes,
text/x-github-pull-request
|
Details | Review | |
3.28 KB,
text/plain
|
travis_
:
data-review+
|
Details |
Automatic instrumentation for basic page load events in Glean.js. We are looking to capture the event itself, plus 3 keys
- url
- referrer
- title
We need to handle 2 scenarios
- Automatic instrumentation where each time Glean.js is initialized, we collect the event by default.
- Automatic instrumentation is turned off and we provide the client a way to collect the same event manually. The metric is provided through a Glean API that allows for overriding all 3 values. If the values aren't overwritten, then we just collect the defaults.
Assignee | ||
Comment 1•1 year ago
|
||
Assignee | ||
Comment 2•1 year ago
|
||
This is a bit of a unique data review since what we are proposing to collect will ultimately be determined by each client.
Should we require a data-review on a per client basis to enable this? I am interested in hearing the data steward point of view for how we manage this once the code is released.
Comment 3•1 year ago
|
||
Bruno, typically for things defined internally in Glean like this, we just default to "on" and permanent collection. The way you have this designed it would sort of imply that each product that wants to use this needs to opt-in to enable it and would require a data-review for it. In the interests of not adding that additional friction to using this new page-load metric, I'd say we go ahead and say that this is permanently collected, and assume it will be used by anyone using Glean.js. It's only user interaction data, so no escalation should be necessary to be able to approve this collection.
Assignee | ||
Comment 4•1 year ago
|
||
I think that makes sense.
:dexter & :janerik does anyone have an issue with making this enabled by default? I think if it is going to be enabled by default, that will mean that my documentation updates will need to be very clear about who should be using this and who should use the manual events, based on the project set up.
Comment 5•1 year ago
|
||
(In reply to Travis Long [:travis_] from comment #3)
In the interests of not adding that additional friction to using this new page-load metric, I'd say we go ahead and say that this is permanently collected, and assume it will be used by anyone using Glean.js.
I agree with Travis
(In reply to Bruno Rosa [:brosa] from comment #4)
I think that makes sense.
:dexter & :janerik does anyone have an issue with making this enabled by default?
I think for data-review purposes we should assume this is going to be default on. We should likely wait to turn it on by default though, as we want to test and validate it first (but that doesn't matter for data-review purposes).
Assignee | ||
Comment 6•1 year ago
|
||
Updated•1 year ago
|
Comment 7•1 year ago
|
||
Comment 8•1 year ago
|
||
Comment on attachment 9366166 [details]
Updated data-review attached to bug
Data Review
- Is there or will there be documentation that describes the schema for the ultimate data set in a public, complete, and accurate way?
Yes, through the metrics.yaml file and the Glean Dictionary.
- Is there a control mechanism that allows the user to turn the data collection on and off?
Yes, through the data preferences in the integrating application's settings.
- If the request is for permanent data collection, is there someone who will monitor the data over time?
permanent collection to be monitored by brosa and glean-team@mozilla.com
- Using the category system of data types on the Mozilla wiki, what collection type of data do the requested measurements fall under?
Category 2, Interaction data
- Is the data collection request for default-on or default-off?
Default-on
- Does the instrumentation include the addition of any new identifiers (whether anonymous or otherwise; e.g., username, random IDs, etc. See the appendix for more details)?
No
- Is the data collection covered by the existing Firefox privacy notice?
Yes
- Does the data collection use a third-party collection tool?
No
Result
data-review+
Updated•1 year ago
|
Assignee | ||
Updated•1 year ago
|
Description
•