Closed Bug 1643490 Opened 4 years ago Closed 4 years ago

Add event telemetry probe to measure the number of words in the input on engagement

Categories

(Firefox :: Address Bar, task, P2)

task
Points:
2

Tracking

()

RESOLVED FIXED
Firefox 79
Iteration:
79.1 - June 1 - June 14
Tracking Status
firefox78 --- fixed
firefox79 --- fixed

People

(Reporter: bugzilla, Assigned: bugzilla)

References

Details

Attachments

(1 file)

This is to improve the data for the experiment in bug 1643386. Event telemetry depends on UrlbarInput rather than any particular queryContext, so it may be too expensive to tokenize the input value for every event telemetry collection. For the kind of analysis we plan on doing, measuring the number of elements in value.trim().split(" ") should be adequate. It should be noted this will produce some mismeasurements for URLs with spaces in the query parameter, or for queries that use keywords.

Severity: -- → S3
Priority: -- → P2
Type: enhancement → task

Comment on attachment 9154369 [details]
Bug 1643490 - Add event telemetry probe to measure the number of words in the input on engagement. r?adw

  1. What questions will you answer with this data?

How many words do users typically enter before selecting a suggestion result in the address bar?

  1. Why does Mozilla need to answer these questions? Are there benefits for users? Do we need this information to address product or business requirements?

If users don't choose suggestions for short/1-word queries, we could explore not fetching suggestions for short queries to improve user privacy.

  1. What alternative methods did you consider to answer these questions? Why were they not sufficient?

Counting the number of characters only, which we currently have a probe for. We'd like the additional nuance of how may words they've typed.

  1. Can current instrumentation answer these questions?

No, only how many characters were typed.

  1. List all proposed measurements and indicate the category of data collection for each measurement, using the Firefox data collection categories found on the Mozilla wiki.

Description: The number of words in the user's query. Measured by trimming the query, then splitting it up by the number of spaces and returning the number of split words. The content of the query is not collected.
Category: 2
Bug for the new probe: bug 1643490
Bug for the experiment: bug 1643386

  1. How long will this data be collected?

Permanently a part of event telemetry collection. The analysis of this data is part of a time-limited experiment planned to run 2020-06-23 to 2020-07-07. See https://experimenter.services.mozilla.com/experiments/measure-the-string-length-on-user-selected-suggestions/

  1. What populations will you measure?

The experiment targets all English locales in Release in all countries.

  1. If this data collection is default on, what is the opt-out mechanism for users?

Disabling event telemetry or opting out of the experiment.

  1. Please provide a general description of how you will analyze this data.

Comparing the number of words typed with when suggestions are picked from the address bar dropdown.

  1. Where do you intend to share the results of your analysis?

Internally, with the Search team.

  1. Is there a third-party tool (i.e. not Telemetry) that you are proposing to use for this data collection? If so:

No.

Attachment #9154369 - Flags: data-review?(bmiroglio)
Pushed by htwyford@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/f13d74051cc3
Add event telemetry probe to measure the number of words in the input on engagement. r=adw

Ah, I'm sorry. I landed this as a force of habit, before data review completed. I'll request backout. No telemetry should have been collected, since this is all behind the event telemetry pref.

Comment on attachment 9154369 [details]
Bug 1643490 - Add event telemetry probe to measure the number of words in the input on engagement. r?adw

Data Review Form

  1. Is there or will there be documentation that describes the schema for the ultimate data set in a public, complete, and accurate way?

This will be documented in the [Events.yaml file[(https://firefox-source-docs.mozilla.org/toolkit/components/telemetry/data/event-ping.html). Note this is data specific to an experiment at this moment in time.

  1. Is there a control mechanism that allows the user to turn the data collection on and off? (Note, for data collection not needed for security purposes, Mozilla provides such a control mechanism) Provide details as to the control mechanism available.

Users can disable this by opt-ing out of Telemetry in the Firefox preferences.

  1. If the request is for permanent data collection, is there someone who will monitor the data over time?

Yes, :harry and the DS team will monitor

  1. Using the category system of data types on the Mozilla wiki, what collection type of data do the requested measurements fall under?

Category 2: Interaction Data

  1. Is the data collection request for default-on or default-off?

default-on

  1. Does the instrumentation include the addition of any new identifiers (whether anonymous or otherwise; e.g., username, random IDs, etc. See the appendix for more details)?

No.

  1. Is the data collection covered by the existing Firefox privacy notice?

Yes.

  1. Does there need to be a check-in in the future to determine whether to renew the data? (Yes/No) (If yes, set a todo reminder or file a bug if appropriate)**

Yes, this should be reevaluated after a few months of collection.

  1. Does the data collection use a third-party collection tool?

No.

data-review: r+

Attachment #9154369 - Flags: data-review?(bmiroglio) → data-review+

This one comment outside of data review:

Would a histogram be more efficient to use here? I recall from past experiments that were sending search events that the volume of events was quite large, and not sustainable for 100% of release.

Flags: needinfo?(htwyford)

It would be nice to get the fine-grained data that event telemetry sends since we're particularly interested in how the number of words typed relates to the type of result picked. For example: are suggestions more likely to be picked when many words are typed?

We don't really need continuous monitoring of this; we're just looking to run a single experiment. I plan to create an experiment add-on that enables event telemetry and filters for the data we need. I have the experiment set up in Experimenter for 0.5% of Release because of the huge volume of data event telemetry produces.

Flags: needinfo?(htwyford)
Pushed by htwyford@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/06798bf879da
Add event telemetry probe to measure the number of words in the input on engagement. r=adw
Status: ASSIGNED → RESOLVED
Closed: 4 years ago
Resolution: --- → FIXED
Target Milestone: --- → Firefox 79

Comment on attachment 9154369 [details]
Bug 1643490 - Add event telemetry probe to measure the number of words in the input on engagement. r?adw

Beta/Release Uplift Approval Request

  • User impact if declined: We won't be able to run an experiment that's targeting 78. See bug 1643386.
  • Is this code covered by automated tests?: Yes
  • Has the fix been verified in Nightly?: Yes
  • Needs manual test from QE?: No
  • If yes, steps to reproduce:
  • List of other uplifts needed: None
  • Risk to taking this patch: Low
  • Why is the change risky/not risky? (and alternatives if risky): Adds one new parameter to an existing and well-tested event telemetry object. There are several tests for the added probe. It isn't collected by default, but only for those enrolled in an add-on experiment.
  • String changes made/needed:
Attachment #9154369 - Flags: approval-mozilla-beta?

Comment on attachment 9154369 [details]
Bug 1643490 - Add event telemetry probe to measure the number of words in the input on engagement. r?adw

approved for 78.0b6

Attachment #9154369 - Flags: approval-mozilla-beta? → approval-mozilla-beta+
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Creator:
Created:
Updated:
Size: