Closed Bug 1766999 Opened 2 years ago Closed 2 years ago

Report default search engine data via Glean

Categories

(Firefox :: Search, task, P3)

task

Tracking

()

VERIFIED FIXED
102 Branch
Tracking Status
firefox102 --- verified

People

(Reporter: standard8, Assigned: standard8)

References

(Blocks 1 open bug)

Details

Attachments

(2 files, 1 obsolete file)

We are already reporting default search engine telemetry via legacy telemetry, but we would also like to report it in the newer system - Glean.

Attachment #9274403 - Attachment description: WIP: Bug 1766999 - Report default search engine data via Glean. → Bug 1766999 - Report default search engine data via Glean. r?dexter
Attached file Data Review Request (obsolete) —

I am currently double-checking on the email address for the collection, I think we might have a new one now.

Attachment #9274958 - Flags: data-review?(chutten)

Comment on attachment 9274958 [details]
Data Review Request

PRELIMINARY NOTES:

The path and url are cat3 (called web_activity in Glean, "Stored Content and Communications" in the wiki) because they capture partial browsing history. They may still eligible for default-on collection in all channels subject to Legal/Trust's okay.

DATA COLLECTION REVIEW RESPONSE:

Is there or will there be documentation that describes the schema for the ultimate data set available publicly, complete and accurate?

Yes.

Is there a control mechanism that allows the user to turn the data collection on and off?

Yes. This collection is Telemetry so can be controlled through Firefox's Preferences.

If the request is for permanent data collection, is there someone who will monitor the data over time?

Yes, :standard8 is responsible.

Using the category system of data types on the Mozilla wiki, what collection type of data do the requested measurements fall under?

Category 3, Stored Content and Communications

Is the data collection request for default-on or default-off?

Default on for all channels.

Does the instrumentation include the addition of any new identifiers?

No. (the search engine id is an existing identifier)

Is the data collection covered by the existing Firefox privacy notice?

Yes.

Does the data collection use a third-party collection tool?

No.


Result: datareview- pending Legal approval for default on Cat3 collection

Flags: needinfo?(mfeldman)
Attachment #9274958 - Flags: data-review?(chutten) → data-review-

(In reply to Chris H-C :chutten from comment #3)

The path and url are cat3 (called web_activity in Glean, "Stored Content and Communications" in the wiki) because they capture partial browsing history. They may still eligible for default-on collection in all channels subject to Legal/Trust's okay.

Thank you, I wasn't totally sure about those.

To aid Legal's review, I just wanted to point out that these are already processed via legacy telemetry and they were added in bug 1164159. Some of the comments there might be useful in reviewing - I'm quite happy having an explicit re-review with the newer data collection processes.

approved. This is data we already collect, just updating the tool we use to collect it.

Flags: needinfo?(mfeldman)
Attached file Data Review Request

Updated request

Attachment #9274958 - Attachment is obsolete: true
Attachment #9275691 - Flags: data-review?(chutten)

For qe-verify: Run through tests for changing the default search engine (and private), and confirm if Glean is correctly updated.

Flags: qe-verify+

Comment on attachment 9275691 [details]
Data Review Request

DATA COLLECTION REVIEW RESPONSE:

Is there or will there be documentation that describes the schema for the ultimate data set available publicly, complete and accurate?

Yes.

Is there a control mechanism that allows the user to turn the data collection on and off?

Yes. This collection is Telemetry so can be controlled through Firefox's Preferences.

If the request is for permanent data collection, is there someone who will monitor the data over time?

Yes, Mark Banner, fx-search-telemetry@mozilla.com, and rev-data@mozilla.com are responsible.

Using the category system of data types on the Mozilla wiki, what collection type of data do the requested measurements fall under?

Category 3, Stored Content and Communications

Is the data collection request for default-on or default-off?

Default on for all channels.

Does the instrumentation include the addition of any new identifiers?

No.

Is the data collection covered by the existing Firefox privacy notice?

Yes.

Does the data collection use a third-party collection tool?

No.


Result: datareview+ given Legal's okay.

Attachment #9275691 - Flags: data-review?(chutten) → data-review+
Pushed by mbanner@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/0f38955cbc7c
Report default search engine data via Glean. r=Dexter,mcheang
Depends on: 1769998
Status: NEW → RESOLVED
Closed: 2 years ago
Resolution: --- → FIXED
Target Milestone: --- → 102 Branch
Blocks: 1769998
No longer depends on: 1769998

Seems I forgot to mark this issue as verified. The telemetry stated here was tested as part of Search Nightly regression test run.

Status: RESOLVED → VERIFIED
Flags: qe-verify+ → in-qa-testsuite+
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: