Sanitize search engine values on ingestion
Categories
(Firefox for Android :: General, defect, P1)
Tracking
()
People
(Reporter: royang, Assigned: royang)
References
Details
(Whiteboard: [dataquality])
Attachments
(1 file, 3 obsolete files)
4.29 KB,
patch
|
jonalmeida
:
review+
|
Details | Diff | Splinter Review |
+++ This bug was initially created as a clone of Bug #1751753 +++
We need to sanitize the keys of the search_counts histogram during ingestion based on an allowlist of known engines.
Per :standard8:
From a desktop perspective, we’re going to have this list in remote settings (existing version using prefixes), and that also gets updated a few days after remote settings into the main repositories.
From a BQ perspective, the histogram exists in main_v4
under payload.keyed_histograms.search_counts
, which is a key/value struct.
From the JSON perspective, the histogram would be paylod.keyedHistograms.SEARCH_COUNTS
with potential casing differences we'll need to account for.
The structure of the histogram is documented in Histograms.json:
Records search counts for search access points and in-content searches. For search access points in general, the format is: <engine-name>.<search-access-point> For the urlbar when in search mode, the format is <engine name>.urlbar-searchmode For the urlbar when an internal @engine shortcut is used, the format is: <engine-name>.alias For in-content searches, the format is <provider>.in-content:[sap|sap-follow-on|organic]:[code|none]
Assignee | ||
Updated•3 years ago
|
Assignee | ||
Comment 1•3 years ago
|
||
Assignee | ||
Comment 2•3 years ago
|
||
Assignee | ||
Comment 3•3 years ago
|
||
Updated•3 years ago
|
Updated•3 years ago
|
Assignee | ||
Comment 4•3 years ago
|
||
Comment 5•3 years ago
|
||
Comment on attachment 9260720 [details] [diff] [review]
code_validate_4.patch
Looks good. (corrections were based on test failures)
Assignee | ||
Comment 6•3 years ago
|
||
Assignee | ||
Updated•3 years ago
|
Assignee | ||
Comment 7•3 years ago
|
||
Updated•3 years ago
|
Updated•3 years ago
|
Updated•2 years ago
|
Description
•