Propagate Accounts deletion requests to telemetry
Categories
(Data Platform and Tools :: General, task)
Tracking
(Not tracked)
People
(Reporter: akomar, Unassigned)
References
Details
Attachments
(2 files)
We need to propagate Accounts deletion requests to properly shred new telemetry ping added in https://bugzilla.mozilla.org/show_bug.cgi?id=1889123
We should make sure that all telemetry deletion request scenarios, including account deletion, are captured.
Reporter | ||
Comment 1•9 months ago
|
||
We are currently ingesting Accounts (FxA) deletion requests through legacy logs: https://github.com/mozilla/bigquery-etl/blob/main/sql/moz-fx-data-shared-prod/firefox_accounts_derived/fxa_delete_events_v2/query.sql that are later used by shredder: https://github.com/mozilla/bigquery-etl/blob/9dbc7cd08bb5da53311f01527a906013f86d16c3/bigquery_etl/shredder/config.py#L108
In order to propagate these deletion requests to Telemetry data, we will:
- Add raw, unhashed
uid
to https://github.com/mozilla/bigquery-etl/blob/main/sql/moz-fx-data-shared-prod/firefox_accounts_derived/fxa_delete_events_v2/query.sql (acceptable per https://docs.google.com/document/d/17eoA4lTdJO6IPjdkzRPT6e7uDbTWM8uGKprgVZA93V8/edit) - Build derived, synthetic deletion request table for legacy Telemetry data in
telemetry_derived
by joining aboveuid
withlegacy_client_id
sent in https://bugzilla.mozilla.org/show_bug.cgi?id=1889123 - Create a view to union synthetic legacy Telemetry deletion table with
telemetry_stable.deletion_request_v4
, and use it instead of this table in Shredder config - Build derived, synthetic deletion request table for desktop Glean telemetry in
firefox_desktop_derived
by joining aboveuid
withclient_info.client_id
sent in https://bugzilla.mozilla.org/show_bug.cgi?id=1889123, and make Shredder use it by adding it in the config
Reporter | ||
Comment 2•8 months ago
•
|
||
In context of https://bugzilla.mozilla.org/show_bug.cgi?id=1895503 :mreid pointed out to me that we want to export just the data from newly added fx_accounts
ping table. With that and thinking about the deletion part more I think the approach that the plan above would implement is too broad.
I think accounts deletion request in its current form should not trigger deletion of all the Firefox telemetry by default. To do that, there should either be a separate request, or a flag in the existing request.
Taking above into account, to let us collect client association ping (https://bugzilla.mozilla.org/show_bug.cgi?id=1903310) we will just shred the newly added ping. This means the plan is to:
- Add raw, unhashed
uid
to https://github.com/mozilla/bigquery-etl/blob/main/sql/moz-fx-data-shared-prod/firefox_accounts_derived/fxa_delete_events_v2/query.sql (acceptable per https://docs.google.com/document/d/17eoA4lTdJO6IPjdkzRPT6e7uDbTWM8uGKprgVZA93V8/edit) - Add new DeleteSource pointing to
fxa_delete_events_v2.uid
and DeleteTarget pointing tofirefox_desktop_stable.fx_accounts_v1
in Shredder config
/cc :mreid for visibility
Comment 3•8 months ago
|
||
Comment 4•8 months ago
|
||
Reporter | ||
Updated•8 months ago
|
Description
•