Airflow bqetl_glean_usage clients_last_seen_joined__v1 tasks for all apps failing on 2025-02-05
Categories
(Data Platform and Tools :: General, defect)
Tracking
(Not tracked)
People
(Reporter: lvargas, Assigned: kik)
Details
(Whiteboard: [airflow-triage])
The Airflow bqetl_glean_usage clients_last_seen_joined__v1 tasks for all apps failed on 2025-02-05 because the query result contains a duplicate is_default_browser column. The error is the same for all of them. These are sql_generated tables that share a single template.
Tables affected & link:
- fenix_derived__clients_last_seen_joined__v1
- klar_ios_derived__clients_last_seen_joined__v1
- focus_ios_derived__clients_last_seen_joined__v1
- focus_android_derived__clients_last_seen_joined__v1
- firefox_ios_derived__clients_last_seen_joined__v1
Log extract (2025-02-05, 05:55:58 UTC, pod_manager.py:472):
Error in query string: Error processing job 'moz-fx-data-shared-prod:bqjob_r2d34dd0a9ad2d986_00000194d4adf2ea_1': Duplicate column names in the result are not supported when a destination table is present. Found duplicate(s): is_default_browser
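Raw Airflow pod logs can wrap a long BigQuery error across several lines, each carrying its own timestamp prefix, which makes the message hard to read. Below is a minimal sketch for recovering the plain message. The prefix pattern and the rule of gluing fragments that end in a hyphen are assumptions based on this one extract, and `strip_log_prefixes` is a hypothetical helper, not part of Airflow or bigquery-etl.

```python
import re

# Prefix the pod log adds to each wrapped line (assumed from this one extract).
LOG_PREFIX = re.compile(
    r"\s*\[\d{4}-\d{2}-\d{2}, \d{2}:\d{2}:\d{2} UTC\] "
    r"\{pod_manager\.py:\d+\} INFO - \[base\]\s*"
)

RAW_EXTRACT = (
    "Error in query string: Error processing job 'moz-fx-data-shared- "
    "[2025-02-05, 05:55:58 UTC] {pod_manager.py:472} INFO - [base] "
    "prod:bqjob_r2d34dd0a9ad2d986_00000194d4adf2ea_1': "
    "Duplicate column names in the "
    "[2025-02-05, 05:55:58 UTC] {pod_manager.py:472} INFO - [base] "
    "result are not supported when a destination table is present. Found "
    "[2025-02-05, 05:55:58 UTC] {pod_manager.py:472} INFO - [base] "
    "duplicate(s): is_default_browser"
)


def strip_log_prefixes(raw: str) -> str:
    """Remove interleaved log prefixes and rejoin the wrapped message."""
    fragments = [f.strip() for f in LOG_PREFIX.split(raw) if f.strip()]
    message = ""
    for fragment in fragments:
        # Heuristic: a fragment ending in "-" was wrapped mid-identifier,
        # so the next fragment is glued on without a space.
        if not message or message.endswith("-"):
            message += fragment
        else:
            message += " " + fragment
    return message


print(strip_log_prefixes(RAW_EXTRACT))
```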
Reporter
Comment 1 • 1 month ago
The duplicated column seems to be introduced in PR https://github.com/mozilla/bigquery-etl/pull/6930.
Assignee
Updated • 1 month ago
Reporter
Comment 3 • 1 month ago
Fenix running okay today.
The other mobile apps [klar_ios, firefox_ios, focus_android, focus_ios] are still failing because the column is duplicated in their schemas (is_default_browser, is_default_browser_1).
We need a DROP COLUMN / schema update.
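A quick way to spot this kind of schema duplication is to look for columns named `<base>_<n>` whose `<base>` column also exists. The sketch below assumes the column names have already been fetched as a list of strings. `find_suffixed_duplicates` is a hypothetical helper, and `client_id` and `submission_date` in the example are placeholder column names, not taken from the affected tables.

```python
import re

# Matches names such as "is_default_browser_1": a base name plus "_<digits>".
SUFFIXED = re.compile(r"^(?P<base>.+)_\d+$")


def find_suffixed_duplicates(columns: list[str]) -> list[str]:
    """Return columns named `<base>_<n>` whose `<base>` is also in the schema."""
    names = {column.lower() for column in columns}
    hits = []
    for column in columns:
        match = SUFFIXED.match(column.lower())
        if match and match.group("base") in names:
            hits.append(column)
    return hits


# Placeholder schema for illustration only.
example_schema = [
    "client_id",
    "submission_date",
    "is_default_browser",
    "is_default_browser_1",
]
print(find_suffixed_duplicates(example_schema))
```

The check only yields candidates: a legitimate column whose name happens to end in a number and share a prefix with another column would also be flagged, so results need a manual review before anything is dropped.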
Assignee
Comment 4 • 1 month ago
The tasks now complete successfully. And yes, we should drop is_default_browser_1 in the tables where it was created as a result of this.
Assignee
Comment 5 • 1 month ago
Used the following query to identify which tables now contain the is_default_browser_1 field:
FROM `region-us`.INFORMATION_SCHEMA.COLUMNS
|> WHERE LOWER(column_name) = 'is_default_browser_1' AND table_name = "clients_last_seen_joined_v1"
|> SELECT table_catalog, table_schema, table_name, column_name;
The result:
table_catalog            table_schema           table_name                   column_name
moz-fx-data-shared-prod  focus_android_derived  clients_last_seen_joined_v1  is_default_browser_1
moz-fx-data-shared-prod  focus_ios_derived      clients_last_seen_joined_v1  is_default_browser_1
moz-fx-data-shared-prod  firefox_ios_derived    clients_last_seen_joined_v1  is_default_browser_1
moz-fx-data-shared-prod  fenix_derived          clients_last_seen_joined_v1  is_default_browser_1
moz-fx-data-shared-prod  klar_ios_derived       clients_last_seen_joined_v1  is_default_browser_1
Assignee
Comment 6 • 1 month ago
Statements to drop this column in those tables:
ALTER TABLE `moz-fx-data-shared-prod.focus_android_derived.clients_last_seen_joined_v1` DROP COLUMN IF EXISTS is_default_browser_1;
ALTER TABLE `moz-fx-data-shared-prod.focus_ios_derived.clients_last_seen_joined_v1` DROP COLUMN IF EXISTS is_default_browser_1;
ALTER TABLE `moz-fx-data-shared-prod.firefox_ios_derived.clients_last_seen_joined_v1` DROP COLUMN IF EXISTS is_default_browser_1;
ALTER TABLE `moz-fx-data-shared-prod.fenix_derived.clients_last_seen_joined_v1` DROP COLUMN IF EXISTS is_default_browser_1;
ALTER TABLE `moz-fx-data-shared-prod.klar_ios_derived.clients_last_seen_joined_v1` DROP COLUMN IF EXISTS is_default_browser_1;
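Since the five statements differ only in the dataset name, they can be generated rather than typed by hand. This is a minimal sketch, assuming the dataset list is copied from the query result in this bug. `drop_column_statement` is a hypothetical helper that only builds strings and does not execute anything.

```python
PROJECT = "moz-fx-data-shared-prod"
TABLE = "clients_last_seen_joined_v1"
COLUMN = "is_default_browser_1"

# Datasets taken from the INFORMATION_SCHEMA result in this bug.
DATASETS = [
    "focus_android_derived",
    "focus_ios_derived",
    "firefox_ios_derived",
    "fenix_derived",
    "klar_ios_derived",
]


def drop_column_statement(
    dataset: str,
    project: str = PROJECT,
    table: str = TABLE,
    column: str = COLUMN,
) -> str:
    """Build one idempotent DROP COLUMN statement for a dataset."""
    return (
        f"ALTER TABLE `{project}.{dataset}.{table}` "
        f"DROP COLUMN IF EXISTS {column};"
    )


statements = [drop_column_statement(dataset) for dataset in DATASETS]
print("\n".join(statements))
```

Keeping `IF EXISTS` in the generated statement means a rerun is harmless for tables where the column is already gone.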
Assignee
Comment 7 • 1 month ago
The statements were executed and columns have been marked for deletion.
Assignee
Comment 8 • 1 month ago
I believe this issue is resolved now. If that is not the case, please reopen the bug and reach out to me.