Closed Bug 1915996 Opened 10 months ago Closed 10 months ago

Airflow task socorro_import.bigquery_load failed for exec_date 2024-08-30

Categories

(Data Platform and Tools :: General, defect)

defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: anicholson, Assigned: srose)

Details

(Whiteboard: [airflow-triage])

Airflow task socorro_import.bigquery_load failed for exec_date 2024-08-30

Task link:
https://workflow.telemetry.mozilla.org/dags/socorro_import/grid?dag_run_id=scheduled__2024-08-29T00%3A00%3A00%2B00%3A00&task_id=bigquery_load&tab=logs

Log extract:

airflow.providers.cncf.kubernetes.utils.pod_manager.PodLaunchFailedException: Pod took longer than 120 seconds to start. Check the pod events in kubernetes to determine why.

The pod is failing to pull the docker image (Back-off pulling image "docker.io/mozilla/parquet2bigquery:20190722": ImagePullBackOff in gke) and it seems to be because it's not available in docker hub anymore https://www.docker.com/mozilla/parquet2bigquery. I don't know why though. We could put the image in gcr but the repo is archived and hasn't been updated in 5 years so I don't know if that makes sense to do https://github.com/mozilla/parquet2bigquery

Assignee: nobody → srose
Status: NEW → ASSIGNED

https://github.com/mozilla/telemetry-airflow/pull/2084 seems to have fixed it. I cleared the tasks for 2024-08-30 forward.

All the previously failed tasks that :whd cleared completed successfully, and new tasks since then have also run successfully.

Status: ASSIGNED → RESOLVED
Closed: 10 months ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.