Airflow task socorro_import.bigquery_load failed for exec_date 2024-08-30
Categories
(Data Platform and Tools :: General, defect)
Tracking
(Not tracked)
People
(Reporter: anicholson, Assigned: srose)
Details
(Whiteboard: [airflow-triage])
Airflow task socorro_import.bigquery_load failed for exec_date 2024-08-30
Log extract:
airflow.providers.cncf.kubernetes.utils.pod_manager.PodLaunchFailedException: Pod took longer than 120 seconds to start. Check the pod events in kubernetes to determine why.
Comment 1•10 months ago
|
||
The pod is failing to pull the docker image (Back-off pulling image "docker.io/mozilla/parquet2bigquery:20190722": ImagePullBackOff
in gke) and it seems to be because it's not available in docker hub anymore https://www.docker.com/mozilla/parquet2bigquery. I don't know why though. We could put the image in gcr but the repo is archived and hasn't been updated in 5 years so I don't know if that makes sense to do https://github.com/mozilla/parquet2bigquery
Assignee | ||
Updated•10 months ago
|
Comment 2•10 months ago
|
||
I deleted this image assuming it wasn't used anymore in https://mozilla-hub.atlassian.net/browse/DSRE-1739?focusedCommentId=927691.
I've attempted to rebuild it based on https://github.com/mozilla/parquet2bigquery/pull/36, which has yielded https://github.com/mozilla/telemetry-airflow/pull/2084.
Comment 3•10 months ago
|
||
https://github.com/mozilla/telemetry-airflow/pull/2084 seems to have fixed it. I cleared the tasks for 2024-08-30 forward.
Assignee | ||
Comment 4•10 months ago
|
||
All the previously failed tasks that :whd cleared completed successfully, and new tasks since then have also run successfully.
Description
•