Open Bug 1945460 Opened 5 days ago Updated 5 days ago

Airflow task glam.clients_histogram_bucket_counts failed for 2025-01-30

Categories

(Data Platform and Tools :: General, defect)

Tracking

(Not tracked)

People

(Reporter: lvargas, Unassigned)

Details

(Whiteboard: [airflow-triage])

Task link:
https://workflow.telemetry.mozilla.org/dags/glam/grid?task_id=clients_histogram_bucket_counts&tab=logs&dag_run_id=scheduled__2025-01-30T16%3A00%3A00%2B00%3A00

Log extract:

Could not read served logs: HTTPConnectionPool(host='10.20.3.140', port=8793): Max retries exceeded with url: /log/dag_id=glam/run_id=scheduled__2025-01-30T16:00:00+00:00/task_id=clients_histogram_bucket_counts/attempt=1.log (Caused by ConnectTimeoutError(<urllib3.connection.HTTPConnection object at 0x7ab6e544fb90>, 'Connection to 10.20.3.140 timed out. (connect timeout=5)'))

This appears to be related to the Airflow failure discussed in the data-platform channel, in which tasks lost their connection to the pods.

This failure caused downstream failures in bqetl_glam_export.
