Closed Bug 1667482 Opened 5 years ago Closed 5 years ago

Intermittent requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/GJLIrPE4Sm-G9TzK7-3hQw/artifacts/public/parameters.yml

Categories

(Firefox Build System :: Task Configuration, defect, P5)

defect

Tracking

(firefox83 fixed)

RESOLVED FIXED
83 Branch
Tracking Status
firefox83 --- fixed

People

(Reporter: intermittent-bug-filer, Assigned: ahal)

References

Details

(Keywords: intermittent-failure)

Attachments

(1 file, 1 obsolete file)

Filed by: nbeleuzu [at] mozilla.com
Parsed log: https://treeherder.mozilla.org/logviewer.html#?job_id=316747715&repo=autoland
Full log: https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/OUN3Pb9aQ_mIwPPG5ayLJA/runs/0/artifacts/public/logs/live_backing.log


[task 2020-09-25T21:06:37.874Z] Traceback (most recent call last):
[task 2020-09-25T21:06:37.874Z]   File "/builds/worker/checkouts/gecko/taskcluster/mach_commands.py", line 208, in taskgraph_decision
[task 2020-09-25T21:06:37.874Z]     ret = taskgraph.decision.taskgraph_decision(options)
[task 2020-09-25T21:06:37.874Z]   File "/builds/worker/checkouts/gecko/taskcluster/taskgraph/decision.py", line 232, in taskgraph_decision
[task 2020-09-25T21:06:37.874Z]     set_decision_indexes(decision_task_id, tgg.parameters, tgg.graph_config)
[task 2020-09-25T21:06:37.874Z]   File "/builds/worker/checkouts/gecko/taskcluster/taskgraph/generator.py", line 145, in parameters
[task 2020-09-25T21:06:37.874Z]     return self._run_until('parameters')
[task 2020-09-25T21:06:37.874Z]   File "/builds/worker/checkouts/gecko/taskcluster/taskgraph/generator.py", line 376, in _run_until
[task 2020-09-25T21:06:37.874Z]     k, v = next(self._run)
[task 2020-09-25T21:06:37.874Z]   File "/builds/worker/checkouts/gecko/taskcluster/taskgraph/generator.py", line 241, in _run
[task 2020-09-25T21:06:37.874Z]     parameters = self._parameters(graph_config)
[task 2020-09-25T21:06:37.874Z]   File "/builds/worker/checkouts/gecko/taskcluster/taskgraph/decision.py", line 218, in <lambda>
[task 2020-09-25T21:06:37.874Z]     lambda graph_config: get_decision_parameters(graph_config, options)
[task 2020-09-25T21:06:37.874Z]   File "/builds/worker/checkouts/gecko/taskcluster/taskgraph/decision.py", line 405, in get_decision_parameters
[task 2020-09-25T21:06:37.874Z]     parameters['backstop'] = is_backstop(parameters)
[task 2020-09-25T21:06:37.874Z]   File "/builds/worker/checkouts/gecko/taskcluster/taskgraph/util/backstop.py", line 64, in is_backstop
[task 2020-09-25T21:06:37.874Z]     last_pushdate = get_artifact(last_backstop_id, 'public/parameters.yml')["pushdate"]
[task 2020-09-25T21:06:37.874Z]   File "/builds/worker/checkouts/gecko/taskcluster/taskgraph/util/taskcluster.py", line 154, in get_artifact
[task 2020-09-25T21:06:37.874Z]     response = _do_request(get_artifact_url(task_id, path, use_proxy))
[task 2020-09-25T21:06:37.874Z]   File "/builds/worker/checkouts/gecko/taskcluster/taskgraph/util/taskcluster.py", line 113, in _do_request
[task 2020-09-25T21:06:37.874Z]     response.raise_for_status()
[task 2020-09-25T21:06:37.874Z]   File "/builds/worker/checkouts/gecko/third_party/python/requests/requests/models.py", line 840, in raise_for_status
[task 2020-09-25T21:06:37.874Z]     raise HTTPError(http_error_msg, response=self)
[task 2020-09-25T21:06:37.874Z] requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/GJLIrPE4Sm-G9TzK7-3hQw/artifacts/public/parameters.yml
[taskcluster 2020-09-25 21:06:38.497Z] === Task Finished ===
[taskcluster 2020-09-25 21:06:38.598Z] Artifact "public/docker-contexts" not found at "/builds/worker/checkouts/gecko/docker-contexts"
[taskcluster 2020-09-25 21:06:38.705Z] Unsuccessful task run with exit code: 1 completed in 179.279 seconds
Assignee: nobody → aki
Status: NEW → ASSIGNED
Pushed by asasaki@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/725f6e6d8aaf retry taskcluster requests. r=taskgraph-reviewers,jmaher DONTBUILD
Status: ASSIGNED → RESOLVED
Closed: 5 years ago
Resolution: --- → FIXED

Backed out changeset 725f6e6d8aaf (bug 1667482) by aki's request.

Backout link: https://hg.mozilla.org/integration/autoland/rev/127cd76bff369fecb81dc99cde6e10bd2620b6e1

Status: RESOLVED → REOPENED
Flags: needinfo?(aki)
Resolution: FIXED → ---

Retries are just making this worse: we continue to 404, but it takes longer to get the result, and sometimes we even hit claim-expired. I think we need to figure out why the queue is giving us 404s for valid artifact URLs.

Assignee: aki → nobody
Component: General → Operations and Service Requests
Flags: needinfo?(aki)
Product: Release Engineering → Taskcluster
QA Contact: catlee
[task 2020-09-25T21:06:37.874Z] requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/GJLIrPE4Sm-G9TzK7-3hQw/artifacts/public/parameters.yml

But https://firefox-ci-tc.services.mozilla.com/tasks/GJLIrPE4Sm-G9TzK7-3hQw has resolved time 2020-09-25T21:07:16.122Z. That was, I think, when the artifacts were finished uploading. The decision task's log has

[taskcluster 2020-09-25 21:06:59.366Z] === Task Finished ===

which is still 22 seconds after OUN3Pb9aQ_mIwPPG5ayLJA logged the 404.

So, I think those artifact URLs are not in fact valid at the time they are being requested. It looks like a dependency of some sort is missing.

Oh. This might be fallout from :ahal adding indexes mid-decision task.

Flags: needinfo?(ahal)
Component: Operations and Service Requests → Task Configuration
Product: Taskcluster → Firefox Build System

Ah, we moved the index insertion to the start of the decision task to avoid double backstops.. but then immediately afterwards rely on parameters.yml which is only available after the decision task anyway ><.

We only need the parameters.yml to compute the time component, so we can just assume that the current push is not a backstop in this case.

Assignee: nobody → ahal
Status: REOPENED → ASSIGNED
Flags: needinfo?(ahal)

For context, any decision task that started while a backstop decision task was still ongoing, would hit this issue. So that's why we were seeing strings of decision tasks that hit this in a row. I'll have a fix up soon.

Pushed by ahalberstadt@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/dd07adf3f7bd [taskgraph] Don't fail if last backstop decision task is still running, r=marco
Status: ASSIGNED → RESOLVED
Closed: 5 years ago5 years ago
Resolution: --- → FIXED
Target Milestone: --- → 83 Branch
Attachment #9177985 - Attachment is obsolete: true
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: