COT ERROR - Hit ScriptWorkerException
Categories
(Firefox Build System :: Task Configuration, defect)
Tracking
(Not tracked)
People
(Reporter: apavel, Unassigned)
Details
Treeherder link: https://treeherder.mozilla.org/#/jobs?repo=mozilla-central&resultStatus=testfailed%2Cbusted%2Cexception&selectedJob=240620177
2019-04-16T10:51:29 CRITICAL - Can't find task signing aQE2ZZ2TSG6VSTh8vfJM1g in signing:parent RuA4eB6sQ76r6eObSSEnaA task-graph.json!
2019-04-16T10:51:29 CRITICAL - Chain of Trust verification error!
Traceback (most recent call last):
File "/builds/scriptworker/lib/python3.6/site-packages/scriptworker/cot/verify.py", line 2136, in verify_chain_of_trust
task_count = await verify_task_types(chain)
File "/builds/scriptworker/lib/python3.6/site-packages/scriptworker/cot/verify.py", line 1887, in verify_task_types
await valid_task_types[task_type](chain, obj)
File "/builds/scriptworker/lib/python3.6/site-packages/scriptworker/cot/verify.py", line 1668, in verify_parent_task
verify_link_in_task_graph(chain, link, target_link)
File "/builds/scriptworker/lib/python3.6/site-packages/scriptworker/cot/verify.py", line 936, in verify_link_in_task_graph
task_link.name, task_link.task_id, decision_link.name, decision_link.task_id
File "/builds/scriptworker/lib/python3.6/site-packages/scriptworker/cot/verify.py", line 301, in raise_on_errors
raise CoTError("\n".join(errors))
scriptworker.exceptions.CoTError: "Can't find task signing aQE2ZZ2TSG6VSTh8vfJM1g in signing:parent RuA4eB6sQ76r6eObSSEnaA task-graph.json!"
2019-04-16T10:51:29 ERROR - Hit ScriptWorkerException: "Can't find task signing aQE2ZZ2TSG6VSTh8vfJM1g in signing:parent RuA4eB6sQ76r6eObSSEnaA task-graph.json!"
2019-04-16T10:51:29 DEBUG - "/builds/scriptworker/artifacts/public/logs/chain_of_trust.log" is encoded with "None" and has mime/type "text/plain"
2019-04-16T10:51:29 INFO - "/builds/scriptworker/artifacts/public/logs/chain_of_trust.log" can be gzip'd. Compressing...
Comment 1•6 years ago
|
||
Merge to central landed at 9:50:14 UTC.
First gecko decision https://treeherder.mozilla.org/#/jobs?repo=mozilla-central&resultStatus=success%2Ctestfailed%2Cbusted%2Cexception%2Cretry%2Cusercancel%2Crunnable&group_state=expanded&selectedJob=240592867 task got retried as https://treeherder.mozilla.org/#/jobs?repo=mozilla-central&resultStatus=success%2Ctestfailed%2Cbusted%2Cexception%2Cretry%2Cusercancel%2Crunnable&group_state=expanded&selectedJob=240593018 which was complete at 9:56:16 UTC.
The nightly desktop decision task started at 10:00:42 UTC and thus shouldn't have scheduled each job twice.
Justin, does the shippable script not skip the retried decision tasks?
Comment 2•6 years ago
|
||
This is unrelated to the shippable work. What happened is that the first run of the decision task started scheduling jobs and got interrupted. All the tasks depend on the decision task, so that if it fails they don't get started. However, the second run of the decision task succeeded, causing all the tasks scheduled by the first run to be started, along with all the tasks from the second. The task that failed was from the first run, and scriptworker (correctly) couldn't find it in the list of tasks from the succesful second run.
The solution would be to cancel or otherwise cause all the tasks from the first run of the decision tasks to not run. This seems like it happens rarely enough that it might not be worth spending much time on, though.
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
Updated•5 years ago
|
Comment 8•5 years ago
|
||
Aryx
afaik decision tasks have to be rerun from the command line
aki
rerunning will cause bustage in tasks that have been scheduled but are blocked on run 0. rescheduling via the fxci replay-hg-push is probably the right solution
CosminS|sheriffduty
I think we're good without it, will be on the lookout for failures from that push
aki
ok. we'll see duplicate tasks, which waste $$, and bustage like https://bugzilla.mozilla.org/show_bug.cgi?id=1637750 , which is a dup of https://bugzilla.mozilla.org/show_bug.cgi?id=1373013 , and can be avoided by not rerunning the decision task
Description
•