taskcluster issues, potentially caused by pulse issues, e.g. requests.exceptions.HTTPError: 400 Client Error: Bad Request for url: http://taskcluster/queue/v1/task/G1TWngcxR5-elWhxEqP9pw
Categories
(Taskcluster :: Operations and Service Requests, defect)
Tracking
(Not tracked)
People
(Reporter: aryx, Unassigned)
Details
Reporter
Comment 1•1 year ago
Yarik is investigating an issue with high CPU load on both the firefox-ci and community instances. As far as I understand, some Pulse queues had a backlog of unacknowledged messages, but clearing them did not resolve it.
Tasks fail with errors such as the following (log: https://treeherder.mozilla.org/logviewer?job_id=443648451&repo=autoland):
requests.exceptions.HTTPError: 400 Client Error: Bad Request for url: http://taskcluster/queue/v1/task/G1TWngcxR5-elWhxEqP9pw
Attempts to replay hg pushes or retrigger tasks fail with an internal server error. There are also failed tasks without an obvious error (example), which might be explained by this situation as well.
Trees are closed for this.
Comment 2•1 year ago
The problem in this instance was that the Pernosco consumer stopped working, leading to an extremely large queue of unacked messages. To remedy this we removed permissions from this account and purged the queue until the Pernosco folks can fix the issue on their side.
https://bugzilla.mozilla.org/show_bug.cgi?id=1875132 has been filed about the general issue of unacked messages taking down Pulse.
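To illustrate why a stalled consumer produces this kind of backlog, here is a minimal in-memory sketch. It is a toy model, not Pulse or RabbitMQ code: `ToyQueue`, `run`, and the message counts are hypothetical names and values chosen for illustration. The only behaviour it assumes is the standard one for acknowledged delivery: a delivered message stays held by the broker until the consumer acks it.

```python
from collections import deque


class ToyQueue:
    """Toy stand-in for a broker queue (hypothetical, for illustration only)."""

    def __init__(self):
        self.ready = deque()   # published, not yet delivered
        self.unacked = {}      # delivered, waiting for an ack
        self._next_tag = 1

    def publish(self, body):
        self.ready.append(body)

    def deliver(self):
        """Hand the next ready message to the consumer; it stays unacked until ack()."""
        if not self.ready:
            return None
        tag = self._next_tag
        self._next_tag += 1
        self.unacked[tag] = self.ready.popleft()
        return tag

    def ack(self, tag):
        # Only an ack lets the broker drop the message.
        del self.unacked[tag]

    def backlog(self):
        return len(self.ready) + len(self.unacked)


def run(consumer_acks, messages=1000):
    """Publish `messages` messages and return the backlog left on the queue."""
    q = ToyQueue()
    for i in range(messages):
        q.publish(f"msg-{i}")
        tag = q.deliver()
        if consumer_acks and tag is not None:
            q.ack(tag)
    return q.backlog()


healthy = run(consumer_acks=True)    # consumer acks every message
stalled = run(consumer_acks=False)   # consumer receives but never acks
```

With a healthy consumer the backlog stays at zero; with a consumer that never acks, every published message is retained, so the backlog grows in step with the publish rate.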
Reporter
Comment 3•1 year ago
Try has been reopened; autoland will follow once it has been verified that setting a task as failed works as expected.
Comment 6•1 year ago
This is long fixed.
Today we re-enabled the Pernosco account, and had the folks that run it subscribe to only the events they need. This doesn't fix any root causes (like their consumer breaking and not acking messages, or https://bugzilla.mozilla.org/show_bug.cgi?id=1875132), but it does limit the damage if this particular queue starts to build up messages again.
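As a rough illustration of why subscribing to fewer events limits the damage: assuming the queue is bound to AMQP topic exchanges (in topic matching, `*` matches exactly one dot-separated word and `#` matches zero or more), a narrower binding pattern means fewer messages ever reach the queue. The sketch below reimplements that matching rule in plain Python. The routing keys and binding patterns are hypothetical, not the real Taskcluster or Pernosco ones.

```python
def topic_matches(pattern, routing_key):
    """AMQP-style topic match: '*' is exactly one word, '#' is zero or more words."""
    p = pattern.split(".")
    k = routing_key.split(".")

    def match(i, j):
        if i == len(p):
            return j == len(k)
        if p[i] == "#":
            # '#' may absorb any number of remaining words, including none.
            return any(match(i + 1, jj) for jj in range(j, len(k) + 1))
        if j == len(k):
            return False
        if p[i] == "*" or p[i] == k[j]:
            return match(i + 1, j + 1)
        return False

    return match(0, 0)


def delivered(bindings, routing_keys):
    """Return the routing keys that would land in a queue with these bindings."""
    return [k for k in routing_keys if any(topic_matches(b, k) for b in bindings)]


# Hypothetical routing keys, for illustration only.
published = [
    "task.completed.try",
    "task.failed.try",
    "task.completed.autoland",
    "task.running.autoland",
]

broad = delivered(["#"], published)                    # subscribe to everything
narrow = delivered(["task.completed.try"], published)  # only what is needed
```

If the consumer stalls, the broad binding accumulates every published message, while the narrow one accumulates only the matching subset.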
Comment 8•1 year ago
Comment 9•1 year ago
(In reply to :gerard-majax from comment #8)
> happening right now for me?
> https://treeherder.mozilla.org/jobs?repo=try&revision=95a9ea78d65c056c804a7330e7ed5a7a1c6c23ba
> https://treeherder.mozilla.org/jobs?repo=try&revision=162fdbc6e4e66b69e1fc152ac7c0dbccfe32223a
unlikely to be the same root cause. if it keeps up, please file something new or ping us in Matrix and we'll file it.