Closed
Bug 1343599
Opened 8 years ago
Closed 8 years ago
[Alert] treeherder-prod: Response times (web) 2017-03-01
Categories
(Tree Management :: Treeherder: Infrastructure, defect, P1)
Tree Management
Treeherder: Infrastructure
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: emorley, Assigned: emorley)
References
Details
Fired twice in the last hour (only sent to my email at present):
"""
The 95th percentile response time for treeherder-prod has exceeded your threshold setting of 1000ms.
"""
Investigating.
| Assignee | ||
Comment 1•8 years ago
|
||
This appears to be due to increased activity to:
* SetaJobPriorityViewSet.list
* RunnableJobsViewSet.list
...and since the response time for both of these is pretty high this raises the 95th percentile average response time, which is what the Heroku alert monitors.
The increased activity is presumably due to the switching over to Treeherder's SETA implementation in bug 1326102.
The reason those API requests are slow is that they make requests to both index.taskcluster.net and queue.taskcluster.net as part of the API call, which is bug 1339829.
It would be good to improve the response times for these endpoints, since:
* otherwise we'll have to raise the alert thresholds, which could mask issues with non-SETA API calls
* slow requests on webheads means lower capacity (or increased request queuing times) for the same number of dynos
| Assignee | ||
Comment 2•8 years ago
|
||
For example, this SetaJobPriorityViewSet.list request took 21 seconds (admittedly more chronic than the rest, the average is 5s):
https://rpm.newrelic.com/accounts/677903/applications/14179757/transactions?tw%5Bend%5D=1488392506&tw%5Bstart%5D=1488381706#id=5b225765625472616e73616374696f6e2f46756e6374696f6e2f747265656865726465722e7765626170702e6170692e736574613a536574614a6f625072696f72697479566965775365742e6c697374222c22225d&app_trace_id=9a53cfd5-fea1-11e6-a5be-f8bc124256a0_10937_12953
| Assignee | ||
Comment 3•8 years ago
|
||
Rob/Joel, would you mind taking a look at this sometime in the next few days?
(Doesn't need to be today, I'll just ignore the alerts for now. I'm also fine with bug 1326102 staying landed.)
Flags: needinfo?(rwood)
Flags: needinfo?(jmaher)
| Assignee | ||
Comment 6•8 years ago
|
||
Resolved via bug 1339829.
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → FIXED
You need to log in
before you can comment on or make changes to this bug.
Description
•