Closed Bug 709171 Opened 13 years ago Closed 13 years ago

https://affiliates.allizom.org/en-US/ sometimes takes 4s or more

Categories

(Infrastructure & Operations Graveyard :: WebOps: Other, task)

All
Other
task
Not set
minor

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: rik, Assigned: jason)

Details

I don't know the frequency at which it happens but maybe every 3 or 4 requests.

$ time curl -vI https://affiliates.allizom.org/fr/
real	0m7.730s
and a normal one:
real	0m0.629s

Looking at the output, it seems the SSL handshake is happening quickly.

Maybe it is the Python process initialisation that we are hitting too often.
Assignee: server-ops → jthomas
On SUMO's staging servers we were seeing a lot of timeouts in automated tests, so we added a deploy task to prime the WSGI processes by hitting the local Apache with a bunch of parallel curl requests. We determined that WSGI starts up in a round-robin, on-demand fashion, and is not quick, especially on the SeaMicro boxes.
(In reply to James Socol [:jsocol, :james] from comment #1)
> On SUMO's staging servers we were seeing a lot of timeouts in automated
> tests, so we added a deploy task to prime the WSGI processes by hitting the
> local Apache with a bunch of parallel curl requests. We determined that WSGI
> starts up in a round-robin, on-demand fashion, and is not quick, especially
> on the SeaMicro boxes.

Yeah, the WSGI daemon spin up on the SeaMicro boxes seems to take forever. We were hoping that after we had enough traffic to the site that the daemons would keep spun up, but we think sometimes they go to sleep and we hit a node that is not awake. Dev and stage being slow with daemon spinning up is fairly normal given that they are not constantly used.
What are the WSGIDaemonProcess settings? They shouldn't go to sleep or die unless inactivity-timeout is set, and you almost certainly don't want that. maximum-requests could also be too low.
I made a modification to the update script on the web heads on engagement-stage:

* removed apache graceful restart - we touch the wsgi entry point on code push for stage
* curl requests to affiliates.allizom.org as mentioned in comment 1

This should help with the slowness on -stage.
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Component: Server Operations: Web Operations → WebOps: Other
Product: mozilla.org → Infrastructure & Operations
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in before you can comment on or make changes to this bug.