Closed Bug 695105 Opened 14 years ago Closed 14 years ago

Mozillians dev/trunk staging server suffers from timeouts/Zeus "Service Unavailable" error pages

Categories

(mozilla.org Graveyard :: Server Operations, task)

task
Not set
normal

Tracking

(Not tracked)

VERIFIED FIXED

People

(Reporter: stephend, Assigned: jason)

Details

(Whiteboard: [fromAutomation])

Attachments

(2 files)

Our Mozillians prod and stage Selenium automation runs don't fail the way our mozillians.trunk one [1] does; the trunk run is very unreliable (we frequently hit a Zeus "Service Unavailable" page). Attached is an HTML report from our Jenkins (download the folder and open index.htm).
[1] http://qa-selenium.mv.mozilla.com:8080/view/Mozillians/job/mozillians.trunk/ -- you should be on Mozilla-MV to reach it, if you're remote.
Assignee: server-ops → jthomas
How often do these tests run on the dev environment? I believe these 'Service Unavailable' errors occur because the backends don't respond quickly enough after a restart/reload of the Apache httpd or WSGI daemon, which happens continuously on dev.
(In reply to Jason Thomas [:jason] from comment #2)
> How often do these tests run on dev environment? I believe that these
> 'Service Unavailable' occur due to the backends not responding quick enough
> after a restart/reload of the apache httpd or wsgi daemon which continuously
> occurs on dev.
We have a cron set for: */30 09-18 * * 1-5
If we can either do graceful restarts (I believe AMO just "touch"es the dir/files, so Apache doesn't restart as often), or work around the times when it's updating, we can change the cron to do so. Or perhaps a post-commit trigger would be even better?
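For reference, the schedule quoted above can be expanded as follows. This is a minimal sketch in Python and not part of the Jenkins setup; the function names `matches_schedule` and `runs_on` are made up for illustration, and it assumes standard five-field cron semantics (minute, hour, day of month, month, day of week).

```python
from datetime import datetime, timedelta


def matches_schedule(moment):
    """Return True if `moment` matches the cron entry */30 09-18 * * 1-5."""
    return (
        moment.minute % 30 == 0      # */30 -> minutes 0 and 30
        and 9 <= moment.hour <= 18   # 09-18 -> hours 9 through 18, inclusive
        and moment.weekday() < 5     # 1-5 -> Monday to Friday (Python: Monday is 0)
    )


def runs_on(day):
    """List every minute of `day` at which the cron entry would fire."""
    midnight = day.replace(hour=0, minute=0, second=0, microsecond=0)
    candidates = (midnight + timedelta(minutes=m) for m in range(24 * 60))
    return [t for t in candidates if matches_schedule(t)]
```

Under that reading the job fires at :00 and :30 of every hour from 09:00 through 18:30, Monday to Friday, which is 20 runs per weekday.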
On the generic cluster we currently touch the wsgi file every time the update runs (every 5 minutes), which causes the wsgi daemon to reload on the next request. I have updated the script to touch the wsgi file for mozillians-dev only when the code updates. Hopefully this will help with the issue.
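The change described above could look roughly like the following. This is a sketch only: the actual update script is not shown in this bug, so the function name `reload_if_code_changed`, the revision strings, and the file path are all hypothetical. It assumes, as the comment describes, that the daemon reloads the application on the next request once the wsgi file's modification time has changed.

```python
from pathlib import Path


def reload_if_code_changed(previous_rev, current_rev, wsgi_file):
    """Touch `wsgi_file` only when the checked-out revision changed.

    Returns True if the file was touched (a reload was requested),
    False if nothing changed and the file was left alone.
    """
    if previous_rev == current_rev:
        # No new code was pulled, so keep the old mtime and avoid a reload.
        return False
    # Bumping the mtime is the signal for the daemon to reload (assumption above).
    Path(wsgi_file).touch()
    return True
```

An update job would record the revision before and after pulling and pass both in, so runs that pull nothing no longer trigger a reload.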
Status: NEW → RESOLVED
Closed: 14 years ago
Resolution: --- → FIXED
(In reply to Jason Thomas [:jason] from comment #4)
> On the generic cluster we currently touch the wsgi file every time the
> update runs (every 5 minutes), which would cause wsgi daemon to reload on a
> new request.
>
> I have updated the script to only touch the wsgi file for mozillians-dev
> when the code updates.
>
> Hopefully this will help with the issue.
This seems a lot better now, Jason; thanks. Do you know if Affiliates does the same thing, too? If so, can you look at a fix for that?
Verified FIXED; thanks.
Status: RESOLVED → VERIFIED
Product: mozilla.org → mozilla.org Graveyard