Closed Bug 530562 Opened 15 years ago Closed 14 years ago

[AMO] searchd is dead on preview.amo

Categories

(mozilla.org Graveyard :: Server Operations, task)

All
Other
task
Not set
blocker

Tracking

(Not tracked)

VERIFIED FIXED

People

(Reporter: clouserw, Assigned: chizu)

References

()

Details

(Whiteboard: [twss])

From preview.addons.mozilla.org:

PHP Fatal error:  Uncaught exception 'AddonsSearchException' with message 'could not connect to searchd' in /data/www/addons.mozilla.org-preview/site/vendors/sphinx/addonsSearch.php:214

Chizu - is this emailing you when it dies?  Is there a reason it seems to die so often?
I just kicked searchd on pm-app-sphinx01 and 02 should be back up now. I'll leave this open for chizu to figure the rest of it out.
Yup, the URL in the bug = workie for me now.
Trevor, can you see if anything failed? or if you can figure out what happened?
Assignee: server-ops → thardcastle
FWIW, the logs have nothing really...so I guess there is nothing further to dig. It stops on the 19th and starts again when I kicked searchd.

I've asked Dave Dash to turn on some debug stuff in case this happens again so we might have something useful in the logs.
Assignee: thardcastle → shyam
Status: NEW → RESOLVED
Closed: 15 years ago
Resolution: --- → FIXED
Tracking the general issue of crashiness on preview in bug 530830.
Verified FIXED; it's been back up for a while, now.
Status: RESOLVED → VERIFIED
Down again (been down for a couple hours, at least).  Any chance we can have this monitored in nagios?  It does affect our testing schedule, especially closer to release.  Thanks!

(Dave/Krupa told me to reopen; if you'd rather a new bug be filed, let me know, thanks!)
Status: VERIFIED → REOPENED
Resolution: FIXED → ---
It is monitored in nagios. Looking now.
Assignee: shyam → thardcastle
It didn't crash this time. It did exit on both boxes, which is probably a new issue. I'll keep looking into it.
Status: REOPENED → ASSIGNED
Search is down again and it is blocking QA as we have a release on 2/2
Severity: major → blocker
I have this tracked down to the update and rotate scripts occasionally running into each other. They now run further apart and lock to prevent conflicts. On-call is paged and has instructions for restarting it if something else goes wrong.
Status: ASSIGNED → RESOLVED
Closed: 15 years ago14 years ago
Resolution: --- → FIXED
Product: mozilla.org → mozilla.org Graveyard
You need to log in before you can comment on or make changes to this bug.