Closed Bug 807378 Opened 13 years ago Closed 13 years ago

snippets.mozilla.com uses shared memcache without a KEY_PREFIX

Categories

(Snippets :: Service, defect)

All
Other
defect
Not set
normal

Tracking

(Not tracked)

VERIFIED FIXED
Future

People

(Reporter: osmose, Unassigned)

References

()

Details

The following URL should should be returning HTML for a few snippets, but isn't: https://snippets.mozilla.com/3/Firefox/18.0/20120713134347/Darwin_Universal-gcc3/en-US/release/Darwin%2010.8.0/mozilla14/1.0/ The admin interface and all other parts of the site seem to be functioning as expected. We recently had some work done setting up a dev and staging site; perhaps something was accidentally changed on production? For starters, I'd like to request that we clear the Zeus and memcache caches to see if they've just cached invalid data. I'd also like to confirm that the site is still getting traffic in the apache logs and if there's anything showing up in the error logs.
:mkelly - per our discussion on irc, i am seeing errors like the following on the web heads: [Wed Oct 31 17:33:59 2012] [error] [client 10.8.81.215] mod_wsgi (pid=8147): Exception occurred processing WSGI script '/data/www/snippets.mozilla.com/home-snippets-server/wsgi/index.wsgi'. [Wed Oct 31 17:33:59 2012] [error] [client 10.8.81.215] IOError: failed to write data
An IOError usually only occurs if the client disconnects before the server can finish writing a response. How far back do these go? We get these from time to time on most of our sites and it usually isn't an issue.
Corey and I have discussed that, and we believe that error is an artifact of the Zeus "Simple HTTP" health check, and not an actual problem... probably not relevant here.
i have cleared the cache in zeus.
Still getting the issue. Per discussion on IRC, we also checked to ensure the DB is populated with data. Is there any way to get a shell to one of the webheads? I could do some more testing to see if it's an app issue with that.
This is fixed, for now. The temp fix was to completely clear memcache. The problem seems to be that this is a Django 1.2 app, and as such does not support a KEY_PREFIX setting. It's currently using a shared memcache cluster. Most likely some other app overwrote one or more of its keys. The long-term solution is to move this to a newer Django version which supports KEY_PREFIX. A medium-term solution would be to move this to a dedicated memcache cluster, or skip memcache altogether. I don't think the load would be an issue.
Morphing this bug and dropping severity.
Severity: critical → normal
Summary: snippets.mozilla.com appears to be responding incorrectly → snippets.mozilla.com uses shared memcache without a KEY_PREFIX
Assignee: server-ops-webops → nobody
Group: infra
Component: Server Operations: Web Operations → Service
Product: mozilla.org → Snippets
Target Milestone: --- → Future
Version: other → unspecified
Blocks: 744613
Seeing the issue again. Code was updated to Django 1.3.4 in d and b. Push bug to fix issue is bug 808689.
Blocks: 808150
Fixed and deployed, the snippets servers should stay working for the foreseeable future.
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Verified; looked at this with Jake and mkelly, and it looks good on prod.
Status: RESOLVED → VERIFIED
You need to log in before you can comment on or make changes to this bug.