Closed Bug 1381809 Opened 8 years ago Closed 8 years ago

webapp on socorro -stage memoryerrors

Categories

(Socorro :: Infra, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED WORKSFORME

People

(Reporter: willkg, Unassigned)

Details

We're seeing memory errors from one or more of the webapp nodes on socorro -stage: https://sentry.prod.mozaws.net/operations/socorro-stage/issues/625718/ https://sentry.prod.mozaws.net/operations/socorro-stage/issues/625718/ I checked Datadog to see if we had any monitoring of memory on the webapp nodes, and I don't see any on either -stage or -prod. I'm guessing we don't have alerts, either. Since -stage is undergoing an Elasticsearch migration, I figured it'd worth writing this up in case the memory error is related (new ES library memory usage different? new leak? different caching? something else?). This bug covers looking into the memory situation and alleviating that. Plus we should probably have monitoring of memory on webapp nodes for -stage and -prod. If we don't have monitoring, we'd never know if changes we're making affect memory usage.
We talked about this earlier today and while we can't be certain, no one thinks this is related to the ES changes. Given that, I think we're just going to sit on this and if it persists and/or happens in -prod, then we'll work on it then. If no one has touched this in a couple of months, we should close it out.
Fly-by-comment here; I suspect that we have work to do in the webapp, that is related to MemoryError, to do with failing to store results of an ES query in memcached. Might be because we're now trying to store something much larger or weirder.
Assignee: nobody → peterbe
Pretty tempted to resolve this as fixed. It's extremely likely that the memory errors come from trying to memcache a SuperSearch query, in the webapp, when SuperSearch was allowed to simply contain too much. A month ago (or so?) we added a piece of code that prevents massive enormous values to go into the ES processed crash fields. That should hopefully prevent this. Also, I don't believe we've seen this error in a loooong time now.
Assignee: peterbe → nobody
I made a change that nixed huge keys, but I don't think we're truncating values, yet. We probably should be especially for fields marked as keyword fields. Regardless of that clarification, I think Peter's right on and that this is probably not an issue any more. I'm going to mark it WORKSFORME. We can always re-open it later if it comes up again.
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → WORKSFORME
You need to log in before you can comment on or make changes to this bug.