Closed Bug 1160257 Opened 10 years ago Closed 10 years ago

EsRejectedExecutionException when bulk indexing on -stage

Categories

(support.mozilla.org :: General, defect, P1)

defect

Tracking

(Not tracked)

RESOLVED WORKSFORME

People

(Reporter: willkg, Assigned: rrosario)

Details

(Whiteboard: u=dev c=search p=1 s=2015.8)

Ever since we updated the ES PHX dev cluster to 1.2.4 (which SUMO -dev and SUMO -stage use), we've been getting this error when bulk indexing: https://errormill.mozilla.org/support/sumo-stage/group/175654/ (('48 document(s) failed to index.', [{u'index': {u'status': 503, u'_type': u'users_profile', u'_id': u'294066', u'error': u'EsRejectedExecutionException[rejected execution (queue capacity 50) on org.elasticsearch.action.support.replication.TransportShardReplicationOperationAction$AsyncShardOperationAction$1@528ec562]',....) This bug covers figuring out whether that's still happening and fixing it.
Also, we should do this soon since if we're still hosed, then this needs to block bug #1160189.
This is important. Tossing it in the current sprint as a P1.
Priority: -- → P1
Whiteboard: u=dev c=search p= s= → u=dev c=search p= s=2015.8
Ricky kicked off a reindexing on -stage. It took 2000 years to complete, but a future descendent of mine had the courtesy of hopping in a time travel device and leaving me a post-it note on my monitor that said, "IT WORKED." I checked Sentry and didn't see any errors. Seems like whatever was causing this problem went away or something.
Making this 1 point since we spent time talking about it, the ES upgrade for prod, finding all the ES related bugs, re-figuring out the current status of things and then testing this situation out with an indexing run. Everything seems fine now, so I'm closing this out as WORKSFORME.
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → WORKSFORME
Whiteboard: u=dev c=search p= s=2015.8 → u=dev c=search p=1 s=2015.8
Assignee: nobody → rrosario
You need to log in before you can comment on or make changes to this bug.