Closed Bug 1273081 Opened 8 years ago Closed 8 years ago

[mozillians.org][prod][celery] Database gave error: OperationalError(2006, 'MySQL server has gone away')

Categories

(Infrastructure & Operations Graveyard :: WebOps: Engagement, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: nemo-yiannis, Unassigned)

Details

(Whiteboard: [kanban:https://webops.kanbanize.com/ctrl_board/2/3127])

We are getting these errors triggered by celery:

Database gave error: OperationalError(2006, 'MySQL server has gone away')
Database error while sync: OperationalError(2006, 'MySQL server has gone away')

It looks like this was triggered by some DB work during the weekend. Possible relevant bug from IRC discussion (which I don't have access) is 1264322.

We fixed this issue on stage by forcing a chief push that restarts celery.
Can you restart celery on prod too?
Severity: normal → blocker
Whiteboard: [kanban:https://webops.kanbanize.com/ctrl_board/2/2983]
Per request in #moc ran:
  /etc/init.d/celeryd-mozillians-prod restart
  /etc/init.d/celeryd-mozillians-prod-beat restart

on python[1-4].webapp.phx1.mozilla.com

Presumably this still needs to be looked into as to why it failed in the first place from a db failover, leaving bug open.
Severity: blocker → normal
Also ran by request:
  /etc/init.d/celeryd-mozillians-dev restart
on python1.dev.webapp.phx1
Things look OK with mozillians.org celery services. Errors stopped triggering. Let's leave this one open to investigate what went wrong.
Sorry, we're pretty swamped at the moment...let's revisit this if it happens again.
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → FIXED
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
We are getting the same errors. Can you restart the celery services in all envs as described in comments #2 and #3.
Whiteboard: [kanban:https://webops.kanbanize.com/ctrl_board/2/2983] → [kanban:https://webops.kanbanize.com/ctrl_board/2/3127]
Severity: normal → critical
lowering before it ping me, on it.
Severity: critical → normal
Per request in #moc ran:
  /etc/init.d/celeryd-mozillians-prod restart
  /etc/init.d/celeryd-mozillians-prod-beat restart

on python[1-4].webapp.phx1.mozilla.com

ran :
/etc/init.d/celeryd-mozillians-stage restart
/etc/init.d/celeryd-mozillians-stage-beat restart
on python[12].stage.webapp.phx1.mozilla.com
Did dev also as per comment #3
Awesome! I'll keep an eye on it to check for any errors. If everything is OK i will close the bug. Thanks for the fast response!
Status: REOPENED → RESOLVED
Closed: 8 years ago8 years ago
Resolution: --- → FIXED
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in before you can comment on or make changes to this bug.