Closed Bug 1350376 Opened 7 years ago Closed 7 years ago

Write a run book to help address Etherpad downtime

Categories

(Infrastructure & Operations Graveyard :: WebOps: Other, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: danielh, Assigned: danielh)

Details

(Whiteboard: [kanban:https://webops.kanbanize.com/ctrl_board/2/4484])

Etherpad has a tendency to die periodically. The most recent occurrence was in bug 1350282. It looks like our general response is to run:

>supervisorctl restart etherpad

This seems to fix the issue. I have also been gathering logs to help isolate the underlying issue and report it to the etherpad-lite project on GitHub. Unfortunately I have not been able to build a good case for that yet.

It would help to write a run book (which we can share with the MOC) that describes this temporary fix. Let's do that.
Whiteboard: [kanban:https://webops.kanbanize.com/ctrl_board/2/4484]
Doc is available here:
https://mana.mozilla.org/wiki/display/WebOps/etherpad.mozilla.org
Assignee: server-ops-webops → dhartnell
Marking this as resolved since the document is complete.
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in before you can comment on or make changes to this bug.