Closed Bug 1079701 Opened 10 years ago Closed 10 years ago

Perform a |service supervisord restart| on production deploy

Categories

(Tree Management :: Treeherder, defect, P1)

Tracking

(Not tracked)

RESOLVED DUPLICATE of bug 1072437

People

(Reporter: emorley, Unassigned)

21:14 mdoglio the celery worker on production are only listening to the default queue
21:14 mdoglio that's one problem for sure
21:15 mdoglio fubar: can you please check that the celery worker on the admin node is currently using https://github.com/mozilla/treehe...ob/master/bin/run_celery_worker to start?
21:16 fubar command=/data/treeherder/src/treeherder.mozilla.org/treeherder-service/bin/run_celery_worker
21:17 mdoglio fubar: I would try with a service supervisord restart then

Mauro, should we make update.py do a |service supervisord restart| too then?
Flags: needinfo?(mdoglio)
(In reply to Ed Morley [:edmorley] from comment #0)
> Mauro, should we make update.py do a |service supervisord restart| too then?

That is to pick up future changes like bug 1076774 and bug 1076778 without requiring manual intervention.
Blocks: 1072437
Flags: needinfo?(mdoglio)
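A minimal sketch of what such a deploy step could look like, assuming update.py is a Python script that can shell out to the init system. The function name `restart_supervisord` and the injectable `run` parameter are hypothetical and exist only for illustration; the real update.py may be structured quite differently.

```python
import subprocess

# The command discussed in this bug, split into argv form.
RESTART_COMMAND = ["service", "supervisord", "restart"]


def restart_supervisord(run=subprocess.run):
    """Run the restart command and report whether it exited cleanly.

    `run` is injectable (hypothetical design choice) so the step can be
    exercised without touching a real supervisord.
    """
    result = run(RESTART_COMMAND, check=False)
    return result.returncode == 0
```

A deploy script could call `restart_supervisord()` after updating the source and abort the deploy if it returns False.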
Restarting supervisord was our first idea to restart the celery workers, but for some reason it was sometimes creating zombie processes. I still think it's a good idea to broadcast a restart signal using celery itself; I just need to understand why sometimes that doesn't work. In order to do that, I need better monitoring of the workers, and a good solution for that could be [1]. Maybe fubar can help us to deploy it on the admin node and make it accessible only under vpn?

[1] http://celery.readthedocs.org/en/latest/userguide/monitoring.html#flower-real-time-celery-web-monitor
Flags: needinfo?(klibby)
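The symptom from comment 0 (workers consuming only the default queue) is the kind of thing worker monitoring would surface. As a rough sketch, assuming a mapping shaped like the one Celery's `app.control.inspect().active_queues()` returns (worker name mapped to a list of queue dicts, each with a "name" key), a check could look like the following. The function name and the queue names in the usage note are hypothetical, not Treeherder's actual configuration.

```python
def uncovered_queues(active_queues, expected):
    """Return the expected queue names that no worker is consuming.

    active_queues: dict of worker name -> list of {"name": ...} dicts,
                   or None if no worker replied.
    expected:      iterable of queue names that should have a consumer.
    """
    listening = set()
    for queues in (active_queues or {}).values():
        # Collect every queue name any worker reports consuming.
        listening.update(q["name"] for q in queues)
    return sorted(set(expected) - listening)
```

For example, with a single worker reporting only `default`, `uncovered_queues({"worker1@host": [{"name": "default"}]}, ["default", "example_queue"])` reports `example_queue` as having no consumer.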
(In reply to Mauro Doglio [:mdoglio] from comment #2)
> Restarting supervisord was our first idea to restart the celery workers, but
> for some reason it was sometimes creating zombie processes. I still think
> it's a good idea to broadcast a restart signal using celery itself, I just
> need to understand why sometimes that doesn't work. In order to do that, I
> need better monitoring of the workers, a good solution for that could [1].
> Maybe fubar can help us to deploy it on the admin node and make it
> accessible only under vpn?
>
> [1]http://celery.readthedocs.org/en/latest/userguide/monitoring.html#flower-real-time-celery-web-monitor

Bug 1072437 is filed for this - let's move discussion over there :-)
No longer blocks: 1072437, 1072681
Status: NEW → RESOLVED
Closed: 10 years ago
Flags: needinfo?(klibby)
Resolution: --- → DUPLICATE
See Also: → 1093760