Closed Bug 1861260 Opened 2 years ago Closed 2 years ago

Update the Push extension to add additional status check code in order to restart the pod if an error is occurring

Categories

(bugzilla.mozilla.org :: Extensions, enhancement)

Production
enhancement

RESOLVED FIXED

People

(Reporter: dkl, Assigned: dkl)

Details

Attachments

(1 file)

When issues come up, such as the pod losing its connection to the MySQL database (which happened recently), we need a way for k8s to run a liveness check and restart the pod. We use Daemon::Generic, which lets us define a gd_check() method so that the daemon can perform status checks from the command line, for example extensions/Push/bin/bugzilla-pushd.pl check, which k8s can invoke. Currently, check only reports whether a PID is assigned, so it would not have caught the MySQL issue we encountered. By defining gd_check() to connect to the database and get a count of queued jobs, we can get a better idea of the status of the pod. We already do something similar in the JobQueue code that delivers emails, so it would not be too much work to add it to the Push extension as well.

https://github.com/mozilla-bteam/bmo/blob/master/Bugzilla/JobQueue/Runner.pm#L177-L188
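
A minimal sketch of the idea described above, not the code from the attached patch. It assumes the check can be reduced to "run a query that counts queued jobs and turn any failure into a non-zero exit status", which a k8s exec liveness probe treats as a failed check. The helper name queue_check_status, the table name push, and the wiring shown in the trailing comment are assumptions made for illustration; stand-in counters are used so the block runs without a database.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Bugzilla or Daemon::Generic): runs a
# caller-supplied job counter and maps the outcome to a process exit status.
# 0 means healthy, 1 means the count could not be obtained.
sub queue_check_status {
    my ($count_jobs) = @_;

    # eval traps exceptions such as a lost database connection.
    my $count = eval { $count_jobs->() };
    if (!defined $count) {
        my $error = $@ || 'job count was undefined';
        chomp $error;
        warn "check failed: $error\n";
        return 1;
    }

    print "OK: $count job(s) in queue\n";
    return 0;
}

# Demonstration with stand-in counters instead of a real database handle.
my $healthy   = queue_check_status(sub { return 3 });
my $unhealthy = queue_check_status(sub { die "lost connection to MySQL\n" });

# Inside the daemon's Daemon::Generic subclass, the helper might be wired up
# roughly like this (assumed table name "push"; assumes the database layer
# raises an exception when the connection is gone):
#
#   sub gd_check {
#       my ($self) = @_;
#       exit queue_check_status(sub {
#           my ($n) = Bugzilla->dbh->selectrow_array('SELECT COUNT(*) FROM push');
#           return $n;
#       });
#   }
```

Passing the counter in as a code reference keeps the status logic separate from the database access, so the failure path can be exercised without taking MySQL down.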

Status: ASSIGNED → RESOLVED
Closed: 2 years ago
Resolution: --- → FIXED
