we should monitor that long living server processes to make sure they don't go away

NEW
Unassigned

Status

Taskcluster
General
2 years ago
4 months ago

People

(Reporter: jhford, Unassigned)

Tracking

Details

We have things like the AWS Provisioner which have long living processes that ought not to disappear.  We should use something like Deadman's Snitch to make sure that emails are generated when a one of these long living processes disappears.  Things like the s3-copy-proxy and the cloud-mirror should have these alerts set up.
What do we have where this is missing?
Flags: needinfo?(jhford)
(In reply to Jonas Finnemann Jensen (:jonasfj) from comment #1)
> What do we have where this is missing?

My understanding is that we can use SignalFX for this now?  I'm not sure if we're intending to continue using dead man's snitch or not.
Flags: needinfo?(jhford)

Comment 3

4 months ago
We have data within signalfx and alerts from deadman snitch.  John, is there more you wanted to setup with this bug?
Flags: needinfo?(jhford)
only one service that I'm aware of uses deadman's snitch.  The reason for this bug was to have something like deadman's snitch more widely deployed.
Flags: needinfo?(jhford)
You need to log in before you can comment on or make changes to this bug.