Closed Bug 1129182 Opened 9 years ago Closed 9 years ago

Common monitors/alerts for all sources ingested by pipeline (minimum set)

Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect, P1)

x86
macOS
defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: kparlante, Assigned: trink)

References

Details

quoting :mreid... Critical for trusting data that comes out of the pipeline.

We should have a ​minimum set of ​monitors for every source coming in to the pipeline that watches the basics:

- Submission rates
- gzip errors (for data that comes in compressed)
- json parse errors (for json data)
- decoder errors (for any custom code that runs on incoming records)

Monitoring specific particular data sources should be logged as separate bugs.
Assignee: nobody → kparlante
Status: NEW → ASSIGNED
Depends on: 1135252
Group: mozilla-employee-confidential
Priority: -- → P1
Assignee: kparlante → nobody
Group: mozilla-employee-confidential
Assignee: nobody → kparlante
Depends on: 1163818
We've defined the minimum set of common monitors as:
- TelemetryStats drops to zero (cep)
- TelemetryOutput ProcessFileFailures (dwl) (any increase)
- puppet configured plugin terminations
- dwl generic monitor for all plugins

Trink is implementing this so reassigning to him.
Assignee: kparlante → mtrinkala
Summary: Common monitors for all sources ingested by pipeline (minimum set) → Common monitors/alerts for all sources ingested by pipeline (minimum set)
The PRs have been reviewed and merged.
Status: ASSIGNED → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
Product: Cloud Services → Cloud Services Graveyard
You need to log in before you can comment on or make changes to this bug.