Open Bug 1925614 Opened 15 days ago Updated 3 days ago

Create/update panels in Grafana to monitor Elasticsearch 8 cluster health

Categories

(Socorro :: General, task, P2)

Tracking

(Not tracked)

ASSIGNED

People

(Reporter: bdanforth, Assigned: bdanforth)

References

(Depends on 1 open bug, Blocks 1 open bug)

Details

Panels should be created/updated in stage and prod environments.

One major change as a result of upgrading ES is that we're moving from a self-hosted to a hosted cluster in Elastic Cloud (see Bug 1925594). While we will likely get a health dashboard out of the box from Elastic Cloud (and it will serve as a good reference), we ultimately want all our service health panels in our Grafana dashboard.

This may require adding a new Grafana data source, though we can check to see if other teams at Mozilla that host an ES cluster in Elastic Cloud have already done this (e.g. Fakespot).

Depends on: 1925594
Priority: -- → P2

Some things we can track:

  • Things we're currently tracking in our existing, self-hosted Elasticsearch cluster in Grafana, which may or may not include:
  • Disk usage, search latency, and indexing speed
    • This is helpful feedback for adjusting the size, zone_count and other attributes of the hot data tier of the cluster.
You need to log in before you can comment on or make changes to this bug.