Closed Bug 1643663 Opened 5 years ago Closed 5 years ago

Backfill stats data on Elasticsearch 6 cluster

Categories

(Cloud Services :: Operations: AMO, task)

task

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: mat, Assigned: wezhou)

Details

Now that we've upgrade elasticsearch cluster to 6.8.x, we should backfill missing stats data - we couldn't do that in one go during the push because there was too much of it.

The command to use is ./manage.py index_stats --date YYYY-MM-DD:YYYY-MM-DD - the date range is inclusive. I suggest doing several batches of a few months (maybe 6 ?) working our way backwards from the the most recent date we have stats for, going further in the past each time.

Jorge, Is there a good cutoff date we could pick to avoid going until the beginning of time ? Or do we want to truly expose everything, including from the legacy XUL extensions from 10 years ago ?

Flags: needinfo?(jorge)

If we can backfill the last 5 years, that should cover all of the WebExtensions history, which is more than enough. At a minimum we should backfill the last year, but more is better.

Flags: needinfo?(jorge)

I've finished back filling stats data since 2015-01-01.

Please check and let me know if there are issues or questions.

I looked at some old data and it appeared to be good.

Haven't heard any complaints, so mark it as resolved.

Status: NEW → RESOLVED
Closed: 5 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.