Closed Bug 1388025 Opened 8 years ago Closed 6 years ago

Get rid of crash_aggregates in favor of error_aggregates

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: mdoglio, Assigned: wlach)

References

Details

(Whiteboard: [DataPlatform])

Attachments

(1 file)

Link to GitHub pull-request: https://github.com/mozilla/telemetry-batch-view/pull/530 6 years ago GitHub Bugzilla PR Linker 56 bytes, text/x-github-pull-request		Details \| Review

Mauro Doglio [:mdoglio]

Reporter

Description

•

8 years ago

The Mission Control dataset (error_aggregates) schema is a superset of what crash_aggregates offers. Also, its latency is <1h vs up to 24h. We should get rid of crash_aggregates and save both machine and human time.

David Durst [:ddurst]

Comment 1

•

8 years ago

Is that (error_aggregates) coming from main pings, or is it using crash pings? I'm wondering if we should focus on using error_aggregates for everything dealing with crashes or if we should be creating a new crash_aggregates based on crash pings.

Mauro Doglio [:mdoglio]

Reporter

Comment 2

•

8 years ago

It's using both, like crash_aggregates does. I like the idea of having a crash ping-only aggregates dataset. I would be happy to work on that if it becomes a priority.

David Durst [:ddurst]

Comment 3

•

8 years ago

With 55 in release, we finally have the full capability to correlate crash reports with crash pings (if applicable). So if we could get this dataset up and running, we could start looking at ping-only data on all of release, even segmented by process type. I don't know who would determine the priority of this, but it would be very useful to get dashboards hooked up to this data as we lead up to 57.

Frank Bertsch [:frank]

Comment 4

•

8 years ago

(In reply to Mauro Doglio [:mdoglio] from comment #2) > It's using both, like crash_aggregates does. I like the idea of having a > crash ping-only aggregates dataset. I would be happy to work on that if it > becomes a priority. We could use direct-to-parquet for this dataset. It doesn't require parsing the entire payload, you can have it output a subset of fields to parquet, which in this case would probably just be that dimensional information (version, app, build, etc.).

Mauro Doglio [:mdoglio]

Reporter

Comment 5

•

8 years ago

It sounds like we can: 1- get rid of crash_aggregates 2- use crash_summary (which is already in parquet) to derive new datasets (like the one for crash reports correlations that :ddurst is talking about).

Chris H-C :chutten

Updated

•

8 years ago

Depends on: 1382236

Mark Reid [:mreid]

Updated

•

8 years ago

Priority: -- → P3

Firefox Bug Husbandry Bot

Comment 6

•

7 years ago

Moved, per https://bugzilla.mozilla.org/show_bug.cgi?id=1453996

Component: Datasets: Crash Aggregates → Datasets: General

Arkadiusz Komarzewski [:akomar]

Updated

•

7 years ago

Assignee: nobody → akomarzewski

Priority: P3 → P2

Whiteboard: [DataPlatform]

William Lachance (:wlach)

Assignee

Comment 7

•

6 years ago

I'll take this.

Assignee: akomarzewski → wlachance

William Lachance (:wlach)

Assignee

Comment 8

•

6 years ago

First step is going through redash to find queries still scheduled against this dataset. There were quite a few, but most were obviously out of date / not relevant any more. Except for some that were scheduled/run by hkirschner@mozilla.com and rtestard@mozilla.com, cdenizet@mozilla.com.

Romain, Harald, Calixte: can you let me know if it's ok to stop scheduling runs of these queries? If the data is still useful, can I help you migrate to error_aggregates?

https://docs.telemetry.mozilla.org/datasets/streaming/error_aggregates/reference.html

The schema is fairly similar so it shouldn't take long to update your queries if needed.

rtestard@mozilla.com:

Release_OK 32 VS 64-bit Crash Rates: https://sql.telemetry.mozilla.org/queries/14107/source
ESR 32 VS 64-bit Crash Rates: https://sql.telemetry.mozilla.org/queries/51957/source
Release 32 VS 64-bit Crash Rates: https://sql.telemetry.mozilla.org/queries/4655/source

hkirschner@mozilla.com:

cdenizet@mozilla.com:

Daily usage khours for Firefox release, beta, aurora and nightly: https://sql.telemetry.mozilla.org/queries/346/source

Flags: needinfo?(rtestard)

Flags: needinfo?(hkirschner)

Flags: needinfo?(cdenizet)

Romain Testard [:RT]

Comment 9

•

6 years ago

OK for my queries, the ones mentioned above won't run anymore.

Flags: needinfo?(rtestard)

Calixte Denizet (:calixte)

Comment 10

•

6 years ago

I just killed mine.

Flags: needinfo?(cdenizet)

:Harald Kirschner :digitarald

Comment 11

•

6 years ago

OK for mine, health.graphics doesn't use them anymore.

Flags: needinfo?(hkirschner)

William Lachance (:wlach)

Assignee

Comment 12

•

6 years ago

Thanks all! Harald: I took the liberty of unscheduling your queries for you, since you weren't using them anymore. :)

I will go ahead with the subsequent steps in the deprecation process.

William Lachance (:wlach)

Assignee

Updated

•

6 years ago

Type: enhancement → task

William Lachance (:wlach)

Assignee

Updated

•

6 years ago

Depends on: 1541148

William Lachance (:wlach)

Assignee

Comment 13

•

6 years ago

This dataset has been unscheduled from airflow and removed from hive metastore, next step per our deprecation policy is to wait a week and make sure nothing breaks.

GitHub Bugzilla PR Linker

Comment 14

•

6 years ago

Attached file Link to GitHub pull-request: https://github.com/mozilla/telemetry-batch-view/pull/530 — Details

William Lachance (:wlach)

Assignee

Comment 15

•

6 years ago

All done!

Status: NEW → RESOLVED

Closed: 6 years ago

Resolution: --- → FIXED

Nobody; OK to take it and work on it

Updated

•

3 years ago

Component: Datasets: General → General

You need to log in before you can comment on or make changes to this bug.