Closed Bug 1378435 Opened 7 years ago Closed 4 years ago

Evaluate spark implementation of MacroBase on `crash_rates` dataset

Categories

(Data Platform and Tools :: Monitoring & Alerting, enhancement, P3)

x86
macOS
enhancement
Points:
5

Tracking

(Not tracked)

RESOLVED WONTFIX

People

(Reporter: amiyaguchi, Assigned: amiyaguchi)

References

Details

Attachments

(1 obsolete file)

Crash rates are a good first use case of MacroBase on real telemetry data.[1] They are a well defined measure used in the wild and useful for identifying anomalous sub-populations. Crashes are often associated with particular build, operating systems, or hardware configuration.

Crash rates are defined as the total crashes per thousand usage hours. Clients aggregate measurements over the course of a session to avoid constant pinging to the telemetry ingestion endpoint. Thus, it is important to normalize by usage hours to take advantage of the classification and explanation pipeline to avoid individual client skew.

There are e10s experiments that can be used to verify the classification and explanation pipeline across experimental and control groups. The crash rates dashboard also provides a view to aid an intuition about the data.

The report will cover data cleaning, ETL, exploration, evaluation of the classification and explaination pipeline, and an overview of what this might look like in the context of Spark Streaming.

[1] https://chutten.github.io/telemetry-dashboard/crashes/
Assignee: nobody → amiyaguchi
Blocks: 1365077
Status: NEW → ASSIGNED
Priority: -- → P1
Points: --- → 2
Summary: Add notebook anomaly detection and explanation of crash rates to RTMO → Add notebook for anomaly detection and explaination of crash rates to RTMO
Attached file [WIP] MacroBase and Crash Rates (obsolete) —
This is a work in progress of the report that will be submitted to RTMO. I am trying to implement the smallest usable subset of the data and MacroBase pipeline. Batched explanations consistently kill the spark stage running on a 3x c3.4xlarge cluster.
Comment on attachment 8883630 [details]
[WIP] MacroBase and Crash Rates

URL to WIP can be found at the following gist.

https://gist.github.com/acmiyaguchi/30a4aafef6e2c4061767b8da90b6eba0
Attachment #8883630 - Attachment is obsolete: true
Attachment #8883630 - Attachment mime type: text/plain → text/url
Depends on: 1379806
Priority: P1 → P2
Bug 1379806 blocks the completion of this notebook since no meaningful comparison can be made without a reference.
Points: 2 → 3
Summary: Add notebook for anomaly detection and explaination of crash rates to RTMO → Evaluate spark implementation of MacroBase on `crash_rates` dataset
Points: 3 → 5
Depends on: 1379823, 1379824, 1379825
Priority: P2 → P3
Status: ASSIGNED → RESOLVED
Closed: 4 years ago
Resolution: --- → WONTFIX
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: