Closed
Bug 1378435
Opened 7 years ago
Closed 4 years ago
Evaluate spark implementation of MacroBase on `crash_rates` dataset
Categories
(Data Platform and Tools :: Monitoring & Alerting, enhancement, P3)
Tracking
(Not tracked)
RESOLVED
WONTFIX
People
(Reporter: amiyaguchi, Assigned: amiyaguchi)
References
Details
Attachments
(1 obsolete file)
Crash rates are a good first use case of MacroBase on real telemetry data.[1] They are a well defined measure used in the wild and useful for identifying anomalous sub-populations. Crashes are often associated with particular build, operating systems, or hardware configuration. Crash rates are defined as the total crashes per thousand usage hours. Clients aggregate measurements over the course of a session to avoid constant pinging to the telemetry ingestion endpoint. Thus, it is important to normalize by usage hours to take advantage of the classification and explanation pipeline to avoid individual client skew. There are e10s experiments that can be used to verify the classification and explanation pipeline across experimental and control groups. The crash rates dashboard also provides a view to aid an intuition about the data. The report will cover data cleaning, ETL, exploration, evaluation of the classification and explaination pipeline, and an overview of what this might look like in the context of Spark Streaming. [1] https://chutten.github.io/telemetry-dashboard/crashes/
Assignee | ||
Updated•7 years ago
|
Assignee | ||
Updated•7 years ago
|
Points: --- → 2
Assignee | ||
Updated•7 years ago
|
Summary: Add notebook anomaly detection and explanation of crash rates to RTMO → Add notebook for anomaly detection and explaination of crash rates to RTMO
Assignee | ||
Comment 1•7 years ago
|
||
This is a work in progress of the report that will be submitted to RTMO. I am trying to implement the smallest usable subset of the data and MacroBase pipeline. Batched explanations consistently kill the spark stage running on a 3x c3.4xlarge cluster.
Assignee | ||
Comment 2•7 years ago
|
||
Comment on attachment 8883630 [details] [WIP] MacroBase and Crash Rates URL to WIP can be found at the following gist. https://gist.github.com/acmiyaguchi/30a4aafef6e2c4061767b8da90b6eba0
Attachment #8883630 -
Attachment is obsolete: true
Attachment #8883630 -
Attachment mime type: text/plain → text/url
Assignee | ||
Comment 3•7 years ago
|
||
Bug 1379806 blocks the completion of this notebook since no meaningful comparison can be made without a reference.
Assignee | ||
Updated•7 years ago
|
Points: 2 → 3
Summary: Add notebook for anomaly detection and explaination of crash rates to RTMO → Evaluate spark implementation of MacroBase on `crash_rates` dataset
Assignee | ||
Updated•7 years ago
|
Updated•7 years ago
|
Priority: P2 → P3
Assignee | ||
Updated•4 years ago
|
Status: ASSIGNED → RESOLVED
Closed: 4 years ago
Resolution: --- → WONTFIX
You need to log in
before you can comment on or make changes to this bug.
Description
•