Closed
Bug 730038
Opened 13 years ago
Closed 9 years ago
[TRACKER] Need example queries for data to be used in nightly reports
Categories
(Socorro :: Data request, task)
Socorro
Data request
Tracking
(Not tracked)
RESOLVED
WONTFIX
People
(Reporter: dre, Unassigned)
References
Details
Attachments
(1 file)
89.44 KB,
application/pdf
|
Details |
- Nightly Reports / Rapid Release reports:
We need to get crashdata numbers by product (Firefox, Fennec, Thunderbird), by version and by channel (nightly, aurora, beta, release, esr) so we can show "Crashes/100 users" for d, d/d-7, d/d-28 and 3 month trend.
We also need those values broken down by OS (Windows, OS X, Linux), as we can see on the "Stability" section of Nightly Report
Reporter | ||
Comment 1•13 years ago
|
||
One additional note:
in our previous system, we were getting the total number of crashes. Since these queries will return the throttled number of crashes for release, it would be good if the query could take that throttling into account and deliver the adjusted number. Otherwise, we'll need to also have a query to run to get the throttling number and perform that computation on our end.
Because we will be displaying data for several dates either as a comparison or as a trend, I am wondering whether it would be better to get a query we run once per day and use it to populate a datawarehouse on our end, or if we should just perform the queries directly on your database daily.
Comment 2•13 years ago
|
||
Comment 4•13 years ago
|
||
Daniel, Paula,
We have some matviews which sort of correspond to what you want, but I think you should draw from the base fact and dimension tables so that changes I make in the future to the matviews won't break your stuff.
The main dimension table is reports_clean (not "reports", which contains raw uncleaned data). The table product_versions contains details about products and versions. product_version_channels contains the canonical throttling info. Thus:
SELECT product_versions.product_name,
product_versions.version_string,
reports_clean.os_name,
product_versions.build_type AS release_channel,
reports_clean.date_processed::date as report_date,
round(count(*) / product_release_channels.throttle) as adjusted_crash_count
FROM reports_clean
join product_versions USING (product_version_id)
join product_release_channels ON
product_versions.product_name = product_release_channels.product_name
AND product_versions.build_type = product_release_channels.release_channel
WHERE utc_day_is(date_processed, '2012-02-15')
GROUP BY product_versions.product_name, version_string, os_name,
build_type, date_processed::date, throttle;
Since this runs in around 10s for 1 day of data, I see no reason why you wouldn't simply pull it from Postgres.
Comment 5•13 years ago
|
||
Sorry, I meant the main *fact* table is reports_clean.
![]() |
||
Comment 6•13 years ago
|
||
BTW, you probably want the same number like is being used for the Socorro front page graph or the /daily pages by default, i.e. one that only counts one half of hang pairs.
Comment 7•13 years ago
|
||
Sorry but I need some help here. I'm getting weird results...
I tried the above query (added "and product_versions.product_name = 'Firefox' ")
and all I got was this:
http://paula.pastebin.mozilla.org/1491472
Why am I getting all those "<Other>" values?
Also, when I try:
select distinct product_name from product_versions;
I got 5 rows, all with "<Other>" value.
I'm using the driver: postgresql-9.1-901.jdbc4.jar
Comment 8•13 years ago
|
||
Paula,
Your pastebin expired, I can no longer access it.
Comment 9•13 years ago
|
||
Paula,
Might I suggest that we schedule a time to do this live/interactively?
Find me on #breakpad.
Comment 10•13 years ago
|
||
The problem is that Paula is using SQuirrelSQL, which is not fully PostgreSQL-compatible. I've advised switching to pgAdmin (www.pgadmin.org).
Comment 11•13 years ago
|
||
I'm making this bug a tracker and spawning new bugs off of it for database changes which need to be deployed.
Summary: Need example queries for data to be used in nightly reports → [TRACKER] Need example queries for data to be used in nightly reports
Updated•13 years ago
|
Severity: major → normal
Updated•13 years ago
|
Updated•13 years ago
|
Assignee: josh → sdeckelmann
Updated•13 years ago
|
Updated•12 years ago
|
Status: NEW → ASSIGNED
Updated•9 years ago
|
Assignee: sdeckelmann → nobody
Status: ASSIGNED → NEW
Comment 12•9 years ago
|
||
This particular kind of query is better sourced from the crash-ping and we're not interested in implementing it in crash-stats
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → WONTFIX
You need to log in
before you can comment on or make changes to this bug.
Description
•