Closed Bug 1617630 Opened 5 years ago Closed 5 years ago

Backfill OOMAllocationSize and MozCrashReason in `telemetry.crash`

Categories

(Data Platform and Tools :: General, defect)

Type: defect, Priority: Not set, Severity: normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: wbeard, Assigned: wlach)

References

Details

OOMAllocationSize and MozCrashReason are available as columns in telemetry.crash starting 2019-11-14; before that date they are only accessible through additional_properties. For a project I'm working on, it would be useful to have these fields backfilled to at least June, if not earlier.

According to bug #1595904, several other related fields are missing as well, as can be seen in this query.

See Also: → 1595904

This could likely be handled via query-based backfill. The query would look something like:

SELECT
  * REPLACE ((
    SELECT AS STRUCT
      payload.* REPLACE ((
        SELECT AS STRUCT
          payload.metadata.* REPLACE (
            -- Keep the column value when it is already populated; otherwise
            -- fall back to the value captured in additional_properties.
            COALESCE(
              payload.metadata.oom_allocation_size,
              JSON_EXTRACT_SCALAR(additional_properties, '$.payload.metadata.OOMAllocationSize')
            ) AS oom_allocation_size
          )
      ) AS metadata)
  ) AS payload)
FROM
  `moz-fx-data-shared-prod.telemetry_stable.crash_v4`
WHERE
  DATE(submission_timestamp) = @submission_date
We'd likely want to process one day partition at a time, so that each run of the query reads a single partition and overwrites that same partition with its output.
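As a rough illustration of the per-partition approach, here is a minimal Python sketch that builds one `bq query` invocation per day, writing to the matching partition via a `$YYYYMMDD` partition decorator and passing the day as the `@submission_date` parameter. The destination table name and date range used below are hypothetical placeholders, not the ones used for the actual backfill, and the sketch only builds the argument lists rather than running anything.

```python
from datetime import date, timedelta


def day_partitions(start, end):
    """Yield each date from start to end, inclusive."""
    current = start
    while current <= end:
        yield current
        current += timedelta(days=1)


def bq_command(day, dest_table, query):
    """Build the argument list for a bq call that rewrites one day partition.

    dest_table is a hypothetical placeholder such as "project:dataset.table";
    the "$YYYYMMDD" suffix targets the partition for that day.
    """
    return [
        "bq", "query",
        "--use_legacy_sql=false",
        "--replace",  # overwrite the destination partition
        f"--destination_table={dest_table}${day:%Y%m%d}",
        f"--parameter=submission_date:DATE:{day.isoformat()}",
        query,
    ]


if __name__ == "__main__":
    # Placeholder query text and table; substitute the real backfill query.
    for day in day_partitions(date(2019, 6, 1), date(2019, 6, 3)):
        print(" ".join(bq_command(day, "my-project:my_dataset.crash_v4", "SELECT ...")))
```

Each command could then be run with `subprocess.run(cmd, check=True)`. Writing to a temporary table first, as was eventually done here, leaves room to compare counts before touching the production partitions.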

Assignee: nobody → wlachance

Backfilled to a temporary table in the backfill-1 project as described here: https://github.com/mozilla/bigquery-backfill/pull/5

@klukas -- can you copy these over to the production tables when you have a chance? I did some quick checking and the counts seem to match.

Flags: needinfo?(jklukas)

Copy is done. I left full details of the operations as a comment in the PR.

Flags: needinfo?(jklukas)
Status: NEW → RESOLVED
Closed: 5 years ago
Resolution: --- → FIXED
Component: Datasets: General → General