Closed Bug 1765414 Opened 3 years ago Closed 3 years ago

Some telemetry uninstall pings missing `type` column

Categories

(Data Platform and Tools :: General, defect)

defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: srose, Assigned: akomar)

Details

(Whiteboard: [dataquality])

Attachments

(1 file)

Weekly platform health check meeting notes indicate this has been a problem going back to at least 2021-05-24.

It appears high-level type property definition was missing inside the schema template for uninstall ping here:
https://github.com/mozilla-services/mozilla-pipeline-schemas/blob/main/templates/telemetry/uninstall/uninstall.4.schema.json#L5

We've worked with Arkadiusz on implementing the change to the template and regenerating the telemetry.uninstall ping schema:
https://github.com/mozilla-services/mozilla-pipeline-schemas/blob/uninstall_ping_missing_type/templates/telemetry/uninstall/uninstall.4.schema.json

We've also noticed that the test definition for this particular schema contained top-level type field, however, the schema tests did not fail to indicate schema misconfiguration. This is something we also attempted to fix and will be part of the same PR.

Change made: https://github.com/mozilla-services/mozilla-pipeline-schemas/commit/2ca3070e03ecc563aff097aef59f4db52d142afb
Test for the schema: https://github.com/mozilla-services/mozilla-pipeline-schemas/blob/main/validation/telemetry/uninstall.4.sample.pass.json#L2

The test now seems to behave the way we expect it to:
https://app.circleci.com/pipelines/github/mozilla-services/mozilla-pipeline-schemas/2848/workflows/3247a6ef-decf-4a4d-9369-88fcdf839ab1/jobs/5477

Assignee: nobody → akomarzewski

PR merged, we should check this graph again on Monday to see if the count of this error starts to reduce. If that's the case this bug can be marked as fixed.

I think next schema deploy will run on Monday so the effect of this might not yet be visible during the next health check meeting.

Status: NEW → RESOLVED
Closed: 3 years ago
Resolution: --- → FIXED

Should the pings that ended up in the error table because of this issue be backfilled?

(In reply to Sean Rose from comment #6)

Should the pings that ended up in the error table because of this issue be backfilled?

These pings did not go to the error table. Because json schema allowed additional fields and did not contain type they passed schema validation and type value ended up in additional_properties (see https://docs.telemetry.mozilla.org/concepts/pipeline/schemas.html?highlight=additional_prope#decoding).

I have confirmed that after the schema deploy today telemetry_live.uninstall_v4 has records with type column populated.

:party-parrot:

Whiteboard: [data-quality] → [dataquality]
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: