Create UNIONed views for Glean app-datasets across all channels
Categories
(Data Platform and Tools Graveyard :: Glean Platform, task, P3)
Tracking
(Not tracked)
People
(Reporter: frank, Assigned: ascholtz)
References
Details
(Whiteboard: [dataplatform])
Attachments
(1 file)
In bug 1708166 (and friends), we created per-app datasets that contain derived datasets across all channels. We're looking to create ping-level views for just release there as well.
What we really want is to just be able to query the app-dataset; to do so, we need to configure the UNION views across all app channels. For example, we want the metrics ping for Fenix to be:
SELECT "release" AS channel, *
FROM mozdata.org_mozilla_firefox.metrics
UNION ALL
SELECT "nightly" AS channel, *
FROM mozdata.org_mozilla_fenix.metrics
The problem is the * select doesn't work, since different channels can have different dependencies. We need to use the superset of all fields from all tables, and order them appropriately for the UNION to work.
Comment 1•4 years ago
|
||
We need to use the superset of all fields from all tables, and order them appropriately for the UNION to work
Or, we use the superset of all fields, inserting NULL where a field is not defined for a given channel.
Updated•4 years ago
|
Comment 2•4 years ago
|
||
I have been working on adding app datasets to make Looker event explores work (https://github.com/mozilla/lookml-generator/issues/156). These app datasets currently only contain the release channel information, but making a note of this here for context.
| Assignee | ||
Updated•3 years ago
|
| Assignee | ||
Comment 3•3 years ago
|
||
| Assignee | ||
Comment 4•3 years ago
|
||
Views deployed successfully. All done
| Assignee | ||
Updated•2 years ago
|
Updated•10 months ago
|
Description
•