Closed Bug 1539683 Opened 6 years ago Closed 6 years ago

[Adhoc] Document Activity Stream telemetry in DTMO

Categories

(Data Science :: Documentation, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: tdsmith, Assigned: shong)

Details

Brief Description of the request (required):

DTMO should contain documentation on how to access Activity Stream telemetry.

Business purpose for this request (required):

  • Clarity for the data science team wrt available resources
  • Transparency to the Mozilla community about our collected data
  • Lower the activation energy for analysts touching AS data

Requested timelines for the request or how this fits into roadmaps or critical decisions (required):

¯\_(ツ)_/¯

Links to any assets (e.g Start of a PHD, BRD; any document that helps describe the project):

I haven't seen any resources about actually touching the collected data, though there's a Tiles data source in STMO, which I think is Redshift.

Name of Data Scientist (If Applicable):

n/a. Su seems to be working with this right now.

Ben points to https://dbc-caf9527b-e073.cloud.databricks.com/#notebook/37725/command/37742 cells 5-8 as some boilerplate for accessing the Tiles database from Spark (and says it's very slow).

Assignee: nobody → shong
Summary: Document Activity Stream telemetry in DTMO → [Adhoc] Document Activity Stream telemetry in DTMO

note: the purpose of this documentation is to point to:

  • HOW to access the telemetry
  • where the documentation for AS is

no claims about what it is and how it works (outside the scope)

Status: NEW → ASSIGNED

A pull request to DTMO has been filed here:

currently waiting review to merge

merged and updated

And we are live!
DTMO:datasets:other:activity-stream

Status: ASSIGNED → RESOLVED
Closed: 6 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.