Write a one-off jupyter notebook to aggregate RAPPOR experiment data

RESOLVED FIXED

Status

Data Platform and Tools
General
P1
normal
RESOLVED FIXED
a year ago
10 months ago

People

(Reporter: Dexter, Assigned: Florian Hartmann)

Tracking

(Depends on: 1 bug, Blocks: 1 bug)

Details

(Whiteboard: [measurement:client:tracking])

(Reporter)

Description

a year ago
Once the RAPPOR SHIELD study is deployed (bug 1386566), we will need to write a one-off Jupyter notebook that reads the generated data and produces an aggregate view of it. At the very least, the notebook should:

- perform the aggregation;
- make the data available in a private S3 location;
- make the aggregated data easy to navigate and understand.
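
For illustration, the aggregation step could be sketched as below. This is only a minimal sketch, not the code from the notebook: it assumes a single cohort, reports available as rows of 0/1 bits, and the standard RAPPOR parameters f, p and q. The function names `count_bits` and `estimate_true_counts` are hypothetical. Decoding the per-bit estimates back into candidate strings is not covered.

```python
import numpy as np


def count_bits(reports):
    """Sum the reported bits per position.

    `reports` is a (num_reports, num_bits) array of 0/1 values
    (an assumed input format, chosen for this sketch).
    """
    reports = np.asarray(reports, dtype=float)
    return reports.sum(axis=0), reports.shape[0]


def estimate_true_counts(observed_counts, num_reports, f, p, q):
    """Estimate how many clients truly had each bit set.

    f: probability used by the permanent randomized response
    p: probability of reporting 1 when the memoized bit is 0
    q: probability of reporting 1 when the memoized bit is 1
    """
    observed_counts = np.asarray(observed_counts, dtype=float)
    # Probability that a reported bit is 1 given the true bit is 0 / 1.
    p_star = p + 0.5 * f * (q - p)
    q_star = q - 0.5 * f * (q - p)
    # Invert E[observed] = t * q_star + (N - t) * p_star for t.
    return (observed_counts - p_star * num_reports) / (q_star - p_star)
```

The estimates are unbiased but noisy, so individual values can come out negative or larger than the number of reports for small populations.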
(Reporter)

Updated

a year ago
Blocks: 1379180
Whiteboard: [measurement:client:tracking]
(Reporter)

Updated

11 months ago
Assignee: nobody → fhartmann
Priority: -- → P1
(Assignee)

Comment 1

11 months ago
We have developed two Jupyter notebooks: one for prototyping [1] and one meant to be used in production [2].
The differences are as follows:

- The prototyping notebook contains all components that are potentially interesting for use with RAPPOR. This means it includes a lot of optional code. For some problems it also contains competing solutions.
- The prototyping notebook works with the datasets generated by Google's repository [3, 4].
- In general, the prototyping notebook contains many more explanations of how things work.
- The production notebook only contains the components that worked best for our use case. This makes it a lot smaller and more straightforward to use.
- The prototyping notebook can optionally generate datasets using Spark. Alternatively, the user has to import their own dataset into the specified format.

[1] https://github.com/florian/rappor-server/blob/master/RAPPOR-Prototyping.ipynb
[2] https://github.com/florian/rappor-server/blob/master/RAPPOR-Production.ipynb
[3] https://github.com/Alexrs95/rappor
[4] https://github.com/google/rappor
(Assignee)

Comment 2

10 months ago
I transferred the repository to Alessio's GitHub account. It can now be found here: https://github.com/Dexterp37/rappor-server
We'll move the repository to the Mozilla organization later on.
Status: NEW → RESOLVED
Last Resolved: 10 months ago
Resolution: --- → FIXED