Closed Bug 1355884 Opened 7 years ago Closed 7 years ago

In the Socorro dataset on Telemetry, addons is an array of hard to parse strings

Categories

(Socorro :: General, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED DUPLICATE of bug 1250132

People

(Reporter: marco, Unassigned)

Details

> dataset = SQLContext(sc).read.load(['s3://telemetry-parquet/socorro_crash/v2/crash_date=20170411'], 'parquet')
> dataset = dataset.filter((dataset['product'] == 'Firefox') & (dataset['version'].isin(['52.0.2')))
> dataset.select('addons').take(1)

Returns:
> [Row(addons=[u'["{972ce4c6-7e08-4474-a285-3208198ce6fd} (default theme)","55.0a1"]', u'["client@anonymox.net","2.5.3.1"]', u'["activity-stream@mozilla.org","0.0.0"]', u'["aushelper@mozilla.org","2.0"]', u'["deployment-checker@mozilla.org","1.0"]', u'["e10srollout@mozilla.org","1.11"]', u'["firefox@getpocket.com","1.0.5"]', u'["flyweb@mozilla.org","1.0.0"]', u'["presentation@mozilla.org","1.0.0"]', u'["shield-recipe-client@mozilla.org","1.0.0"]', u'["webcompat-reporter@mozilla.org","1.0.0"]', u'["webcompat@mozilla.org","1.1"]', u'["formautofill@mozilla.org","1.0"]', u'["uBlock0@raymondhill.net","1.11.4"]'])]

Each entry in the array is something like this:
> ["{972ce4c6-7e08-4474-a285-3208198ce6fd} (default theme)","55.0a1"]

The documentation in the crash_report.json schema is wrong, as it currently says `This takes the form of \"addonid:version,[addonid:version...]\".`.

The field would be easier to use if it actually was an array of 'id:version' or an array of arrays or an array of objects:
['id1:version1','id2:version2',...]
[[ID1,VERSION1],[ID2,VERSION2],...]
[{'id': ID1, 'version': VERSION1}, {'id': ID2, 'version': VERSION2}, ...]
Summary: In the Socorro dataset on Telemetry, addons is an array of strings → In the Socorro dataset on Telemetry, addons is an array of hard to parse strings
This is a duplicate of bug 1250132. I'll be working on that soon!
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → DUPLICATE
You need to log in before you can comment on or make changes to this bug.