Closed Bug 1255738 Opened 9 years ago Closed 8 years ago

Add select method to Dataset API to extract a subset of fields from pings.

Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect, P2)

defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: rvitillo, Assigned: mdoglio)

References

Details

User Story

It would be convenient to be able to extract fields from a Dataset through JSONPath expressions instead of doing it "manually" or through get_pings_properties.

Attachments

(1 file)

Blocks: 1255748
No longer blocks: 1251580
Points: --- → 2
Priority: -- → P3
Summary: Support JSONPath in moztelemetry's get_pings_properties → Support JSON query language in moztelemetry's get_pings_properties
Priority: P3 → P2
Summary: Support JSON query language in moztelemetry's get_pings_properties → Add select method to Dataset API to extract a subset of fields from pings.
User Story: (updated)
User Story: (updated)
Flags: needinfo?(kparlante)
Depends on: 1331933
Assigning to Sunah to take when she has an opportunity
Assignee: nobody → ssuh
Flags: needinfo?(kparlante)
Blocks: 1330120
:sunahsuh I took a stab at this, I hope you don't mind :)
Assignee: ssuh → mdoglio
Going to investigate why Mauro's patch isn't working in production.
Assignee: mdoglio → wlachance
(In reply to William Lachance (:wlach) (use needinfo!) from comment #6) > Going to investigate why Mauro's patch isn't working in production. It's these instructions that are broken: https://wiki.mozilla.org/Telemetry/Custom_analysis_with_spark#How_do_I_load_an_external_library_into_the_cluster.3F If I hand install it by copying the files into /mnt/anaconda2/lib/python2.7/site-packages/moztelemetry/ everything works great (and seems like a nice improvement). I'll look into filing a bug and figuring out why that doesn't work tomorrow, if someone else doesn't get to it first.
(In reply to William Lachance (:wlach) (use needinfo!) from comment #7) > (In reply to William Lachance (:wlach) (use needinfo!) from comment #6) > > Going to investigate why Mauro's patch isn't working in production. > > It's these instructions that are broken: > https://wiki.mozilla.org/Telemetry/ > Custom_analysis_with_spark#How_do_I_load_an_external_library_into_the_cluster > .3F Actually, I was wrong: these steps work fine too. I think I just made some kind of error on my side that made loading the egg not work. Apparently this feature works for mdoglio too. I think we're clear to land?
Assignee: wlachance → mdoglio
Mauro, it looks like the docs were not updated with this method? http://python-moztelemetry.readthedocs.io/en/latest/userguide.html#module-moztelemetry.dataset
Flags: needinfo?(mdoglio)
(In reply to Georg Fritzsche [:gfritzsche] from comment #9) > Mauro, it looks like the docs were not updated with this method? > > http://python-moztelemetry.readthedocs.io/en/latest/userguide.html#module- > moztelemetry.dataset It does appear under the "stable" docs: http://python-moztelemetry.readthedocs.io/en/stable/userguide.html#moztelemetry.dataset.Dataset.select I think the problem is that 'latest' points to a weird tag: http://readthedocs.org/projects/python-moztelemetry/versions/ ... we should probably reset that to master or something. Roberto, can you add me to the list of owners for this repository so I can fix this?
Flags: needinfo?(mdoglio) → needinfo?(rvitillo)
:wlach I added you
Flags: needinfo?(rvitillo)
I believe the doc issue is fixed. Mauro, shall we resolve this?
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → FIXED
(In reply to William Lachance (:wlach) (use needinfo!) from comment #12) > I believe the doc issue is fixed. Mauro, shall we resolve this? Oops, didn't mean to actually resolve. Oh well, someone can always reopen.
Product: Cloud Services → Cloud Services Graveyard
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: