
Comment 3 • 8 years ago

Per IRL chat today, we'd like to revive this request.

Initially, we're looking to do some rapid prototyping on the type of data we send. So we're probably looking for a single endpoint that accepts N different message types with initially no schema validation. The data will almost certainly be JSON.

Once we have the client-side bits implemented and have confidence in the data we're sending, we can formalize a schema and "productionize" the ingestion. How that switchover works, I'm not sure. It probably involves 2 endpoints (e.g. "stage" vs "production") or some kind of routing key in the HTTP request. Not sure what options are available.

Mike Trinkala [:trink]

Updated • 8 years ago

Component: Metrics: Pipeline → Pipeline Ingestion

Product: Cloud Services → Data Platform and Tools

Comment 4 • 8 years ago

(In reply to Gregory Szorc [:gps] from comment #3)
> Initially, we're looking to do some rapid prototyping on the type of data we
> send. So we're probably looking for a single endpoint that accepts N
> different message types with initially no schema validation. The data will
> almost certainly be JSON.
>
> Once we have the client-side bits implemented and have confidence in the
> data we're sending, we can formalize a schema "productionize" the ingestion.
> How that switchover works, I'm not sure. It probably involves 2 endpoints
> (e.g. "stage" vs "production") or some kind of routing key in the HTTP
> request. Not sure what options are available.

Greg: do we have any further information about the format/content of data to be submitted via |mach telemetry| after 9 months?

Flags: needinfo?(gps)

Comment 5 • 8 years ago

(In reply to Chris Cooper [:coop] from comment #4)
> Greg: do we have any further information about the format/content of data to
> be submitted via |mach telemetry| after 9 months?

No. The best we have is the mach resource-usage.json file that is produced during builds. But that lacks metadata like what mach command was used, CPU count, etc. These are things we'd almost certainly want in mach telemetry.

At some point, someone just needs to hack up a PoC for what to capture. Or we can set up the endpoint to allow ingestion of anything until we get our act together.

Flags: needinfo?(gps)

Comment 6 • 8 years ago

(In reply to Gregory Szorc [:gps] from comment #5)
> (In reply to Chris Cooper [:coop] from comment #4)
> > Greg: do we have any further information about the format/content of data to
> > be submitted via |mach telemetry| after 9 months?
>
> No. The best we have is the mach resource-usage.json file that is produced
> during builds. But that lacks metadata like what mach command was used, CPU
> count, etc. These are things we'd almost certainly want in mach telemetry.
>
> At some point, someone just needs to hack up a PoC for what to capture. Or
> we can set up the endpoint to allow ingestion of anything until we get our
> act together.

Ted has signed up to do the PoC as a deliverable this quarter. Laura was quite interested in this when I spoke to her yesterday.

Updated • 8 years ago

Blocks: buildmetrics

Jason Thomas [:jason]

Updated • 8 years ago

Depends on: 1363160

Comment 7 • 7 years ago

Wes: any update on this, or the generic ingestion service in general? Do you have a new ETA?

Flags: needinfo?(whd)

Assignee

Comment 8 • 7 years ago

I spec'd out the service requirements with :mreid and :jason, which resulted in the creation of bug #1368197 and bug #1368196, which I'm adding as blockers. Unfortunately I've not had time to do the development work on this as I've been focused on Mission Control. Priorities permitting, I expect to be able to work on this next in mid to late July. Depending on whether the development work has been done by then, it should take anywhere from one to a few weeks.

Depends on: 1368196, 1368197

Flags: needinfo?(whd)

Comment 9 • 7 years ago

I'm not seeing any updates in the new dependencies... has this been de-prioritized?

Flags: needinfo?(whd)

Assignee

Comment 10 • 7 years ago

I expect to start working on this within the next two weeks.

Flags: needinfo?(whd)

Comment 11 • 7 years ago

Any update here? We've punted on this for 2 quarters now.

Flags: needinfo?(whd)

Jason Thomas [:jason]

Updated • 7 years ago

Whiteboard: [SvcOps] → [DataOps]

Assignee

Comment 12 • 7 years ago

This infrastructure is (finally) available. The process for adding a namespace isn't documented and will probably change to be more self-service, but the current instructions are to file a PR against https://github.com/mozilla-services/mozilla-pipeline-schemas similar to https://github.com/mozilla-services/mozilla-pipeline-schemas/pull/104/files with JSON and parquet schemas for your data. This PR unfortunately also has some extra telemetry-specific diffs; you should only need to add schema files to e.g. schemas/mach and templates/mach for your ping types. Once that's merged and deployed, you should be able to POST your JSON blobs to e.g. https://incoming.telemetry.mozilla.org/submit/mach/<doctype>/<docversion>/<docid> and have the result accessible in stmo and atmo.
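For illustration only, here is a minimal Python sketch of what a client-side submission to that endpoint might look like. The URL layout comes from the comment above; everything else is an assumption: the doctype `build`, the docversion `1`, the payload fields (`command`, `cpu_count`), the use of a UUID as the document ID, and the `build_submission` helper itself are all hypothetical. The sketch only builds the request and does not send it.

```python
import json
import urllib.request
import uuid

# Base URL taken from the comment above; the path layout is
# /submit/<namespace>/<doctype>/<docversion>/<docid>.
BASE_URL = "https://incoming.telemetry.mozilla.org/submit"


def build_submission(namespace, doctype, docversion, payload, docid=None):
    """Build (but do not send) a POST request carrying a JSON payload.

    Hypothetical helper: using a random UUID as the document ID is an
    assumption, not something the comment above specifies.
    """
    if docid is None:
        docid = str(uuid.uuid4())
    url = f"{BASE_URL}/{namespace}/{doctype}/{docversion}/{docid}"
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical doctype, version, and payload fields, for illustration only.
request = build_submission(
    "mach",
    "build",
    1,
    {"command": "build", "cpu_count": 8},
    docid="11111111-2222-3333-4444-555555555555",
)
print(request.get_method(), request.full_url)
```

Passing the resulting object to `urllib.request.urlopen(request)` would perform the actual POST. Whether the service accepts it depends on the schema for the namespace having been merged and deployed, as described above.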

Flags: needinfo?(whd)

Mark Reid [:mreid]

Comment 13 • 7 years ago

Greg, are you able to start working out a schema for the Mach submissions? I'm happy to help you with that, just let me know.

Flags: needinfo?(gps)

Kim Moir [:kmoir] ET

Comment 14 • 7 years ago

I talked to mcote about this and we decided to de-prioritize this work for a few weeks until we have a better handle on the hypothesis we want to test before collecting this data.

Comment 15 • 7 years ago

Per kmoir, I guess there's nothing to do right this moment. But thanks for the offer to help with a schema!

Flags: needinfo?(gps)