Closed Bug 1332457 Opened 7 years ago Closed 7 years ago

Experiment with providing a GraphQL API to Treeherder

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: wlach, Assigned: seban)

References

Details

Attachments

(3 files)

[treeherder] SebastinSanty:graphql > mozilla:master (GitHub Autolander Bot, 7 years ago; 47 bytes, text/x-github-pull-request)
[treeherder] SebastinSanty:gql > mozilla:master (GitHub Autolander Bot, 7 years ago; 47 bytes, text/x-github-pull-request; wlach: review+)
[treeherder] wlach:1332457-followup > mozilla:master (GitHub Autolander Bot, 7 years ago; 47 bytes, text/x-github-pull-request)

William Lachance (:wlach)

Reporter

Description

7 years ago

So I'm not 100% sure if this is a good idea, but I've chatted with a few people about it already and figured it might make sense to write down some of my thoughts, even if it's not something I'm going to be working on immediately.

Treeherder's REST API has proven to be a bit unwieldy for the purposes we want to put it towards. In particular, the jobs API endpoints provide *a lot* of data, which is usually not relevant to the consumers of it. For example, we hit this endpoint to get the jobs for display:

https://treeherder.mozilla.org/api/project/mozilla-inbound/jobs/?count=2000&result_set_id=161730&return_type=list

Much of this is actually unused by the frontend (at least in the initial view)! This slows load and query times, since that much more data needs to be both fetched from the database and processed into a JSON response.
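For reference, the request above can be rebuilt from its parts. This is a minimal sketch using only the Python standard library; the helper name `jobs_url` is made up for illustration, and it uses only the parameters visible in the URL quoted above. None of those parameters lets the caller say which fields should come back, which is the limitation being described.

```python
from urllib.parse import urlencode

BASE = "https://treeherder.mozilla.org/api/project"


def jobs_url(project, result_set_id, count=2000):
    """Build the jobs list URL quoted above (hypothetical helper)."""
    # Only parameters that appear in the URL from this bug are used here.
    params = {
        "count": count,
        "result_set_id": result_set_id,
        "return_type": "list",
    }
    return "{}/{}/jobs/?{}".format(BASE, project, urlencode(params))


url = jobs_url("mozilla-inbound", 161730)
print(url)
```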

At the same time, getting different types of information pertaining to a job (performance data, job details, error summary lines) requires multiple requests, which slows response time (every time you load the details panel for a job, 4+ HTTP requests need to be processed).

I suspect this sort of problem will become larger in the future, as some of the new views we might want to have into Treeherder data (e.g. a manifest-based view) will almost certainly need data that isn't in the main jobs table, which means yet more HTTP requests and a slower UI (or the tedious hand-coding of custom endpoints which return the data we need).

GraphQL (http://graphql.org/) is a burgeoning standard which seems to fit our exact requirements. You specify what data you want as a JSON-like "graph", traversing across object types if you like, and the API returns exactly what the user asked for in a single response. This seems like an ideal fit for solving the above problems.
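The field-selection idea can be illustrated without any GraphQL library. The following is a toy sketch in plain Python, not GraphQL syntax and not anything from Treeherder: the `select` function, the job record, and all field names are invented for illustration. It only shows the shape of the behaviour, where the caller names the fields it wants, may follow a link into a related object, and gets back nothing else.

```python
def select(obj, selection):
    """Return only the fields named in `selection`.

    `selection` maps a field name to None (take the value as-is) or to a
    nested selection dict (descend into a related object or list of objects).
    """
    if isinstance(obj, list):
        return [select(item, selection) for item in obj]
    out = {}
    for field, sub in selection.items():
        value = obj[field]
        out[field] = value if sub is None else select(value, sub)
    return out


# Hypothetical job record; these field names are NOT Treeherder's schema.
job = {
    "id": 1,
    "state": "completed",
    "result": "success",
    "machine": "example-machine",
    "job_details": [
        {"title": "artifact", "value": "log.txt",
         "url": "https://example.invalid/log.txt"},
    ],
}

# The caller asks for three fields, one of which crosses into a related type.
wanted = {"id": None, "result": None,
          "job_details": {"title": None, "value": None}}
trimmed = select(job, wanted)
print(trimmed)
```

A real GraphQL server does the same kind of selection against a typed schema, so the unused fields are never serialized in the first place.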

It also might be a good fit for solving some other things we don't yet have an answer to, like how to populate a development instance with a set of production data (since our endpoints only return a subset of the data in our database, it isn't possible to do this with them).

It seems like there's a pretty decent Python library for working with GraphQL called Graphene, which also has a Django integration extension, Graphene Django (https://github.com/graphql-python/graphene-django). An interesting proof of concept might be to use that to build up a simple mechanism for querying the jobs endpoint (example above) and then update our UI code to use it. Depending on the results of that experiment, we could consider proposing a project to expose our entire data model using this interface.

William Lachance (:wlach)

Reporter

Comment 1

7 years ago

Seban is going to drive creating a prototype of this feature. I'll mentor where need be.

Assignee: nobody → sebastinssanty

GitHub Autolander Bot

Comment 2

7 years ago

Attached file [treeherder] SebastinSanty:graphql > mozilla:master

GitHub Autolander Bot

Comment 3

7 years ago

Attached file [treeherder] SebastinSanty:gql > mozilla:master

Sebastin Santy [:seban]

Assignee

Updated

7 years ago

Attachment #8843394 - Flags: review?(wlachance)

Treeherder GitHub Bugbot

Comment 4

7 years ago

Commit pushed to master at https://github.com/mozilla/treeherder

https://github.com/mozilla/treeherder/commit/0564a3dbfab5b02cf04a635207389050827d0fa8
Bug 1332457 -  Experiment with providing a GraphQL API to Treeherder (#2225)

William Lachance (:wlach)

Reporter

Comment 5

7 years ago

Comment on attachment 8843394
[treeherder] SebastinSanty:gql > mozilla:master

Thanks for this, looking forward to building on top of it. :)

Attachment #8843394 - Flags: review?(wlachance) → review+

William Lachance (:wlach)

Reporter

Updated

7 years ago

Status: NEW → RESOLVED

Closed: 7 years ago

Resolution: --- → FIXED

GitHub Autolander Bot

Comment 6

7 years ago

Attached file [treeherder] wlach:1332457-followup > mozilla:master

Treeherder GitHub Bugbot

Comment 7

7 years ago

Commit pushed to master at https://github.com/mozilla/treeherder

https://github.com/mozilla/treeherder/commit/ff18d1c84dbdf736d22de4f903201da4ba9aeceb
Bug 1332457 - Move py to common requirements (#2237)

It's needed for graphql

Ed Morley [:emorley]

Updated

7 years ago

Depends on: 1349237

Categories

(Tree Management :: Treeherder: API, defect)

Security

(public)
