Closed Bug 1467242 Opened 7 years ago Closed 7 years ago

spark-streaming - Jenkins job that runs databricks jobs api cmds

Categories

(Data Platform and Tools Graveyard :: Operations, defect, P1)

defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: hwoo, Assigned: hwoo)

References

Details

- Needs to query only jobs started by the jenkins user Jenkins job will query databricks jobs api to see if ENV & VERSION already exist. If it does, then issue a start/stop on existing cluster. - Reset(config) - Stop job - Poll until Stopped - Start job If ENV and VERSION do not exist in db rest api, or do not exist then - Create jobs api => get ID back - Run job with different output path, different checkpoint path, same kafka stream, new cluster, new checkpoint? (dbfs unique checkpoint path)
Assignee: nobody → hwoo
Blocks: 1457938
Priority: -- → P1
1. assert creator_user_name == some jenkins user 2. add VERSION to the custom_tags 3. if ENV&&VERSION are the same invoke reset/stop/poll/start else invoke create/get id/run on new cluster with new parameters curl 'https://dbc-caf9527b-e073.cloud.databricks.com/api/2.0/jobs/list' -X GET -H "Authorization: Bearer xxxx" | jq . some refs https://github.com/mozilla/telemetry-streaming/compare/master...akkomar:db_deploy_temp https://github.com/mozilla/telemetry-streaming/compare/job_definitions https://docs.databricks.com/api/latest/jobs.html#create
Priority: P1 → P2
Priority: P2 → P3
Priority: P3 → P1
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
Product: Data Platform and Tools → Data Platform and Tools Graveyard
You need to log in before you can comment on or make changes to this bug.