Closed Bug 1349565 Opened 8 years ago Closed 8 years ago

Investigate and reduce EMR cluster bootstrap time

Categories

(Data Platform and Tools :: General, enhancement, P1)

enhancement
Points:
3

Tracking

(Not tracked)

RESOLVED INVALID

People

(Reporter: jason, Assigned: jason)

Details

(Whiteboard: [SvcOps])

Currently EMR cluster bootstrap for spark takes upwards of 30 minutes (or more) to complete. Investigate why it takes so long and make improvements to reduce time. https://github.com/mozilla/emr-bootstrap-spark
Points: --- → 3
Assignee: nobody → jthomas
Priority: -- → P1
Component: Metrics: Pipeline → Spark
Product: Cloud Services → Data Platform and Tools
We are investigating using databricks here instead.
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → INVALID
Component: Spark → General
You need to log in before you can comment on or make changes to this bug.