Closed Bug 1246408 Opened 9 years ago Closed 9 years ago

Update EMR release to 4.3.0

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: rvitillo, Assigned: whd)

References

Details

Roberto Agostino Vitillo (:rvitillo)

Reporter

Description

•

9 years ago

No description provided.

Roberto Agostino Vitillo (:rvitillo)

Reporter

Comment 1

•

9 years ago

Once bug 1248336 lands, Parquet datasets will be accessible from both Spark and Presto. Packing more profiles per row group seems to trigger a Spark bug that makes the "take(N)" operation require a full scan of the dataset. The bug can be avoided by converting the dataset to an RDD, but that hurts performance. Spark 1.6 doesn't suffer from this issue, so we should upgrade ASAP.
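For illustration, the RDD workaround mentioned above might look roughly like the sketch below. This is an assumption-laden example, not code from the bug: the helper names `take_rows` and `demo` and the dataset path passed to `demo` are hypothetical, and the exact behavior depends on the Spark version in use.

```python
def take_rows(df, n, via_rdd=False):
    """Return the first n rows of a Spark DataFrame.

    With via_rdd=True, take(n) is called on the underlying RDD
    instead of on the DataFrame (the workaround described above).
    """
    source = df.rdd if via_rdd else df
    return source.take(n)


def demo(path, n=5):
    """Hypothetical driver; needs a Spark installation to run."""
    # Imported lazily so take_rows() can be used without pyspark.
    from pyspark import SparkContext
    from pyspark.sql import SQLContext

    sc = SparkContext(appName="take-n-sketch")
    try:
        sql_context = SQLContext(sc)
        df = sql_context.read.parquet(path)
        # On the affected Spark version, df.take(n) reportedly scans
        # the whole dataset; going through the RDD avoids that.
        return take_rows(df, n, via_rdd=True)
    finally:
        sc.stop()
```

Calling `demo` with the location of a Parquet dataset would return the first few rows through the RDD path. On Spark 1.6, plain `take_rows(df, n)` should be enough, according to the comment above.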

Roberto Agostino Vitillo (:rvitillo)

Reporter

Updated

•

9 years ago

Flags: needinfo?(whd)

Roberto Agostino Vitillo (:rvitillo)

Reporter

Updated

•

9 years ago

Blocks: 1242039

Priority: -- → P1

Roberto Agostino Vitillo (:rvitillo)

Reporter

Comment 2

•

9 years ago

Note that Hive has to be deployed as well for Spark 1.6 to be able to read Parquet datasets.

Roberto Agostino Vitillo (:rvitillo)

Reporter

Updated

•

9 years ago

Blocks: 1251580

Rob Miller [:rmiller]

Updated

•

9 years ago

Points: --- → 1

Wesley Dawson [:whd]

Assignee

Comment 3

•

9 years ago

https://github.com/mozilla/emr-bootstrap-spark/pull/16

https://github.com/mozilla/telemetry-server/pull/146

Status: NEW → RESOLVED

Closed: 9 years ago

Flags: needinfo?(whd)

Resolution: --- → FIXED

BMO Automation

Updated

•

6 years ago

Product: Cloud Services → Cloud Services Graveyard


Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect, P1)

