Closed Bug 1248336 Opened 9 years ago Closed 9 years ago

Pack more profiles per row group in longitudinal dataset.

Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect)

defect
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: rvitillo, Assigned: rvitillo)

References

Details

(Whiteboard: [loasis])

Given the size of the history of a profile in the longitudinal dataset, row groups tend to have only a few hundred profiles, which is detrimental for performance. Furthermore, Presto has issues reading Parquet datasets with small row groups and becomes practically unusable. As packing more profiles per row group is going to require more memory we should come up with a smarter grouping mechanism.
Blocks: 1246426
Whiteboard: [loasis]
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
Product: Cloud Services → Cloud Services Graveyard
You need to log in before you can comment on or make changes to this bug.