We define and explore the design space of efficient algorithms to compute ROLLUP aggregates, using the MapReduce programming paradigm. Using a modeling approach, we explain the non-trivial trade-off between parallelism and communication costs that is inherent to a MapReduce implementation of ROLLUP. Furthermore, we design a new family of algorithms that, through a single parameter, allows finding a sweet spot in the parallelism vs. communication cost trade-off. We complement our work with an experimental approach, wherein we overcome some limitations of the model we use. Our results indicate that efficient ROLLUP aggregates require striking the right balance between parallelism and communication, for both one-round and chained algorithms.
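To make the trade-off concrete, here is a minimal single-round sketch of a MapReduce-style ROLLUP, simulated sequentially on toy data (the records, dimension hierarchy, and function names are illustrative, not the paper's actual algorithms). Emitting one key per prefix of the hierarchy gives maximal reduce-side parallelism, but multiplies the shuffled (communicated) data by the number of hierarchy levels — the cost the abstract refers to:

```python
from collections import defaultdict

def map_rollup(record, depth):
    """Map phase: emit one (key, value) pair per prefix of the
    dimension hierarchy. All grouping levels can then be reduced
    in parallel, at the price of duplicating each record depth+1
    times in the shuffle."""
    dims, value = record
    for level in range(depth + 1):
        yield (dims[:level], value)

def reduce_rollup(pairs):
    """Reduce phase: sum values per grouping key."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)

# Hypothetical sales records: ((country, city), amount)
records = [(("FR", "Paris"), 3), (("FR", "Lyon"), 2), (("IT", "Rome"), 4)]
shuffled = [kv for r in records for kv in map_rollup(r, depth=2)]
result = reduce_rollup(shuffled)
# result[()] holds the grand total, result[("FR",)] the per-country
# subtotal, and result[("FR", "Paris")] the finest-grained aggregate.
```

A chained variant would instead shuffle only the finest level and compute coarser levels from the reducers' outputs in later rounds, trading rounds for communication — the balance the single parameter mentioned above would control.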
Data abundance poses the need for powerful and easy-to-use tools that support processing large amoun...
MapReduce is a programming model for data-parallel programs originally intended for data centers. Ma...
Frequent Itemsets and Association Rules Mining (FIM) is a key task in knowledge discovery from data....
Data summarization queries that compute aggregates by grouping datasets across several dimensions ar...
MapReduce is a data processing approach, where a single machine acts as a master, assigning map/redu...
MapReduce is a programming model from Google for cluster-based computing in domains such as search eng...
Recently, several algorithms based on the MapReduce framework have been proposed for frequent patter...
The MapReduce framework has firmly established itself as one of the most widely used parallel comput...
From movie recommendations to fraud detection to personalized health care, there is growing need to ...
Recent innovations in Big Data have enabled major strides forward in our ability to glean im...
Many big data algorithms executed on MapReduce-like systems have a shuffle phase that often dominate...
A common approach in the design of MapReduce algorithms is to minimize the number of rounds. Indeed,...
This is a post-peer-review, pre-copyedit version of an article published in International Conference...