Partial aggregation is of great importance in many dis-tributed data-parallel systems. Most notably, it is commonly applied by MapReduce programs to optimize I/O by succes-sively aggregating partially reduced results into a final result, as opposed to aggregating all input records at once. In spite of its importance, programmers currently enable partial ag-gregation by tediously encoding their reduce functionality into separate reduce and combine functions. This is error prone and often leads to missed optimization opportunities. This paper proposes an algorithm that automatically ver-ifies if the original monolithic reduce function of a MapRe-duce program is eligible for partial aggregation, and if so, synthesizes enabling partial aggregat...
In this paper we present a parallel implementation of Lévy's optimal reduction for the λ-calculus [1...
Abstract: Web-Scale Analytical Processing is a much investigated topic in current research. Next to ...
Producción CientíficaCurrent multicomputers are typically built as interconnected clusters of shared...
We study the problems of decomposing and sharing user-defined aggregate functions in distributed and...
Applications of aggregation for information summary have great meanings in various fields. In big da...
Abstract. MapReduce, being inspired by the map and reduce primi-tives available in many functional l...
Partial evaluation is an automatic program transformation that optimizes programs by specialization....
The emergence of the Internet as a computing platform increases the demand for new classes of algori...
We define and explore the design space of efficient algorithms to compute ROLLUP aggregates, using t...
The integration of computers into many facets of our lives has made the collection and storage of st...
Consider a network of processor elements arranged in a d-dimensional grid, where each processor can ...
Computing multiple related group-bys and aggregates is one of the core operations of On-Line Analyti...
On shared memory parallel computers (SMPCs) it is natural to focus on decomposing the computation (...
Master's thesis in Computer ScienceK-means is the most commonly known partitioning algorithm used fo...
International audienceAs the models that need to be handled in model-driven engineering grow in scal...
In this paper we present a parallel implementation of Lévy's optimal reduction for the λ-calculus [1...
Abstract: Web-Scale Analytical Processing is a much investigated topic in current research. Next to ...
Producción CientíficaCurrent multicomputers are typically built as interconnected clusters of shared...
We study the problems of decomposing and sharing user-defined aggregate functions in distributed and...
Applications of aggregation for information summary have great meanings in various fields. In big da...
Abstract. MapReduce, being inspired by the map and reduce primi-tives available in many functional l...
Partial evaluation is an automatic program transformation that optimizes programs by specialization....
The emergence of the Internet as a computing platform increases the demand for new classes of algori...
We define and explore the design space of efficient algorithms to compute ROLLUP aggregates, using t...
The integration of computers into many facets of our lives has made the collection and storage of st...
Consider a network of processor elements arranged in a d-dimensional grid, where each processor can ...
Computing multiple related group-bys and aggregates is one of the core operations of On-Line Analyti...
On shared memory parallel computers (SMPCs) it is natural to focus on decomposing the computation (...
Master's thesis in Computer ScienceK-means is the most commonly known partitioning algorithm used fo...
International audienceAs the models that need to be handled in model-driven engineering grow in scal...
In this paper we present a parallel implementation of Lévy's optimal reduction for the λ-calculus [1...
Abstract: Web-Scale Analytical Processing is a much investigated topic in current research. Next to ...
Producción CientíficaCurrent multicomputers are typically built as interconnected clusters of shared...