Since its introduction in 2004, the MapReduce framework has become one of the standard approaches in massive distributed and parallel computation. In contrast to its intensive use in practice, its theoretical footing is still limited, and little work has been done to put MapReduce on a par with the major computational models. Following pioneering work that relates the MapReduce framework to PRAM and BSP in their macroscopic structure, we focus on the functionality provided by the framework itself, considered in the parallel external memory (PEM) model. In this setting, we present upper and lower bounds on the parallel I/O complexity of the shuffle step that match up to constant factors. The shuffle step is the single communication pha...
Algorithms for mitigating imbalance of the MapReduce computations are considered in this paper. Map...
In an attempt to increase the performance/cost ratio, large compute clusters are becoming heterogene...
MapReduce is arguably the most successful parallelization framework especially for process...
Data abundance poses the need for powerful and easy-to-use tools that support processing large amoun...
The programming paradigm Map-Reduce [3] and its main open-source implementation, Hadoop [1], have ha...
Abstract. The MapReduce framework has been generating a lot of interest in a wide range of areas. It...
In this paper, we describe efficient MapReduce simulations of parallel algorithms specified in the B...
In this paper, we study the MapReduce framework from an algorithmic standpoint and demonstrate the u...
ESA 2013: 21st Annual European Symposium, Sophia Antipolis, France, 2-4 September 2013. In this paper, ...
This work explores fundamental modeling and algorithmic issues arising in the well-established MapRe...
The MapReduce framework has firmly established itself as one of the most widely used parallel comput...
In this paper we study the tradeoff between parallelism and communication cost in a map-reduce compu...
Abstract. In this paper we study the MapReduce Class (MRC) defined by Karloff et al., which is a for...