International audienceMany cloud computations process large datasets. Programming paradigms have been proposed to design this type of applications, so as to take advantage of the huge processing and storage options the cloud holds, but at the same time, to provide the user with a clean and easy to use interface. Among these programming models, we consider the MapReduce paradigm and its reference implementation, the Hadoop framework. We focus on the aspect of intermediate data, that is data produced and transferred between the two stages of the computation (map and reduce). The goal of this paper is to propose a storage mechanism for intermediate data with the purpose of optimizing the execution of MapReduce applications in the presence of f...
Abstract — Cloud Computing is emerging as a new computational paradigm shift.Hadoop MapReduce has be...
MapReduce has been emerging as a popular programming paradigm for data intensive computing in cluste...
International audienceHadoop is a reference software framework supporting the Map/Reduce programming...
International audienceMany cloud computations process large datasets. Programming paradigms have bee...
Data-intensive applications are nowadays, widely used in various domains to extract and process info...
A preliminary version of this paper has been published as INRIA Research Report RR-7140.Internationa...
A slightly revised version of this work is published in the Proceedings of the 24th IEEE Internation...
Abstract—Over the last 2-3 years, the importance of data-intensive computing has increasingly been r...
National audienceIn this report we address the problem of data management in clouds for the MapReduc...
International audienceAs data volumes increase at a high speed in more and more application fields o...
Large quantities of data have been generated from multiple sources at exponential rates in the last ...
Abstract—MapReduce is a programming model which allows the processing of vast amounts of data in par...
The Hadoop framework has been developed to effectively process data-intensive MapReduce applications...
International audienceAs Map-Reduce emerges as a leading programming paradigm for data-intensive com...
The typical cloud big data systems are the workflow-based including MapReduce which has emerged as t...
Abstract — Cloud Computing is emerging as a new computational paradigm shift.Hadoop MapReduce has be...
MapReduce has been emerging as a popular programming paradigm for data intensive computing in cluste...
International audienceHadoop is a reference software framework supporting the Map/Reduce programming...
International audienceMany cloud computations process large datasets. Programming paradigms have bee...
Data-intensive applications are nowadays, widely used in various domains to extract and process info...
A preliminary version of this paper has been published as INRIA Research Report RR-7140.Internationa...
A slightly revised version of this work is published in the Proceedings of the 24th IEEE Internation...
Abstract—Over the last 2-3 years, the importance of data-intensive computing has increasingly been r...
National audienceIn this report we address the problem of data management in clouds for the MapReduc...
International audienceAs data volumes increase at a high speed in more and more application fields o...
Large quantities of data have been generated from multiple sources at exponential rates in the last ...
Abstract—MapReduce is a programming model which allows the processing of vast amounts of data in par...
The Hadoop framework has been developed to effectively process data-intensive MapReduce applications...
International audienceAs Map-Reduce emerges as a leading programming paradigm for data-intensive com...
The typical cloud big data systems are the workflow-based including MapReduce which has emerged as t...
Abstract — Cloud Computing is emerging as a new computational paradigm shift.Hadoop MapReduce has be...
MapReduce has been emerging as a popular programming paradigm for data intensive computing in cluste...
International audienceHadoop is a reference software framework supporting the Map/Reduce programming...