Abstract—The majority of large-scale data intensive appli-cations executed by data centers are based on MapReduce or its open-source implementation, Hadoop. Such applications are executed on large clusters requiring large amounts of energy, making the energy costs a large fraction of the data center’s overall costs. Therefore minimizing the energy consumption when executing MapReduce jobs is a critical concern for data centers. In this paper, we propose a framework for improving the energy efficiency of MapReduce applications, while satis-fying the service level agreement (SLA). We first model the problem of energy-aware scheduling of MapReduce jobs as an Integer Program. We then propose a greedy algorithm, called Energy-aware MapReduce Sch...
International audienceWith the explosion of data production, the efficiency of data management and a...
Data are presently being produced at an increased speed in different formats, which complicates the ...
peer reviewedThe scheduling of parallel tasks is a topic that has received a lot of attention in rec...
Abstract—The majority of large-scale data intensive applications executed by data centers are based ...
The majority of large-scale data intensive applications executed by data centers are based on MapRed...
The majority of large-scale data intensive applications carried out by information centers are based...
MapReduce is a framework proposed by Google for processing huge amounts of data in a distributed env...
Clouds which continue to garner interest from practitioners in industry and academia require effecti...
Abstract—MapReduce has become a popular framework for Big Data applications. While MapReduce has rec...
The energy-performance optimization of datacenters becomes ever challenging, due to heterogeneous wo...
The efficient use of energy is essential to address concerns of cost and sustainability. Many data c...
22 pagesInternational audienceMapReduce is emerged as a prominent programming model for data-intensi...
Interests have been growing in energy management of the cluster effectively in order to reduce the e...
MapReduce framework has been one of the most prominent ways for efficient processing large amount of...
MapReduce is a framework proposed by Google for processing huge amounts of data in a distributed env...
International audienceWith the explosion of data production, the efficiency of data management and a...
Data are presently being produced at an increased speed in different formats, which complicates the ...
peer reviewedThe scheduling of parallel tasks is a topic that has received a lot of attention in rec...
Abstract—The majority of large-scale data intensive applications executed by data centers are based ...
The majority of large-scale data intensive applications executed by data centers are based on MapRed...
The majority of large-scale data intensive applications carried out by information centers are based...
MapReduce is a framework proposed by Google for processing huge amounts of data in a distributed env...
Clouds which continue to garner interest from practitioners in industry and academia require effecti...
Abstract—MapReduce has become a popular framework for Big Data applications. While MapReduce has rec...
The energy-performance optimization of datacenters becomes ever challenging, due to heterogeneous wo...
The efficient use of energy is essential to address concerns of cost and sustainability. Many data c...
22 pagesInternational audienceMapReduce is emerged as a prominent programming model for data-intensi...
Interests have been growing in energy management of the cluster effectively in order to reduce the e...
MapReduce framework has been one of the most prominent ways for efficient processing large amount of...
MapReduce is a framework proposed by Google for processing huge amounts of data in a distributed env...
International audienceWith the explosion of data production, the efficiency of data management and a...
Data are presently being produced at an increased speed in different formats, which complicates the ...
peer reviewedThe scheduling of parallel tasks is a topic that has received a lot of attention in rec...