In applications of MapReduce, Terasort is one of the most successful ones, which has helped Hadoop to win the Sort Benchmark three times. While Terasort is known for its sorting speed on big data, its performance and energy consumption still can be optimized. We have analyzed the characteristics of Terasort and have identified the existence of idle notes, which does not only waste energy but also loses performance. Therefore, we optimize Terasort through a single-task distributed algorithm and a task self-resizing algorithm to save time and reduce the energy that is consumed by map nodes, which is caused by waiting for tasks and reduce nodes waiting for input. The algorithm proposed in this paper has proved to be effective in optimizing per...
The efficient use of energy is essential to address concerns of cost and sustainability. Many data c...
Over the last ten years MapReduce has emerged as one of the staples of distributed computing both in...
The energy-performance optimization of datacenters becomes ever challenging, due to heterogeneous wo...
In applications of MapReduce, Terasort is one of the most successful ones, which has helped Hadoop t...
The majority of large-scale data intensive applications executed by data centers are based on MapRed...
The majority of large-scale data intensive applications carried out by information centers are based...
With the explosion of data production, the efficiency of data management and analysis has been conce...
Abstract—The majority of large-scale data intensive applications executed by data centers are based ...
Abstract—The majority of large-scale data intensive appli-cations executed by data centers are based...
Hadoop biological systems become progressively significant for professionals of huge scale informati...
International audienceThe energy consumption of computational platforms has recently become a critic...
In todays scenario a word Big Data used by researchers is associated with large amount of data which...
It is reportedi that the electricity cost to operate a cluster may well exceed its acquisition cost,...
Most common huge volume data processing programs do counting, sorting, merging etc. Such programs re...
MapReduce is a framework for processing huge amounts of data in a distributed environment and Hadoop...
The efficient use of energy is essential to address concerns of cost and sustainability. Many data c...
Over the last ten years MapReduce has emerged as one of the staples of distributed computing both in...
The energy-performance optimization of datacenters becomes ever challenging, due to heterogeneous wo...
In applications of MapReduce, Terasort is one of the most successful ones, which has helped Hadoop t...
The majority of large-scale data intensive applications executed by data centers are based on MapRed...
The majority of large-scale data intensive applications carried out by information centers are based...
With the explosion of data production, the efficiency of data management and analysis has been conce...
Abstract—The majority of large-scale data intensive applications executed by data centers are based ...
Abstract—The majority of large-scale data intensive appli-cations executed by data centers are based...
Hadoop biological systems become progressively significant for professionals of huge scale informati...
International audienceThe energy consumption of computational platforms has recently become a critic...
In todays scenario a word Big Data used by researchers is associated with large amount of data which...
It is reportedi that the electricity cost to operate a cluster may well exceed its acquisition cost,...
Most common huge volume data processing programs do counting, sorting, merging etc. Such programs re...
MapReduce is a framework for processing huge amounts of data in a distributed environment and Hadoop...
The efficient use of energy is essential to address concerns of cost and sustainability. Many data c...
Over the last ten years MapReduce has emerged as one of the staples of distributed computing both in...
The energy-performance optimization of datacenters becomes ever challenging, due to heterogeneous wo...