Size-based scheduling with aging has, for long, been recognized as an effective approach to guarantee fairness and near-optimal system response times. We present HFSP, a scheduler introducing this technique to a real, multi-server, complex and widely used system such as Hadoop. Size-based scheduling requires a priori job size information, which is not available in Hadoop: HFSP builds such knowledge by estimating it on-line during job execution. Our experiments, which are based on realistic workloads generated via a standard benchmarking suite, pinpoint at a significant decrease in system response times with respect to the widely used Hadoop Fair scheduler, and show that HFSP is largely tolerant to job size estimation errors. © 2013 IEEE
At present, big data is very popular, because it has proved to be much successful in many fields suc...
Today scenario, we live in the data age and a key metric of existing times is the amount of data tha...
The exponential growth of collected data poses the challenge of efficient data processing among othe...
Abstract—Size-based scheduling with aging has, for long, been recognized as an effective approach to...
Size-based scheduling with aging has been recognized as an effective approach to guarantee fairness ...
Size-based scheduling with aging has been recognized as an effective approach to guarantee fairness ...
Size-based schedulers have very desirable performance properties: optimal or near-optimal response t...
The past decade have seen the rise of data-intensive scalable computing (DISC) systems, such as Hado...
Abstract—Size-based schedulers have very desirable performance properties: optimal or near-optimal r...
The majority of large-scale data severe applications executed by data centers are based on MapReduce...
La dernière décennie a vu l’émergence de systèmes parallèles pour l’analyse de grosse quantités de d...
Cloud computing is a power platform to deal with big data. Among several software frameworks used fo...
We study size-based schedulers, and focus on the impact of inaccurate job size information on respon...
Hadoop is a free, Java-based programming system that backings the preparing of vast informational co...
International audienceHadoop has been recently used to process a diverse variety of applications, sh...
At present, big data is very popular, because it has proved to be much successful in many fields suc...
Today scenario, we live in the data age and a key metric of existing times is the amount of data tha...
The exponential growth of collected data poses the challenge of efficient data processing among othe...
Abstract—Size-based scheduling with aging has, for long, been recognized as an effective approach to...
Size-based scheduling with aging has been recognized as an effective approach to guarantee fairness ...
Size-based scheduling with aging has been recognized as an effective approach to guarantee fairness ...
Size-based schedulers have very desirable performance properties: optimal or near-optimal response t...
The past decade have seen the rise of data-intensive scalable computing (DISC) systems, such as Hado...
Abstract—Size-based schedulers have very desirable performance properties: optimal or near-optimal r...
The majority of large-scale data severe applications executed by data centers are based on MapReduce...
La dernière décennie a vu l’émergence de systèmes parallèles pour l’analyse de grosse quantités de d...
Cloud computing is a power platform to deal with big data. Among several software frameworks used fo...
We study size-based schedulers, and focus on the impact of inaccurate job size information on respon...
Hadoop is a free, Java-based programming system that backings the preparing of vast informational co...
International audienceHadoop has been recently used to process a diverse variety of applications, sh...
At present, big data is very popular, because it has proved to be much successful in many fields suc...
Today scenario, we live in the data age and a key metric of existing times is the amount of data tha...
The exponential growth of collected data poses the challenge of efficient data processing among othe...