A well-known problem when executing data-intensive workloads with such frameworks as MapReduce is that small jobs with processing requirements counted in the minutes may suffer from the presence of huge jobs requiring hours or days of compute time, leading to a job slowdown distribution that is very variable and that is uneven across jobs of different sizes. Previous solutions to this problem for sequential or rigid jobs in single-server and distributed-server systems include priority-based FeedBack Queueing (FBQ), and Task Assignment by Guessing Sizes (TAGS), which kills and restarts from scratch on another server jobs that exceed the local time limit. In this paper, we derive four scheduling policies that are rightful descendants of exist...
MapReduce ecosystems are (still) widely popular for big data processing in data centers. To address ...
We consider the problem of task assignment in a distributed system (such as a distributed Web server...
The study of size-based and size-oblivious scheduling policies with inaccurate job size information ...
Many large-scale data analytics infrastructures are employed for a wide variety of jobs, ranging fro...
We consider a distributed server system and ask which policy should be used for assigning jobs (task...
In recent years there has been an extraordinary growth of large-scale data processing and related te...
We consider a distributed server system and ask which policy should be used for assigning jobs (task...
MapReduce is a popular parallel computing paradigm for large-scale data processing in clusters and d...
In job scheduling, the concept of malleability has been explored since many years ago. Research show...
This paper investigates the performance of task assignment policies for server farms as the variabil...
Abstract — This paper investigates the performance of task assignment policies for server farms as t...
MapReduce can speed up the execution of jobs operating over big data. A MapReduce job can be divided...
Abstract — The specific choice of workload task schedulers for Hadoop MapReduce applications can hav...
We consider a distributed server system and ask which policy should be used for assigning tasks to h...
We are entering a Big Data world. Many sectors of our economy are now guided by data-driven decision...
MapReduce ecosystems are (still) widely popular for big data processing in data centers. To address ...
We consider the problem of task assignment in a distributed system (such as a distributed Web server...
The study of size-based and size-oblivious scheduling policies with inaccurate job size information ...
Many large-scale data analytics infrastructures are employed for a wide variety of jobs, ranging fro...
We consider a distributed server system and ask which policy should be used for assigning jobs (task...
In recent years there has been an extraordinary growth of large-scale data processing and related te...
We consider a distributed server system and ask which policy should be used for assigning jobs (task...
MapReduce is a popular parallel computing paradigm for large-scale data processing in clusters and d...
In job scheduling, the concept of malleability has been explored since many years ago. Research show...
This paper investigates the performance of task assignment policies for server farms as the variabil...
Abstract — This paper investigates the performance of task assignment policies for server farms as t...
MapReduce can speed up the execution of jobs operating over big data. A MapReduce job can be divided...
Abstract — The specific choice of workload task schedulers for Hadoop MapReduce applications can hav...
We consider a distributed server system and ask which policy should be used for assigning tasks to h...
We are entering a Big Data world. Many sectors of our economy are now guided by data-driven decision...
MapReduce ecosystems are (still) widely popular for big data processing in data centers. To address ...
We consider the problem of task assignment in a distributed system (such as a distributed Web server...
The study of size-based and size-oblivious scheduling policies with inaccurate job size information ...