Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties on both the job arrival process and their execution time. Users typically provide a loose upper bound estimate for job execution times that are hardly useful. Previous studies attempted to improve these estimates using regression techniques. Although these attempts provide reasonable predictions, they require a long period of training data. Furthermore, aiming for perfect prediction may be of limited use for scheduling purposes. In this work, we propose a simpler approach by classifying jobs as small or large and prioritizing the execution of small jobs over large ones. Indeed, small jobs are the most impacted by queuing delays but they typi...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Temporal dependence in workloads creates peak congestion that can make service unavailable and reduc...
A well-known problem when executing data-intensive workloads with such frameworks as MapReduce is th...
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
International audienceJob scheduling in high-performance computing platforms is a hard problem that ...
International audienceThe job management system is the HPC middleware responsible for distributing c...
International audienceDespite the impressive growth and size of super-computers, the computational p...
The infrastructure of High Performance Computing (HPC) systems is rapidly increasing in complexity a...
This article focuses on the problem of dealing with low accuracy of job runtime estimates provided b...
International audienceDynamic scheduling of tasks in large-scale HPC platforms is normally accomplis...
International audienceWe propose a novel job scheduling approach for homogeneous cluster computing p...
Taufer, MichelaHigh performance computing (HPC) is undergoing many changes at both the system and wo...
Backfilling is a simple and effective way of improving the utilization of space-sharing schedulers. ...
We study size-based schedulers, and focus on the impact of inaccurate job size information on respon...
As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching stra...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Temporal dependence in workloads creates peak congestion that can make service unavailable and reduc...
A well-known problem when executing data-intensive workloads with such frameworks as MapReduce is th...
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
International audienceJob scheduling in high-performance computing platforms is a hard problem that ...
International audienceThe job management system is the HPC middleware responsible for distributing c...
International audienceDespite the impressive growth and size of super-computers, the computational p...
The infrastructure of High Performance Computing (HPC) systems is rapidly increasing in complexity a...
This article focuses on the problem of dealing with low accuracy of job runtime estimates provided b...
International audienceDynamic scheduling of tasks in large-scale HPC platforms is normally accomplis...
International audienceWe propose a novel job scheduling approach for homogeneous cluster computing p...
Taufer, MichelaHigh performance computing (HPC) is undergoing many changes at both the system and wo...
Backfilling is a simple and effective way of improving the utilization of space-sharing schedulers. ...
We study size-based schedulers, and focus on the impact of inaccurate job size information on respon...
As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching stra...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Temporal dependence in workloads creates peak congestion that can make service unavailable and reduc...
A well-known problem when executing data-intensive workloads with such frameworks as MapReduce is th...