International audienceThe job management system is the HPC middleware responsible for distributing computing power to applications. While such systems generate an ever increasing amount of data, they are characterized by uncertainties on some parameters like the job running times. The question raised in this work is: To what extent is it possible/useful to take into account predictions on the job running times for improving the global scheduling? We present a comprehensive study for answering this question assuming the popular EASY backfilling policy. More precisely, we rely on some classical methods in machine learning and propose new cost functions well-adapted to the problem. Then, we assess our proposed solutions through intensive simul...
One of the most important metrics of machine efficiency in HPC is job turnaround time, which is the ...
The issue of under-estimated length of jobs (parallel applications) on backfill-based scheduling is ...
As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching stra...
International audienceThe job management system is the HPC middleware responsible for distributing c...
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
The most commonly used scheduling algorithm for parallel super-computers is FCFS with backlling, as ...
International audienceEASY-Backfilling is a popular scheduling heuristic for allocating jobs in larg...
Backfilling is a simple and effective way of improving the utilization of space-sharing schedulers. ...
International audienceDynamic scheduling of tasks in large-scale HPC platforms is normally accomplis...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Abstract. Job scheduling policies for HPC centers have been extensively stud-ied in the last few yea...
International audienceJob scheduling in high-performance computing platforms is a hard problem that ...
This article focuses on the problem of dealing with low accuracy of job runtime estimates provided b...
The infrastructure of High Performance Computing (HPC) systems is rapidly increasing in complexity a...
Doctor of PhilosophyDepartment of Computer ScienceDaniel A. AndresenOverestimation of High Performan...
One of the most important metrics of machine efficiency in HPC is job turnaround time, which is the ...
The issue of under-estimated length of jobs (parallel applications) on backfill-based scheduling is ...
As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching stra...
International audienceThe job management system is the HPC middleware responsible for distributing c...
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
The most commonly used scheduling algorithm for parallel super-computers is FCFS with backlling, as ...
International audienceEASY-Backfilling is a popular scheduling heuristic for allocating jobs in larg...
Backfilling is a simple and effective way of improving the utilization of space-sharing schedulers. ...
International audienceDynamic scheduling of tasks in large-scale HPC platforms is normally accomplis...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Abstract. Job scheduling policies for HPC centers have been extensively stud-ied in the last few yea...
International audienceJob scheduling in high-performance computing platforms is a hard problem that ...
This article focuses on the problem of dealing with low accuracy of job runtime estimates provided b...
The infrastructure of High Performance Computing (HPC) systems is rapidly increasing in complexity a...
Doctor of PhilosophyDepartment of Computer ScienceDaniel A. AndresenOverestimation of High Performan...
One of the most important metrics of machine efficiency in HPC is job turnaround time, which is the ...
The issue of under-estimated length of jobs (parallel applications) on backfill-based scheduling is ...
As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching stra...