International audienceEASY-Backfilling is a popular scheduling heuristic for allocating jobs in large scale High Performance Computing platforms. While its aggressive reservation mechanism is fast and prevents job starvation, it does not try to optimize any scheduling objective per se. We consider in this work the problem of tuning EASY using queue reordering policies. More precisely, we propose to tune the reordering using a simulation-based methodology. For a given system, we choose the policy in order to minimize the average waiting time. This methodology departs from the First-Come, First-Serve rule and introduces a risk on the maximum values of the waiting time, which we control using a queue thresholding mechanism. This new approach i...
Despite the impressive growth and size of super-computers, the computational power they provide stil...
When order release is applied, jobs are withheld in a backlog from where they are released to meet c...
Parallel jobs have different runtimes and numbers of threads/processes. Thus, scheduling parallel jo...
International audienceEASY-Backfilling is a popular scheduling heuristic for allocating jobs in larg...
International audienceThe EASY-FCFS heuristic is the basic building block of job scheduling policies...
International audienceDynamic scheduling of tasks in large-scale HPC platforms is normally accomplis...
International audienceThe job management system is the HPC middleware responsible for distributing c...
Abstract. Job scheduling policies for HPC centers have been extensively stud-ied in the last few yea...
When order release is applied, jobs are withheld in a backlog from where they are released to meet c...
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
International audienceThis paper focuses on the resilient scheduling of parallel jobs on high-perfor...
resources are lost due to fragmentation. The problem is that the goal of high utilization may confli...
Scheduling jobs in High-Performance Computing (HPC) platforms typically involves heuristics consisti...
International audienceThis paper focuses on the resilient scheduling of parallel jobs on high-perfor...
High-Performance Computing (HPC) platforms are growing in size and complexity. In an adversarial man...
Despite the impressive growth and size of super-computers, the computational power they provide stil...
When order release is applied, jobs are withheld in a backlog from where they are released to meet c...
Parallel jobs have different runtimes and numbers of threads/processes. Thus, scheduling parallel jo...
International audienceEASY-Backfilling is a popular scheduling heuristic for allocating jobs in larg...
International audienceThe EASY-FCFS heuristic is the basic building block of job scheduling policies...
International audienceDynamic scheduling of tasks in large-scale HPC platforms is normally accomplis...
International audienceThe job management system is the HPC middleware responsible for distributing c...
Abstract. Job scheduling policies for HPC centers have been extensively stud-ied in the last few yea...
When order release is applied, jobs are withheld in a backlog from where they are released to meet c...
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
International audienceThis paper focuses on the resilient scheduling of parallel jobs on high-perfor...
resources are lost due to fragmentation. The problem is that the goal of high utilization may confli...
Scheduling jobs in High-Performance Computing (HPC) platforms typically involves heuristics consisti...
International audienceThis paper focuses on the resilient scheduling of parallel jobs on high-perfor...
High-Performance Computing (HPC) platforms are growing in size and complexity. In an adversarial man...
Despite the impressive growth and size of super-computers, the computational power they provide stil...
When order release is applied, jobs are withheld in a backlog from where they are released to meet c...
Parallel jobs have different runtimes and numbers of threads/processes. Thus, scheduling parallel jo...