This article focuses on the problem of dealing with low accuracy of job runtime estimates provided by users of high performance computing systems. The main goal of the study is to evaluate the benefits on the system utilization of providing accurate estimations, in order to motivate users to make an effort to provide better estimates. We propose the Penalty Scheduling Policy for including information about user estimates. The experimental evaluation is performed over realistic workload and scenarios, and validated by the use of a job scheduler simulator. We simulated different static and dynamic scenarios, which emulate diverse user behavior regarding the estimation of jobs runtime. Results demonstrate that the accuracy of users runtime es...
This work analyses the benefits of a system in an HPC environment, which being able to get real perf...
Traditional scheduling techniques are of a by-gone era and do not cater for the dynamism of new and ...
Evaluating the performance of a computer system is based on using representative workloads. Common p...
To effectively manage High-Performance Computing (HPC) resources, it is essential to maximize return...
The infrastructure of High Performance Computing (HPC) systems is rapidly increasing in complexity a...
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
One of the most important metrics of machine efficiency in HPC is job turnaround time, which is the ...
Taufer, MichelaHigh performance computing (HPC) is undergoing many changes at both the system and wo...
International audienceJob scheduling in high-performance computing platforms is a hard problem that ...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
International audienceThe job management system is the HPC middleware responsible for distributing c...
The most commonly used scheduling algorithm for parallel super-computers is FCFS with backlling, as ...
System administrators for parallel computers face many difficulties when managing job scheduling sys...
Doctor of PhilosophyDepartment of Computer ScienceDaniel A. AndresenOverestimation of High Performan...
As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching stra...
This work analyses the benefits of a system in an HPC environment, which being able to get real perf...
Traditional scheduling techniques are of a by-gone era and do not cater for the dynamism of new and ...
Evaluating the performance of a computer system is based on using representative workloads. Common p...
To effectively manage High-Performance Computing (HPC) resources, it is essential to maximize return...
The infrastructure of High Performance Computing (HPC) systems is rapidly increasing in complexity a...
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
One of the most important metrics of machine efficiency in HPC is job turnaround time, which is the ...
Taufer, MichelaHigh performance computing (HPC) is undergoing many changes at both the system and wo...
International audienceJob scheduling in high-performance computing platforms is a hard problem that ...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
International audienceThe job management system is the HPC middleware responsible for distributing c...
The most commonly used scheduling algorithm for parallel super-computers is FCFS with backlling, as ...
System administrators for parallel computers face many difficulties when managing job scheduling sys...
Doctor of PhilosophyDepartment of Computer ScienceDaniel A. AndresenOverestimation of High Performan...
As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching stra...
This work analyses the benefits of a system in an HPC environment, which being able to get real perf...
Traditional scheduling techniques are of a by-gone era and do not cater for the dynamism of new and ...
Evaluating the performance of a computer system is based on using representative workloads. Common p...