As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching strategies become critical for maintaining high system utilization while keeping waiting times low for jobs competing for HPC system resources. In this paper, we take a data-driven approach and investigate whether better dispatching decisions can be made by transforming the log data produced by an HPC system into useful knowledge about its workload. In particular, we focus on job duration, develop a data-driven approach to job duration prediction, and analyze the effect of different prediction approaches on dispatching decisions using a real workload dataset collected from Eurora, a hybrid HPC system. Experiments on various dispatching methods...
One of the most important metrics of machine efficiency in HPC is job turnaround time, which is the ...
High-performance Computing (HPC) systems have become essential instruments in our modern society. As...
This article focuses on the problem of dealing with low accuracy of job runtime estimates provided b...
The demand for more powerful supercomputers continues to increase along with the types of applicatio...
In their march towards exascale performance, HPC systems are becoming increasingly heterogeneou...
Doctor of Philosophy. Department of Computer Science. Daniel A. Andresen. Overestimation of High Performan...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Power consumption of current High Performance Computing systems has to be reduced by at least one or...
Taufer, Michela. High performance computing (HPC) is undergoing many changes at both the system and wo...
This work deals with the power-aware job dispatching problem in supercomputers; broadly speaking th...
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
This work analyses the benefits of a system in an HPC environment, which being able to get real perf...
HPC systems are increasingly being used for big data analytics and predictive model building that em...
The job management system is the HPC middleware responsible for distributing c...