The paper is devoted to machine learning methods and algorithms for the supercomputer jobs execution prediction. The supercomputers statistics shows that the actual runtime of the most of the jobs substantially diverges from the time requested by the user. This reduces the efficiency of scheduling jobs, since an inaccurate job execution time estimation leads to a suboptimal jobs schedule. The job classification is considered, it is based on the difference between the job actual and the requested execution time. Forecast was made on the base of supercomputer multiuser job management system statistics by assigning a submitted job to one of the classes. The statistics of supercomputers MVS-100K and MVS-10P in the Joint Supercomputer Center ...
Every day, supercomputers execute 1000s of jobs with different characteristics. Data centers monitor...
AbstractIn heterogeneous and distributed environments it is necessary to create schedules for utilis...
Improving the reliability and performance are of utmost importance for any system. This thesis prese...
Large high-performance computing systems are built with increasing number of components with more CP...
This paper evaluates several main learning and heuris-tic techniques for application run time predic...
This work concerns with the application of Data mining models to the prediction of the running time ...
Doctor of PhilosophyDepartment of Computer ScienceDaniel A. AndresenOverestimation of High Performan...
Traditionally, mathematical optimization methods have been applied in manufacturing industries where...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Prediction of queue waiting times of jobs submitted to production parallel batch systems is importan...
The aim of this paper is to provide a description of machine learning based scheduling approach for ...
The resurgence of machine learning since the late 1990s has been enabled by significant advances in ...
To make effective job placement policies for a volatile large scale heterogeneous system or in grid ...
Abstract-In any organization’s talent management is becoming an increasingly crucial method of appro...
We present a technique for deriving predictions for the run times of parallel applications from the ...
Every day, supercomputers execute 1000s of jobs with different characteristics. Data centers monitor...
AbstractIn heterogeneous and distributed environments it is necessary to create schedules for utilis...
Improving the reliability and performance are of utmost importance for any system. This thesis prese...
Large high-performance computing systems are built with increasing number of components with more CP...
This paper evaluates several main learning and heuris-tic techniques for application run time predic...
This work concerns with the application of Data mining models to the prediction of the running time ...
Doctor of PhilosophyDepartment of Computer ScienceDaniel A. AndresenOverestimation of High Performan...
Traditionally, mathematical optimization methods have been applied in manufacturing industries where...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Prediction of queue waiting times of jobs submitted to production parallel batch systems is importan...
The aim of this paper is to provide a description of machine learning based scheduling approach for ...
The resurgence of machine learning since the late 1990s has been enabled by significant advances in ...
To make effective job placement policies for a volatile large scale heterogeneous system or in grid ...
Abstract-In any organization’s talent management is becoming an increasingly crucial method of appro...
We present a technique for deriving predictions for the run times of parallel applications from the ...
Every day, supercomputers execute 1000s of jobs with different characteristics. Data centers monitor...
AbstractIn heterogeneous and distributed environments it is necessary to create schedules for utilis...
Improving the reliability and performance are of utmost importance for any system. This thesis prese...