Abstract. Prediction of queue waiting times of jobs submitted to pro-duction parallel batch systems is important to provide overall estimates to users and can also help meta-schedulers make scheduling decisions. In this work, we have developed a framework for predicting ranges of queue waiting times for jobs by employing multi-class classification of similar jobs in history. Our hierarchical prediction strategy first predicts the point wait time of a job using dynamic k-Nearest Neighbor (kNN) method. It then performs a multi-class classification using Support Vec-tor Machines (SVMs) among all the classes of the jobs. The probabilities given by the SVM for the class predicted using k-NN and its neighbor-ing classes are used to provide a set ...
Metacomputing is a convenient and powerful abstraction for dealing with the complexities that arise ...
High performance grid computing is a key enabler of large scale collaborative computational science....
Abstract: This paper proposes a new scheduler to schedule parallel jobs on Clusters that may be part...
Prediction of queue waiting times of jobs submitted to production parallel batch systems is importan...
Production parallel systems are space-shared and employ batch queues in which the jobs submitted to ...
Abstract. Production parallel systems are space-shared and hence em-ploy batch queues in which the j...
Production parallel systems are space-shared, and resource allocation on such systems is usually per...
Abstract. When a moldable job is submitted to a space-sharing parallel computer, it must choose whet...
Large-scale distributed computing systems such as grids are serving a growing number of scientists. ...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Batch processing machines that can process a group of jobs simultaneously are often encountered in s...
Most space-sharing resources presently operated by high performance computing centers employ some so...
The paper is devoted to machine learning methods and algorithms for the supercomputer jobs executio...
This paper proposes a two-level scheduler for dynamically scheduling a continuous stream of sequenti...
In many real world systems, servers are assigned to work in parallel to increase the system throughp...
Metacomputing is a convenient and powerful abstraction for dealing with the complexities that arise ...
High performance grid computing is a key enabler of large scale collaborative computational science....
Abstract: This paper proposes a new scheduler to schedule parallel jobs on Clusters that may be part...
Prediction of queue waiting times of jobs submitted to production parallel batch systems is importan...
Production parallel systems are space-shared and employ batch queues in which the jobs submitted to ...
Abstract. Production parallel systems are space-shared and hence em-ploy batch queues in which the j...
Production parallel systems are space-shared, and resource allocation on such systems is usually per...
Abstract. When a moldable job is submitted to a space-sharing parallel computer, it must choose whet...
Large-scale distributed computing systems such as grids are serving a growing number of scientists. ...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Batch processing machines that can process a group of jobs simultaneously are often encountered in s...
Most space-sharing resources presently operated by high performance computing centers employ some so...
The paper is devoted to machine learning methods and algorithms for the supercomputer jobs executio...
This paper proposes a two-level scheduler for dynamically scheduling a continuous stream of sequenti...
In many real world systems, servers are assigned to work in parallel to increase the system throughp...
Metacomputing is a convenient and powerful abstraction for dealing with the complexities that arise ...
High performance grid computing is a key enabler of large scale collaborative computational science....
Abstract: This paper proposes a new scheduler to schedule parallel jobs on Clusters that may be part...