Prediction of queue waiting times of jobs submitted to production parallel batch systems is important to provide overall estimates to users and can also help meta-schedulers make scheduling decisions. In this work, we have developed a framework for predicting ranges of queue waiting times for jobs by employing multi-class classification of similar jobs in history. Our hierarchical prediction strategy first predicts the point wait time of a job using dynamic k-Nearest Neighbor (kNN) method. It then performs a multi-class classification using Support Vector Machines (SVMs) among all the classes of the jobs. The probabilities given by the SVM for the class predicted using k-NN and its neighboring classes are used to provide a set of ranges of ...
International audienceHigh Throughput Computing datacenters are a cornerstone of scientic discoverie...
We present a technique for deriving predictions for the run times of parallel applications from the ...
Metacomputing is a convenient and powerful abstraction for dealing with the complexities that arise ...
Prediction of queue waiting times of jobs submitted to production parallel batch systems is importan...
Production parallel systems are space-shared and employ batch queues in which the jobs submitted to ...
Abstract. Production parallel systems are space-shared and hence em-ploy batch queues in which the j...
Production parallel systems are space-shared, and resource allocation on such systems is usually per...
Abstract. When a moldable job is submitted to a space-sharing parallel computer, it must choose whet...
Large-scale distributed computing systems such as grids are serving a growing number of scientists. ...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
The paper is devoted to machine learning methods and algorithms for the supercomputer jobs executio...
These resources accompany the paper entitled "A Machine Learning Approach to Waiting Time Prediction...
Most space-sharing resources presently operated by high performance computing centers employ some so...
Serving customers for any organization is a factor in making profit and extending business. Waiting ...
Batch processing machines that can process a group of jobs simultaneously are often encountered in s...
International audienceHigh Throughput Computing datacenters are a cornerstone of scientic discoverie...
We present a technique for deriving predictions for the run times of parallel applications from the ...
Metacomputing is a convenient and powerful abstraction for dealing with the complexities that arise ...
Prediction of queue waiting times of jobs submitted to production parallel batch systems is importan...
Production parallel systems are space-shared and employ batch queues in which the jobs submitted to ...
Abstract. Production parallel systems are space-shared and hence em-ploy batch queues in which the j...
Production parallel systems are space-shared, and resource allocation on such systems is usually per...
Abstract. When a moldable job is submitted to a space-sharing parallel computer, it must choose whet...
Large-scale distributed computing systems such as grids are serving a growing number of scientists. ...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
The paper is devoted to machine learning methods and algorithms for the supercomputer jobs executio...
These resources accompany the paper entitled "A Machine Learning Approach to Waiting Time Prediction...
Most space-sharing resources presently operated by high performance computing centers employ some so...
Serving customers for any organization is a factor in making profit and extending business. Waiting ...
Batch processing machines that can process a group of jobs simultaneously are often encountered in s...
International audienceHigh Throughput Computing datacenters are a cornerstone of scientic discoverie...
We present a technique for deriving predictions for the run times of parallel applications from the ...
Metacomputing is a convenient and powerful abstraction for dealing with the complexities that arise ...