Production parallel systems are space-shared and employ batch queues in which the jobs submitted to the systems are made to wait before execution. Thus, jobs submitted to parallel batch systems incur queue waiting times in addition to the execution times. Prediction of these queue waiting times is important to provide overall estimates to the users and can also help meta-schedulers make scheduling decisions. In the first part of our research, we have developed an integrated framework PQStar for identification and prediction of jobs with short queue waiting times. Analyses of the job traces of supercomputers reveal that about 56 to 99% of the jobs incur queue waiting times of less than an hour. Hence, identifying these quick starters or job...
The most commonly used scheduling algorithm for parallel super-computers is FCFS with backlling, as ...
International audienceHigh Throughput Computing datacenters are a cornerstone of scientic discoverie...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Prediction of queue waiting times of jobs submitted to production parallel batch systems is importan...
Abstract. Production parallel systems are space-shared and hence em-ploy batch queues in which the j...
Production parallel systems are space-shared, and resource allocation on such systems is usually per...
Large-scale distributed computing systems such as grids are serving a growing number of scientists. ...
Abstract. When a moldable job is submitted to a space-sharing parallel computer, it must choose whet...
High performance grid computing is a key enabler of large scale collaborative computational science....
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
Most space-sharing resources presently operated by high performance computing centers employ some so...
In many traditional job scheduling settings, it is assumed that one knows the time it will take for ...
International audienceJob scheduling in high-performance computing platforms is a hard problem that ...
In many real world systems, servers are assigned to work in parallel to increase the system throughp...
These resources accompany the paper entitled "A Machine Learning Approach to Waiting Time Prediction...
The most commonly used scheduling algorithm for parallel super-computers is FCFS with backlling, as ...
International audienceHigh Throughput Computing datacenters are a cornerstone of scientic discoverie...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
Prediction of queue waiting times of jobs submitted to production parallel batch systems is importan...
Abstract. Production parallel systems are space-shared and hence em-ploy batch queues in which the j...
Production parallel systems are space-shared, and resource allocation on such systems is usually per...
Large-scale distributed computing systems such as grids are serving a growing number of scientists. ...
Abstract. When a moldable job is submitted to a space-sharing parallel computer, it must choose whet...
High performance grid computing is a key enabler of large scale collaborative computational science....
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
Most space-sharing resources presently operated by high performance computing centers employ some so...
In many traditional job scheduling settings, it is assumed that one knows the time it will take for ...
International audienceJob scheduling in high-performance computing platforms is a hard problem that ...
In many real world systems, servers are assigned to work in parallel to increase the system throughp...
These resources accompany the paper entitled "A Machine Learning Approach to Waiting Time Prediction...
The most commonly used scheduling algorithm for parallel super-computers is FCFS with backlling, as ...
International audienceHigh Throughput Computing datacenters are a cornerstone of scientic discoverie...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...