Abstract. When a moldable job is submitted to a space-sharing parallel computer, it must choose whether to begin execution on a small, avail-able cluster or wait in queue for more processors to become available. To make this decision, it must predict how long it will have to wait for the larger cluster. We propose statistical techniques for predicting these queue times, and develop an allocation strategy that uses these predic-tions. We present a workload model based on observed workloads at the San Diego Supercomputer Center and the Cornell Theory Center, and use this model to drive simulations of various allocation strategies. We nd that prediction-based allocation not only improves the turnaround time of individual jobs; it also improves...
The integration of clusters of computers into computational grids has recently gained the attention ...
Abstract. Standard job scheduling uses static job sizes which lacks flexibility regarding changing l...
In a Computational Grid which consists of many com-puter clusters, job start time predictions are us...
Most space-sharing resources presently operated by high performance computing centers employ some so...
Production parallel systems are space-shared, and resource allocation on such systems is usually per...
Abstract. Prediction of queue waiting times of jobs submitted to pro-duction parallel batch systems ...
Real-time resource scheduling is an important factor for improving the performance of cluster comput...
In this paper, we present a scheduling scheme to estimate the turnaround time of parallel...
Production parallel systems are space-shared and employ batch queues in which the jobs submitted to ...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
scheduling In this paper, we utilize a bandwidth-centric job communication model that captures the i...
Abstract. Production parallel systems are space-shared and hence em-ploy batch queues in which the j...
A parallel real time system consists of several processors which are used to execute a set of paral...
The integration of clusters of computers into computational grids has recently gained the atten- tio...
Large-scale distributed computing systems such as grids are serving a growing number of scientists. ...
The integration of clusters of computers into computational grids has recently gained the attention ...
Abstract. Standard job scheduling uses static job sizes which lacks flexibility regarding changing l...
In a Computational Grid which consists of many com-puter clusters, job start time predictions are us...
Most space-sharing resources presently operated by high performance computing centers employ some so...
Production parallel systems are space-shared, and resource allocation on such systems is usually per...
Abstract. Prediction of queue waiting times of jobs submitted to pro-duction parallel batch systems ...
Real-time resource scheduling is an important factor for improving the performance of cluster comput...
In this paper, we present a scheduling scheme to estimate the turnaround time of parallel...
Production parallel systems are space-shared and employ batch queues in which the jobs submitted to ...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
scheduling In this paper, we utilize a bandwidth-centric job communication model that captures the i...
Abstract. Production parallel systems are space-shared and hence em-ploy batch queues in which the j...
A parallel real time system consists of several processors which are used to execute a set of paral...
The integration of clusters of computers into computational grids has recently gained the atten- tio...
Large-scale distributed computing systems such as grids are serving a growing number of scientists. ...
The integration of clusters of computers into computational grids has recently gained the attention ...
Abstract. Standard job scheduling uses static job sizes which lacks flexibility regarding changing l...
In a Computational Grid which consists of many com-puter clusters, job start time predictions are us...