Abstract—Torus-based networks are prevalent on leadership-class petascale systems because they offer a good balance between network cost and performance. The major disadvantage of this network architecture is its susceptibility to resource fragmentation. Many studies have attempted to reduce fragmentation in this architecture. Although the proposed approaches make good allocation decisions that reduce fragmentation at job start time, none of them considers a job’s walltime, so fragmentation still arises when neighboring jobs do not complete at around the same time. In this paper, we propose a walltime-aware job allocation strategy that packs jobs finishing around the same time next to one another, in order to minimize resource fragmentation caused by job len...
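The idea of packing jobs with similar finish times next to one another can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a one-dimensional ring of nodes standing in for a torus, and the names `free_blocks`, `neighbor_gap`, and `place` are hypothetical. Each ring entry is either `None` (free node) or the expected end time of the job occupying that node, and a new job goes into the free contiguous block whose busy neighbor is expected to finish closest to the new job's own end time.

```python
import math
from typing import Iterator, List, Optional

Ring = List[Optional[float]]  # None = free node, otherwise expected end time


def free_blocks(ring: Ring, size: int) -> Iterator[int]:
    """Yield every start index of a contiguous free run of `size` nodes (with wraparound)."""
    n = len(ring)
    for start in range(n):
        if all(ring[(start + k) % n] is None for k in range(size)):
            yield start


def neighbor_gap(ring: Ring, start: int, size: int, end_time: float) -> float:
    """Smallest |end_time - neighbor end time| over the two nodes bordering the block.

    Blocks with no busy neighbor score infinity so that they are chosen last
    (an assumption of this sketch: placing a job away from all others splits free space).
    """
    n = len(ring)
    neighbors = (ring[(start - 1) % n], ring[(start + size) % n])
    gaps = [abs(end_time - t) for t in neighbors if t is not None]
    return min(gaps) if gaps else math.inf


def place(ring: Ring, size: int, end_time: float) -> Optional[int]:
    """Allocate `size` contiguous nodes; return the start index, or None if nothing fits."""
    if size <= 0 or size > len(ring):
        return None
    best = min(
        free_blocks(ring, size),
        key=lambda s: (neighbor_gap(ring, s, size, end_time), s),
        default=None,
    )
    if best is not None:
        for k in range(size):
            ring[(best + k) % len(ring)] = end_time
    return best


if __name__ == "__main__":
    ring: Ring = [100, 100, None, None, None, None, 500, 500]
    print(place(ring, 2, 480))  # start index chosen for a job expected to end at t=480
    print(ring)
```

In this toy setting a job expected to end at time 480 is placed beside the jobs ending at 500 rather than beside those ending at 100, so the nodes released around time 500 form one larger free region. A real torus allocator would work on multi-dimensional partitions and on walltime estimates rather than exact end times.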
Contiguous allocation of parallel jobs usually suffers from the degrading effects of fragmentation a...
Abstract Efficient processor allocation and job scheduling algorithms are critical if the full compu...
The performance of contiguous allocation strategies can be significantly affected by the distributio...
Torus-connected networks are widely used in modern supercomputers due to their linear per-node cost scal...
Network interference of nearby jobs has been recently identified as the dominant reason for the high...
In this paper we investigate the problem of how to schedule n independent jobs on an m × m toru...
Abstract—As systems scale toward exascale, many resources will become increasingly constrained. Whil...
Two strategies are used for the allocation of jobs to processors connected by mesh topologies: conti...
Abstract. This paper studies the influence that job placement may have on scheduling performance, in...
Two strategies are used for the allocation of jobs to processors connected by mesh topologies: conti...
Abstract. Recent success in building petascale computing systems poses new challenges in job schedul...
Two strategies are used for the allocation of jobs to processors connected by mesh topologies: conti...
Contiguous allocation of parallel jobs usually suffers from the degrading effects of fragmentation a...
Abstract—In order for many-task applications to be attractive candidates for running on high-end su...
It has been recently discovered that on an unreliable server, the job completion time distribution f...