Modern high-performance computing (HPC) system designs have converged to heavyweight nodes with growing numbers of processors. If schedulers on these systems allocate nodes in an exclusive and dedicated manner, many HPC applications and scientific workflows will be unable to fully utilize and benefit from such hardware. This is because at such extreme scale, it will be difficult for modern HPC applications to utilize all of the node-level resources on these systems. In this paper, we investigate the potential of moving away from dedicated node allocation and instead using intelligent coscheduling—where multiple jobs can share node-level resources—to improve node utilization and therefore job turnaround time. We design and implement a cosche...
There are many kinds of scientific applications that run on high throughput computational (HTC) grid...
Workstation clusters are emerging as a general-purpose computing platform for the execution of workl...
Applications in high-performance computing (HPC) may not use all available computational resources, ...
Fine-grained parallel applications require all their processes to run simultaneously on distinct pro...
Coscheduling has been shown to be a critical factor in achieving efficient parallel execution in tim...
Many breakthroughs in scientific and industrial research are supported by simulations and calculatio...
Many scientific and high-performance computing applications consist of multiple processes running on...
Computational scientists are eager to utilize computing resources to execute their applications to a...
In recent years, the number of processing units per compute node has been increasing. In order to ut...
Network interference of nearby jobs has been recently identified as the dominant reason for the high...
Simultaneous Multithreading machines benefit from jobscheduling software that monitors how well cos...
Next generation HPC applications will increasingly time-share system resources with emerging workloa...
HPC facilities typically use batch scheduling to space-share jobs. In this paper we revisit time-sha...
scheduling In this paper, we utilize a bandwidth-centric job communication model that captures the i...
Abstract—In order for many-task applications to be attrac-tive candidates for running on high-end su...
There are many kinds of scientific applications that run on high throughput computational (HTC) grid...
Workstation clusters are emerging as a general-purpose computing platform for the execution of workl...
Applications in high-performance computing (HPC) may not use all available computational resources, ...
Fine-grained parallel applications require all their processes to run simultaneously on distinct pro...
Coscheduling has been shown to be a critical factor in achieving efficient parallel execution in tim...
Many breakthroughs in scientific and industrial research are supported by simulations and calculatio...
Many scientific and high-performance computing applications consist of multiple processes running on...
Computational scientists are eager to utilize computing resources to execute their applications to a...
In recent years, the number of processing units per compute node has been increasing. In order to ut...
Network interference of nearby jobs has been recently identified as the dominant reason for the high...
Simultaneous Multithreading machines benefit from jobscheduling software that monitors how well cos...
Next generation HPC applications will increasingly time-share system resources with emerging workloa...
HPC facilities typically use batch scheduling to space-share jobs. In this paper we revisit time-sha...
scheduling In this paper, we utilize a bandwidth-centric job communication model that captures the i...
Abstract—In order for many-task applications to be attrac-tive candidates for running on high-end su...
There are many kinds of scientific applications that run on high throughput computational (HTC) grid...
Workstation clusters are emerging as a general-purpose computing platform for the execution of workl...
Applications in high-performance computing (HPC) may not use all available computational resources, ...