HPC facilities typically use batch scheduling to space-share jobs. In this paper we revisit time-sharing using a trace of over 2.4 million jobs obtained during 20 months of operation of a modern petascale supercomputer. Our simulations show that batch scheduling produces skewed distributions with much larger slowdowns for shorter-running, larger jobs, whereas time-sharing produces more uniform slowdowns. Consequently, for applications that strong scale, the turnaround time does not scale with batch scheduling, but it does with time-sharing, resulting in turnarounds that are orders of magnitude better at the largest scales. We also show that time-sharing can confer additional benefits in noisy systems and with modern programming practices. F...
Typical HPC job scheduler software determines scheduling order by a linear sum of weighted priority ...
Computational scientists are eager to utilize computing resources to execute their applications to a...
Nowadays many supercomputer users are dissatisfied with a long waiting time for their jobs in the su...
Abstract—In order for many-task applications to be attrac-tive candidates for running on high-end su...
Next generation HPC applications will increasingly time-share system resources with emerging workloa...
Many breakthroughs in scientific and industrial research are supported by simulations and calculatio...
Scheduling algorithms in parallel computers fall into two basic categories: time and space sharing a...
In the field of high performance computing (HPC), batch scheduling plays a critical role. They deter...
Modern high-performance computing (HPC) system designs have converged to heavyweight nodes with grow...
As part of the Massively Parallel Computing Initiative (MPCI) at the Lawrence Livermore National Lab...
The batch scheduler is an important system software serving as the interface between users and HPC s...
Modern high performance computing (HPC) systems exhibit rapid growth in size, both "horizontally" in...
Most space-sharing resources presently operated by high performance computing centers employ some so...
Taufer, MichelaHigh performance computing (HPC) is undergoing many changes at both the system and wo...
International audienceWe propose a novel job scheduling approach for homogeneous cluster computing p...
Typical HPC job scheduler software determines scheduling order by a linear sum of weighted priority ...
Computational scientists are eager to utilize computing resources to execute their applications to a...
Nowadays many supercomputer users are dissatisfied with a long waiting time for their jobs in the su...
Abstract—In order for many-task applications to be attrac-tive candidates for running on high-end su...
Next generation HPC applications will increasingly time-share system resources with emerging workloa...
Many breakthroughs in scientific and industrial research are supported by simulations and calculatio...
Scheduling algorithms in parallel computers fall into two basic categories: time and space sharing a...
In the field of high performance computing (HPC), batch scheduling plays a critical role. They deter...
Modern high-performance computing (HPC) system designs have converged to heavyweight nodes with grow...
As part of the Massively Parallel Computing Initiative (MPCI) at the Lawrence Livermore National Lab...
The batch scheduler is an important system software serving as the interface between users and HPC s...
Modern high performance computing (HPC) systems exhibit rapid growth in size, both "horizontally" in...
Most space-sharing resources presently operated by high performance computing centers employ some so...
Taufer, MichelaHigh performance computing (HPC) is undergoing many changes at both the system and wo...
International audienceWe propose a novel job scheduling approach for homogeneous cluster computing p...
Typical HPC job scheduler software determines scheduling order by a linear sum of weighted priority ...
Computational scientists are eager to utilize computing resources to execute their applications to a...
Nowadays many supercomputer users are dissatisfied with a long waiting time for their jobs in the su...