© 2017, The Author(s). A number of scientific applications run on current HPC systems would benefit from an approximate assessment of parallel overhead. In many instances a quick and simple method to obtain a general overview on the subject is regarded useful auxiliary information by the routine HPC user. Here we present such a method using just execution times for increasing numbers of parallel processing cores. We start out with several common scientific applications and measure the fraction of time spent in MPI communication. Forming the ratio of MPI time to overall execution time we obtain a smooth curve that can be parameterized by only two constants. We then use this two-parameter expression and extend Amdahl’s theorem with a new term...
We propose a new model for parallel speedup that is based on two parameters, the average parallelism...
We propose a massively parallel framework termed a parallel-pipeline model of execution that can be ...
Estimating I/O time of applications is critical for computing sys-tem research and developments, suc...
© 2018 The Author(s). Porting scientific key algorithms to HPC architectures requires a thorough und...
Abstract. In this paper we estimate parallel execution times, based on identifying separate “parts ”...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...
A considerably fraction of science discovery is nowadays relying on computer simulations. High Per...
HPC application developers encounter significant challenges getting their codes to run correctly on ...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
The amount of parallelism in modern supercomputers currently grows from generation to generation. Fu...
this paper, we show some performance results from an implemention of a data-parallel programming lan...
In high-performance computing (HPC) environments, an appropriate amount of hardware resources must b...
Abstract — There is widespread belief in the computer science community that MPI is a difficult and ...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
Many parallel algorithm design models have been proposed for abstracting a large class of parallel a...
We propose a new model for parallel speedup that is based on two parameters, the average parallelism...
We propose a massively parallel framework termed a parallel-pipeline model of execution that can be ...
Estimating I/O time of applications is critical for computing sys-tem research and developments, suc...
© 2018 The Author(s). Porting scientific key algorithms to HPC architectures requires a thorough und...
Abstract. In this paper we estimate parallel execution times, based on identifying separate “parts ”...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...
A considerably fraction of science discovery is nowadays relying on computer simulations. High Per...
HPC application developers encounter significant challenges getting their codes to run correctly on ...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
The amount of parallelism in modern supercomputers currently grows from generation to generation. Fu...
this paper, we show some performance results from an implemention of a data-parallel programming lan...
In high-performance computing (HPC) environments, an appropriate amount of hardware resources must b...
Abstract — There is widespread belief in the computer science community that MPI is a difficult and ...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
Many parallel algorithm design models have been proposed for abstracting a large class of parallel a...
We propose a new model for parallel speedup that is based on two parameters, the average parallelism...
We propose a massively parallel framework termed a parallel-pipeline model of execution that can be ...
Estimating I/O time of applications is critical for computing sys-tem research and developments, suc...