© 2018 The Author(s). Porting scientific key algorithms to HPC architectures requires a thorough understanding of the subtle balance between gain in performance and introduced overhead. Here we continue the development of our recently proposed technique that uses plain execution times to predict the extent of parallel overhead. The focus here is on an analytic solution that takes into account as many data points as there are unknowns, i.e. model parameters. A test set of 9 applications frequently used in scientific computing can be well described by the suggested model even including atypical cases that were originally not considered part of the development. However, the choice about which particular set of explicit data points will lead to...
Energy efficiency has become increasingly important in high performance computing (HPC), as power co...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
We address the problem of performance prediction for parallel programs executed on clusters of heter...
© 2017, The Author(s). A number of scientific applications run on current HPC systems would benefit ...
A method is presented for modeling application performance on parallel computers in terms of the per...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...
Abstract. In this paper we estimate parallel execution times, based on identifying separate “parts ”...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
The increase in the use of parallel distributed architec-tures in order to solve large-scale scienti...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
The next-generation of supercomputers will feature a diverse mix of accelerator devices. The increas...
Prediction of the performance of parallel applications is a concept useful in several domains of sof...
Many parallel algorithm design models have been proposed for abstracting a large class of parallel a...
Many libraries in the HPC field use sophisticated algorithms with clear theoretical scalability expe...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scien...
Energy efficiency has become increasingly important in high performance computing (HPC), as power co...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
We address the problem of performance prediction for parallel programs executed on clusters of heter...
© 2017, The Author(s). A number of scientific applications run on current HPC systems would benefit ...
A method is presented for modeling application performance on parallel computers in terms of the per...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...
Abstract. In this paper we estimate parallel execution times, based on identifying separate “parts ”...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
The increase in the use of parallel distributed architec-tures in order to solve large-scale scienti...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
The next-generation of supercomputers will feature a diverse mix of accelerator devices. The increas...
Prediction of the performance of parallel applications is a concept useful in several domains of sof...
Many parallel algorithm design models have been proposed for abstracting a large class of parallel a...
Many libraries in the HPC field use sophisticated algorithms with clear theoretical scalability expe...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scien...
Energy efficiency has become increasingly important in high performance computing (HPC), as power co...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
We address the problem of performance prediction for parallel programs executed on clusters of heter...