The efficient parallel execution of scientific applications is a key challenge in high-performance computing (HPC). With growing parallelism and heterogeneity of compute resources as well as increasingly complex software, performance analysis has become an indispensable tool in the development and optimization of parallel programs. This thesis presents a framework for systematic performance analysis of scalable, heterogeneous applications. Based on event traces, it automatically detects the critical path and inefficiencies that result in waiting or idle time, e.g. due to load imbalances between parallel execution streams. As a prerequisite for the analysis of heterogeneous programs, this thesis specifies inefficiency patterns for computatio...
HPC application developers encounter significant challenges getting their codes to run correctly on ...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...
Cutting-edge science and engineering applications require petascale computing. Petascale computing p...
High Performance Computing is now a strategic resource, as it makes it possible to simulate complex phenomena in...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
Performance engineering is a fundamental task in high-performance computing (HPC). By definition, HP...
The use of accelerators in heterogeneous systems is an established approach in designing petascale a...
Heterogeneous platforms are mixes of different processing units in a compute node (e.g., CPUs+GPUs, ...
The amount of parallelism in modern supercomputers currently grows from generation to generation. Fu...
This thesis deals with the problem of finding effective methods for programming and distributing dat...
The amount of parallelism in modern supercomputers currently grows from generation to generation, an...
With the plateauing of processor frequencies and increase in energy consumption in computing, applic...
Performance analysis is the task of monitoring the behavior of a program execution. The main goal is to...
Programming paradigms in High-Performance Computing have been shifting towards...
To help shrink the programmability-performance efficiency gap, we argue that adaptive runtime syst...