The KOJAK toolkit has been augmented with refined hardware performance counter support, including more convenient measurement specification, additional metric derivations and hierarchical structuring, and an extended algebra for integrating multiple experiments. Comprehensive automated analysis of a hybrid OpenMP/MPI parallel program, the ASC Purple sPPM benchmark, is demonstrated with performance experiments on equisized POWER4-II-based IBM Regatta p690+ cluster, Opteron-based Cray XD1 cluster and UltraSPARC-IV-based Sun Fire E25000 systems. Automatically assessed communication and synchronisation performance properties, combined with a rich set of measured and derived counter metrics, provide a holistic analysis context and facilitate mul...
High performance computing is playing an increasingly important role in the scientific community. As...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
The CPUs, memory, interconnection network, operating system, runtime system, I/O subsystem, and appl...
The KOJAK toolkit has been augmented with refined hardware performance counter support, including mo...
Abstract. Today’s parallel computers with SMP nodes provide both multithread-ing and message passing...
The purpose of this project was to build an extensible cross-platform infrastructure to facilitate t...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
In this dissertation, we demonstrate that it is possible to develop methods of empirical hardware-co...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
In this paper we present a new technique for automatically measuring the performance of tasks, funct...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
The IPS-2 parallel program measurement tools pro-vide performance data from application programs, th...
High performance computing is playing an increasingly important role in the scientific community. As...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
The CPUs, memory, interconnection network, operating system, runtime system, I/O subsystem, and appl...
The KOJAK toolkit has been augmented with refined hardware performance counter support, including mo...
Abstract. Today’s parallel computers with SMP nodes provide both multithread-ing and message passing...
The purpose of this project was to build an extensible cross-platform infrastructure to facilitate t...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
In this dissertation, we demonstrate that it is possible to develop methods of empirical hardware-co...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
In this paper we present a new technique for automatically measuring the performance of tasks, funct...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
The IPS-2 parallel program measurement tools pro-vide performance data from application programs, th...
High performance computing is playing an increasingly important role in the scientific community. As...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
The CPUs, memory, interconnection network, operating system, runtime system, I/O subsystem, and appl...