We present STATBench, an emulator of a scalable, lightweight, and effective tool to help debug extreme-scale parallel applications, the Stack Trace Analysis Tool (STAT). STAT periodically samples stack traces from application processes and organizes the samples into a call graph prex tree that depicts process equivalence classes based on trace similarities. We have de-veloped STATBench which only requires limited resources and yet allows us to evaluate the feasibility of and identify potential roadblocks to deploying STAT on entire large scale systems like the 131,072 processor BlueGene/L (BG/L) at Lawrence Livermore National Laboratory. In this paper, we describe the implementation of STATBench and show how our design strategy is generally...
With rising complexity of high performance computing systems and their parallel software, performanc...
To efficiently use the available parallelism in today's HPC systems, analyzing the performance of ap...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
We present the Stack Trace Analysis Tool (STAT) to aid in debugging extreme-scale applications. STAT...
Petascale systems will present several new challenges to performance and correctness tools. Such mac...
Petascale platforms with O(10{sup 5}) and O(10{sup 6}) processing cores are driving advancements in ...
Developing correct and efficient software for large scale systems is a challenging task. Developers ...
The massively parallel computer architectures emerged in the last years create the platform to redef...
Applications on todays massively parallel supercom-puters rely on performance analysis tools to guid...
Abstract—Benchmarks are essential for evaluating HPC hardware and software for petascale machines an...
ABSTRACT: Tracing allows the analysis of task interactions with each other and with the operating sy...
Concurrency levels in large-scale, distributed-memory supercomputers are rising exponentially. Moder...
Full-system simulators are increasingly finding their way into the consumer space for the purposes o...
We acknowledge funding by the EPSRC grant PAMELA EP/K008730/1.Full-system simulators are increasingl...
Abstract. Automatic trace analysis is an effective method of identifying complex performance phenome...
With rising complexity of high performance computing systems and their parallel software, performanc...
To efficiently use the available parallelism in today's HPC systems, analyzing the performance of ap...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
We present the Stack Trace Analysis Tool (STAT) to aid in debugging extreme-scale applications. STAT...
Petascale systems will present several new challenges to performance and correctness tools. Such mac...
Petascale platforms with O(10{sup 5}) and O(10{sup 6}) processing cores are driving advancements in ...
Developing correct and efficient software for large scale systems is a challenging task. Developers ...
The massively parallel computer architectures emerged in the last years create the platform to redef...
Applications on todays massively parallel supercom-puters rely on performance analysis tools to guid...
Abstract—Benchmarks are essential for evaluating HPC hardware and software for petascale machines an...
ABSTRACT: Tracing allows the analysis of task interactions with each other and with the operating sy...
Concurrency levels in large-scale, distributed-memory supercomputers are rising exponentially. Moder...
Full-system simulators are increasingly finding their way into the consumer space for the purposes o...
We acknowledge funding by the EPSRC grant PAMELA EP/K008730/1.Full-system simulators are increasingl...
Abstract. Automatic trace analysis is an effective method of identifying complex performance phenome...
With rising complexity of high performance computing systems and their parallel software, performanc...
To efficiently use the available parallelism in today's HPC systems, analyzing the performance of ap...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...