Exascale systems will be heterogeneous architectures with multiple levels of concurrency and energy constraints. In such a complex scenario, performance monitoring and runtime systems play a major role in obtaining good application performance and scalability. Furthermore, online access to performance data becomes a necessity for deciding how to schedule resources and orchestrate computational elements: processes, threads, tasks, etc. We present the Performance Introspection API, an extension of the IPM tool that provides online runtime access to performance data from an application while it runs. We describe its design and implementation and show its overhead on several test benchmarks. We also present a real test case using the Performance Intr...
Modern memory systems play a critical role in the performance of applications, but a detailed unders...
The purpose of this project was to build an extensible cross-platform infrastructure to facilitate t...
Traditional means of gathering performance data are tracing, which is limited by the available stor...
This paper introduces an infrastructure for efficiently collecting performance profiles from paralle...
Traditional performance analysis techniques are performed after a parallel program has comp...
The IPS-2 parallel program measurement tools provide performance data from application programs, th...
Extreme-scale computing requires a new perspective on the role of performance observation in the Exa...
On the road to Exascale computing, both performance and power areas are meant to be tackled at diffe...
For industrial systems performance, it is desired to keep the IT infrastructure competitive through ...
Future supercomputers will require application developers to expose much more parallelism than curre...
The form to collect the data from the internal sensors and the hardware parameters, for each comp...
Identifying design patterns that limit the performance of multi-core algorithms is a challenging tas...
In this paper, we present VIProf, a full-system, performance sampling system capable of extracting r...
The task of performance analysis and optimization grows more and more challenging with the i...
With high-end systems featuring multicore/multithreaded processors and high component density, powe...