Abstract. We present PerfMiner, a system for the transparent collection, storage and presentation of thread-level hardware performance data across an entire cluster. Every sub-process/thread spawned by the user through the batch system is measured with near zero overhead and no dilation of run-time. Performance metrics are collected at the thread level using tool built on top of the Performance Application Programming Interface (PAPI). As the hardware counters are virtualized by the OS, the resulting counts are largely unaffected by other kernel or user processes. PerfMiner correlates this performance data with metadata from the batch system and places it in a database. Through a command line and web interface, the user can make queries to ...
Exascale systems will be heterogeneous architectures with multiple levels of concurrency and energy ...
Modern computing architectures change rapidly and exhibit high levels of complexity and heterogenei...
There is a growing need for systems that can monitor and analyze application performance data automa...
For industrial systems performance, it is desired to keep the IT infrastructure competitive through ...
In this paper we describe the architecture of PerfMC, a performance monitoring system for clusters o...
Project Specification Application performance is often assessed using the Performance Monitoring Un...
Profiling of an application identifies parts of the code being executed using the hardware performan...
As supercomputers continue to grow in scale and ca-pabilities, it is becoming increasingly difficult...
Large scale computer clusters have during the last years become dominant for making computations in ...
Identifying design patterns that limit the performance of multi-core algorithms is a challenging tas...
Performance Analysis is essential to fully exploit the potential of high-performance computers. With...
In this work, a standard and unified method for monitoring hardware accelerators in Reconfigurable C...
This paper introduces an infrastructure for efficiently collecting performance profiles from paralle...
AbstractThe task of performance analysis and optimization grows more and more challenging with the i...
As useful as performance counters are, the meaning of reported aggregate event counts is sometimes q...
Exascale systems will be heterogeneous architectures with multiple levels of concurrency and energy ...
Modern computing architectures change rapidly and exhibit high levels of complexity and heterogenei...
There is a growing need for systems that can monitor and analyze application performance data automa...
For industrial systems performance, it is desired to keep the IT infrastructure competitive through ...
In this paper we describe the architecture of PerfMC, a performance monitoring system for clusters o...
Project Specification Application performance is often assessed using the Performance Monitoring Un...
Profiling of an application identifies parts of the code being executed using the hardware performan...
As supercomputers continue to grow in scale and ca-pabilities, it is becoming increasingly difficult...
Large scale computer clusters have during the last years become dominant for making computations in ...
Identifying design patterns that limit the performance of multi-core algorithms is a challenging tas...
Performance Analysis is essential to fully exploit the potential of high-performance computers. With...
In this work, a standard and unified method for monitoring hardware accelerators in Reconfigurable C...
This paper introduces an infrastructure for efficiently collecting performance profiles from paralle...
AbstractThe task of performance analysis and optimization grows more and more challenging with the i...
As useful as performance counters are, the meaning of reported aggregate event counts is sometimes q...
Exascale systems will be heterogeneous architectures with multiple levels of concurrency and energy ...
Modern computing architectures change rapidly and exhibit high levels of complexity and heterogenei...
There is a growing need for systems that can monitor and analyze application performance data automa...