Concurrency levels in large-scale supercomputers are rising exponentially, and shared-memory nodes with hundreds of cores and non-uniform memory access latencies are expected within the next decade. However, even current petascale systems with tens of cores per node suffer from memory bottlenecks. As core counts increase, memory issues will become critical for the performance of large-scale supercomputers. Trace analysis tools are thus vital for diagnosing the root causes of memory problems. However, existing memory tracing tools are expensive due to prohibitively large trace sizes, or they collect only statistical summaries and omit potentially valuable information. In this paper, we present ScalaMemTrace, a novel technique for collecting ...
A powerful and widely-used method for analyzing the performance behavior of parallel programs is eve...
Memory tracing (executing additional code for every memory access of a program) is a powerful techni...
While much current research concerns multiprocessor design, few traces of parallel programs are avai...
Abstract—Analyzing the memory traces of multithreaded programs is a cumbersome and expensive process...
Event tracing of applications under dynamic execution is crucial for performance modeling, optimizat...
The divergence between processor and memory performance has been a well discussed aspect of computer...
Abstract. Performance analysis tools are an important component of the parallel program development ...
Performance analysis tools are an important component of the parallel program development and tuning...
A powerful and widely-used method for analyzing the performance behavior of parallel programs is eve...
This paper extends results concerning the recovery of accurate parallel program traces from corrupte...
Scaling a parallel program to modern supercomputers is challenging due to inter-process communicatio...
Characterizing the communication behavior of large-scale applications is a difficult and costly task...
//TRACE1 is a new approach for extracting and replaying traces of parallel applications to recreate ...
Event traces are required to correctly diagnose a number of performance problems that arise on today...
Mueller). Characterizing the communication behavior of large-scale applications is a difficult and c...
A powerful and widely-used method for analyzing the performance behavior of parallel programs is eve...
Memory tracing (executing additional code for every memory access of a program) is a powerful techni...
While much current research concerns multiprocessor design, few traces of parallel programs are avai...
Abstract—Analyzing the memory traces of multithreaded programs is a cumbersome and expensive process...
Event tracing of applications under dynamic execution is crucial for performance modeling, optimizat...
The divergence between processor and memory performance has been a well discussed aspect of computer...
Abstract. Performance analysis tools are an important component of the parallel program development ...
Performance analysis tools are an important component of the parallel program development and tuning...
A powerful and widely-used method for analyzing the performance behavior of parallel programs is eve...
This paper extends results concerning the recovery of accurate parallel program traces from corrupte...
Scaling a parallel program to modern supercomputers is challenging due to inter-process communicatio...
Characterizing the communication behavior of large-scale applications is a difficult and costly task...
//TRACE1 is a new approach for extracting and replaying traces of parallel applications to recreate ...
Event traces are required to correctly diagnose a number of performance problems that arise on today...
Mueller). Characterizing the communication behavior of large-scale applications is a difficult and c...
A powerful and widely-used method for analyzing the performance behavior of parallel programs is eve...
Memory tracing (executing additional code for every memory access of a program) is a powerful techni...
While much current research concerns multiprocessor design, few traces of parallel programs are avai...