We present a technique for performance analysis that helps users understand the communication behavior of their message-passing applications. Our method automatically classifies individual communication operations and reveals the cause of communication inefficiencies in the application. This classification allows the developer to focus quickly on the culprits of truly inefficient behavior, rather than manually foraging through massive amounts of performance data. Specifically, we trace the message operations of MPI applications and then classify each individual communication event using decision tree classification, a supervised learning technique. We train our decision tree using microbenchmarks that demonstrate both efficient and ineff...
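As a rough illustration of the idea in this abstract, the following minimal sketch trains a small decision tree on labeled "microbenchmark" events and then classifies new communication events. Everything specific here is an assumption made for illustration: the two features (`sender_delay_ms`, `receiver_delay_ms`), the class labels, and the training values are hypothetical and do not come from the paper, which does not list its features in the text above. The tree uses a plain Gini-impurity split, one common way to build a decision tree, not necessarily the authors' variant.

```python
from collections import Counter

# Hypothetical training events, standing in for microbenchmark traces.
# Each row is (sender_delay_ms, receiver_delay_ms); the labels are illustrative.
TRAIN = [
    ((0.1, 0.2), "normal"), ((0.3, 0.1), "normal"), ((0.2, 0.3), "normal"),
    ((5.0, 0.2), "late_sender"), ((7.5, 0.1), "late_sender"), ((6.0, 0.3), "late_sender"),
    ((0.2, 4.0), "late_receiver"), ((0.1, 6.5), "late_receiver"), ((0.3, 5.5), "late_receiver"),
]


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def best_split(rows, labels):
    """Return (weighted impurity, feature index, threshold), or None if no split exists."""
    best = None
    for f in range(len(rows[0])):
        values = sorted(set(r[f] for r in rows))
        # Candidate thresholds are midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [l for r, l in zip(rows, labels) if r[f] <= t]
            right = [l for r, l in zip(rows, labels) if r[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
            if best is None or score < best[0]:
                best = (score, f, t)
    return best


def build_tree(rows, labels, depth=0, max_depth=3):
    """Build a tree as nested tuples (feature, threshold, left, right); leaves are labels."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or depth == max_depth:
        return majority
    split = best_split(rows, labels)
    if split is None:  # all rows have identical features
        return majority
    _, f, t = split
    left = [(r, l) for r, l in zip(rows, labels) if r[f] <= t]
    right = [(r, l) for r, l in zip(rows, labels) if r[f] > t]
    return (
        f,
        t,
        build_tree([r for r, _ in left], [l for _, l in left], depth + 1, max_depth),
        build_tree([r for r, _ in right], [l for _, l in right], depth + 1, max_depth),
    )


def classify(tree, row):
    """Walk the tree from the root until a leaf label is reached."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if row[f] <= t else right
    return tree


TREE = build_tree([r for r, _ in TRAIN], [l for _, l in TRAIN])

if __name__ == "__main__":
    # Classify a few hypothetical events taken from an application trace.
    for event in [(6.2, 0.2), (0.2, 5.0), (0.2, 0.2)]:
        print(event, "->", classify(TREE, event))
```

In a real workflow the rows would be derived from the timestamps of traced MPI operations rather than typed in by hand, and the label set would follow whatever behaviors the microbenchmarks are designed to exhibit.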
The performance of the message-passing applications on a parallel system can vary and cause inefficien...
The significant gap between peak and realized performance of parallel machines motivates the need fo...
The ability to understand the factors contributing to parallel program performance is vital for und...
Applications may have unintended performance problems in spite of compiler optimizations, because of...
Event tracing is a powerful method for analyzing the performance behavior of parallel applications. ...
In the MPI parallel programming model, communication remains the bottleneck that prevents applicatio...
Analyzing parallel programs has become increasingly difficult due to the immense amount of informati...
Developing an efficient parallel application is not an easy task, and achieving a good performance r...
Today most complex scientific applications require a large number of calculations to solve a partic...
In this article we present a novel approach to analyzing performance properties of parallel applicat...
Often parallel scientific applications are instrumented and traces are collected and analyzed to ide...
Parallel computers with SMP nodes provide both multithreading and message passing as their modes of ...
Network congestion is one of the primary causes of performance degradation, performance var...
The goal of high performance computing is executing very large problems in the least amount of time,...
The amount of parallelism in modern supercomputers currently grows from generation to generation. Fu...