Tuning the performance of applications requires understanding the interactions between code and target architecture. This paper describes a performance modeling approach that not only makes accurate predictions about the behavior of an application on a target architecture for different inputs, but also provides guidance for tuning by highlighting the factors that limit performance in each section of a program. We introduce two new performance metrics that estimate the maximum gain expected from tuning different parts of an application, or from increasing the number of machine resources. We show how this metric helped identify a bottleneck in the ASCI Sweep3D benchmark where the lack of instruction-level parallelism limited performance. Tran...
UnrestrictedThe enormous and growing complexity of today's high-end systems has increased the alread...
New approaches are necessary to generate performance models in current systems due the het erogeneit...
We have developed an environment, based upon robust, existing, open source software, for tuning appl...
Tuning the performance of applications requires understanding the interactions between code and targ...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
Abstract—A typical application tuning cycle repeats the fol-lowing three steps in a loop: performanc...
Modern supercomputers deliver large computational power, but it is difficult for an application to e...
As the speed gap widens between CPU and memory, memory hierarchy performance has become the bottlene...
Performance tuning, as carried out by compiler designers and application programmers to close the pe...
Applications may have unintended performance problems in spite of compiler optimizations, because of...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
ABSTRACT Goal-Directed Performance Tuning for Scientific Applications by Tien-Pao Shih Chair: Edward...
An effective methodology of performance evaluation and improvement enables application developers to...
Application performance often depends on achieved memory bandwidth. Achieved memory bandwidth varies...
Traditional means of gathering performance data are trac-ing, which is limited by the available stor...
UnrestrictedThe enormous and growing complexity of today's high-end systems has increased the alread...
New approaches are necessary to generate performance models in current systems due the het erogeneit...
We have developed an environment, based upon robust, existing, open source software, for tuning appl...
Tuning the performance of applications requires understanding the interactions between code and targ...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
Abstract—A typical application tuning cycle repeats the fol-lowing three steps in a loop: performanc...
Modern supercomputers deliver large computational power, but it is difficult for an application to e...
As the speed gap widens between CPU and memory, memory hierarchy performance has become the bottlene...
Performance tuning, as carried out by compiler designers and application programmers to close the pe...
Applications may have unintended performance problems in spite of compiler optimizations, because of...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
ABSTRACT Goal-Directed Performance Tuning for Scientific Applications by Tien-Pao Shih Chair: Edward...
An effective methodology of performance evaluation and improvement enables application developers to...
Application performance often depends on achieved memory bandwidth. Achieved memory bandwidth varies...
Traditional means of gathering performance data are trac-ing, which is limited by the available stor...
UnrestrictedThe enormous and growing complexity of today's high-end systems has increased the alread...
New approaches are necessary to generate performance models in current systems due the het erogeneit...
We have developed an environment, based upon robust, existing, open source software, for tuning appl...