There are many metrics designed to assist in the performance debugging of large-scale parallel applications. We describe a new technique, called True Zeroing, that permits direct quantitative comparison of the guidance supplied by these metrics on real applications. We apply this technique to three programs that include both numeric and symbolic applications. We compare three existing metrics: Gprof, Critical Path, and Quartz/NPT, and several new variations. Critical Path provided the best overall guidance, but it was not infallible. We also include a set of recommendations to tool builders based on the experience gained during our case study. 1. Introduction Performance metrics are a tool to help programmers reduce the running time of th...
Modern supercomputers deliver large computational power, but it is difficult for an application to e...
[[abstract]]©1988 North-Holland-The authors outline an approach to the design of a set of interactiv...
With rising complexity of high performance computing systems and their parallel software, performanc...
There are many metrics designed to assist in the performance debugging of large-scale parallel appli...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
Critical Path Profiling is a technique that provides guidance to help programmers try to improve the...
AbstractIn software development, a metric is the measurement of some characteristic of a program's p...
Traditional performance debugging and tuning of parallel programs is based on the "measure-modify" a...
Achieving a significant fraction of peak performance on a modern high-performance computer is a chal...
A new approach to monitoring the runtime behaviour of parallel programs will be presented. Our appro...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
The IPS-2 parallel program measurement tools pro-vide performance data from application programs, th...
This paper presents scalability as a basis for profiling and performance debugging of parallel progr...
This paper presents a profiling tool that allows the programmer to identify the regions of the progr...
Modern supercomputers deliver large computational power, but it is difficult for an application to e...
[[abstract]]©1988 North-Holland-The authors outline an approach to the design of a set of interactiv...
With rising complexity of high performance computing systems and their parallel software, performanc...
There are many metrics designed to assist in the performance debugging of large-scale parallel appli...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
Critical Path Profiling is a technique that provides guidance to help programmers try to improve the...
AbstractIn software development, a metric is the measurement of some characteristic of a program's p...
Traditional performance debugging and tuning of parallel programs is based on the "measure-modify" a...
Achieving a significant fraction of peak performance on a modern high-performance computer is a chal...
A new approach to monitoring the runtime behaviour of parallel programs will be presented. Our appro...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
The IPS-2 parallel program measurement tools pro-vide performance data from application programs, th...
This paper presents scalability as a basis for profiling and performance debugging of parallel progr...
This paper presents a profiling tool that allows the programmer to identify the regions of the progr...
Modern supercomputers deliver large computational power, but it is difficult for an application to e...
[[abstract]]©1988 North-Holland-The authors outline an approach to the design of a set of interactiv...
With rising complexity of high performance computing systems and their parallel software, performanc...