Program activity graphs (PAGs) can be constructed from timestamped traces of appropriate execution events. Information about the activities on the k longest execution paths is useful in the analysis of parallel program performance. In this paper, four algorithms for finding the near--critical paths of PAGs are compared, including a best--first search (BFS) algorithm that is worst--case asymptotically optimal in terms of both time and space. Results confirming the practical efficiency of the BFS algorithm are presented for five application programs. A framework for using the near--critical path information is also described. The framework includes statistical summaries and visualization capabilities that build upon the foundation of existing...
The amount of parallelism in modern supercomputers currently grows from generation to generation. Fu...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
The use of accelerators in heterogeneous systems is an established approach in designing petascale a...
A programming tool that performs analysis of critical paths for parallel programs has been developed...
The critical path is one of the fundamental runtime characteristics of a parallel program. It identi...
Efficient performance tuning of parallel programs is often hard. Optimization is often done when the...
Bottlenecks and imbalance in parallel programs can significantly affect performance of parallel exec...
The evolution of computers is moving more and more towards multi-core processors and parallel progra...
[[abstract]]It is shown that during performance optimization some false paths may become sensitizabl...
Critical Path Profiling is a technique that provides guidance to help programmers try to improve the...
Recent research on processor microarchitecture suggests using instruction criticality as a metric to...
Detecting critical paths in traditional message pass-ing parallel programs can be useful for post-mo...
The dynamic evaluation of parallelizing compilers and the programs to which they are applied is a fi...
Hard real-time systems require tasks to finish in time. To guarantee the timeliness of such a system...
Many important workloads today, such as web-hosted services, are limited not by processor core perfo...
The amount of parallelism in modern supercomputers currently grows from generation to generation. Fu...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
The use of accelerators in heterogeneous systems is an established approach in designing petascale a...
A programming tool that performs analysis of critical paths for parallel programs has been developed...
The critical path is one of the fundamental runtime characteristics of a parallel program. It identi...
Efficient performance tuning of parallel programs is often hard. Optimization is often done when the...
Bottlenecks and imbalance in parallel programs can significantly affect performance of parallel exec...
The evolution of computers is moving more and more towards multi-core processors and parallel progra...
[[abstract]]It is shown that during performance optimization some false paths may become sensitizabl...
Critical Path Profiling is a technique that provides guidance to help programmers try to improve the...
Recent research on processor microarchitecture suggests using instruction criticality as a metric to...
Detecting critical paths in traditional message pass-ing parallel programs can be useful for post-mo...
The dynamic evaluation of parallelizing compilers and the programs to which they are applied is a fi...
Hard real-time systems require tasks to finish in time. To guarantee the timeliness of such a system...
Many important workloads today, such as web-hosted services, are limited not by processor core perfo...
The amount of parallelism in modern supercomputers currently grows from generation to generation. Fu...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
The use of accelerators in heterogeneous systems is an established approach in designing petascale a...