Developing distributed and parallel programs on today's multiprocessor architectures is still a challenging task. Particularly distressing is the lack of effective performance tools that support the programmer in evaluating changes in code, problem and machine sizes, and target architectures. In this paper we introduce P3T+, a performance estimator for mostly regular HPF (High Performance Fortran) programs that also partially covers message passing programs (MPI). P3T+ is unique in modeling programs, compiler code transformations, and parallel and distributed architectures. It computes at compile time a variety of performance parameters including work distribution, number of transfers, amount of data transferred, transfer times, comput...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
Performance prediction is necessary and crucial in order to deal with multi-dimensional performance ...
P3T is an interactive performance estimator that assists users in performance tuning of scientif...
The area of parallelizing compilers for distributed memory multicomputers has seen considerable rese...
Despite the performance potential of parallel systems, several factors have hindered their widesprea...
Loops represent the core of most applications in that they contain the bulk of the computations. An ...
Most of the current compiler projects for distributed memory architectures leave the critical and ti...
The most important features that a parallel programming language should provide are portability, mod...
This paper presents a model to evaluate the performance and overhead of parallelizing sequential cod...
This paper presents a profiling tool that allows the programmer to identify the regions of the progr...
High Performance Fortran (HPF) is a data-parallel Fortran for Distributed Memory Multiprocessors. ...
The performance of a computer system is important. One way of improving performance is to use multip...
pC++ is a language extension to C++ designed to allow programmers to compose distributed data struct...
Data-parallel languages allow programmers to use the familiar machine-independent programming style ...