The authors develop a model for the parallel performance of algorithms that consist of concurrent, two-dimensional wavefronts implemented in a message passing environment. The model, based on a LogGP machine parameterization, combines the separate contributions of computation and communication wavefronts. They validate the model on three important supercomputer systems, on up to 500 processors. They use data from a deterministic particle transport application taken from the ASCI workload, although the model is general to any wavefront algorithm implemented on a 2-D processor domain. They also use the validated model to make estimates of performance and scalability of wavefront algorithms on 100-TFLOPS computer systems expected to be in exis...
In distributed and vectorized computing there is a large number of highly different supercomputing p...
The IBM ASCI Blue-Pacific System is a scalable, distributed/shared memory architecture designed to r...
233 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1987.The peak performance of a mul...
The authors develop a model for the parallel performance of algorithms that consist of concurrent, t...
The authors develop a model for the parallel performance of algorithms that consist of concurrent, t...
The authors introduced a performance model for parallel, multidimensional, wavefront calculations wi...
This paper develops a plug-and-play reusable LogGP model that can be used to predict the runtime and...
Pipelined wavefront computations are a ubiquitous class of parallel algorithm used for the solution ...
We study, using analytic models and simulation, the performance of the multifrontal methods on distr...
This paper develops a highly accurate LogGP model of a complex wavefront application that uses MPI c...
In this paper we investigate the use of distributed graphics processing unit (GPU)-based architectur...
In this paper we investigate the use of distributed graphics processing unit (GPU)-based architectur...
The two key factors affecting the performance of tera-scale computations are the parallel efficiency...
This paper develops a highly accurate LogGP model of a complex wavefront application that uses MPI c...
This thesis presents the results of a simulation study of the performance of a message-passing multi...
In distributed and vectorized computing there is a large number of highly different supercomputing p...
The IBM ASCI Blue-Pacific System is a scalable, distributed/shared memory architecture designed to r...
233 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1987.The peak performance of a mul...
The authors develop a model for the parallel performance of algorithms that consist of concurrent, t...
The authors develop a model for the parallel performance of algorithms that consist of concurrent, t...
The authors introduced a performance model for parallel, multidimensional, wavefront calculations wi...
This paper develops a plug-and-play reusable LogGP model that can be used to predict the runtime and...
Pipelined wavefront computations are a ubiquitous class of parallel algorithm used for the solution ...
We study, using analytic models and simulation, the performance of the multifrontal methods on distr...
This paper develops a highly accurate LogGP model of a complex wavefront application that uses MPI c...
In this paper we investigate the use of distributed graphics processing unit (GPU)-based architectur...
In this paper we investigate the use of distributed graphics processing unit (GPU)-based architectur...
The two key factors affecting the performance of tera-scale computations are the parallel efficiency...
This paper develops a highly accurate LogGP model of a complex wavefront application that uses MPI c...
This thesis presents the results of a simulation study of the performance of a message-passing multi...
In distributed and vectorized computing there is a large number of highly different supercomputing p...
The IBM ASCI Blue-Pacific System is a scalable, distributed/shared memory architecture designed to r...
233 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1987.The peak performance of a mul...