The authors develop a model for the parallel performance of algorithms that consist of concurrent, two-dimensional wavefronts implemented in a message-passing environment. The model, based on a LogGP machine parameterization, combines the separate contributions of computation and communication wavefronts. The authors validate the model on three important supercomputer systems, on up to 500 processors. They use data from a deterministic particle transport application taken from the ASCI workload, although the model is general to any wavefront algorithm implemented on a 2-D processor domain. They also use the validated model to make estimates of performance and scalability of wavefront algorithms on 100 TFLOPS computer systems expected to be ...
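As an illustration of how a LogGP parameterization can combine computation and communication terms for a pipelined wavefront on a 2-D processor grid, here is a minimal sketch. It is not the authors' model. The names `LogGP`, `message_time`, and `wavefront_sweep_time` are hypothetical, as is the assumption that each pipeline step costs one block of computation plus two blocking messages (one per downstream neighbour). Only the single-message cost `o + (k-1)G + L + o` is the standard LogGP expression.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LogGP:
    """LogGP machine parameters (hypothetical container for this sketch)."""
    L: float  # network latency, seconds
    o: float  # CPU overhead per send or receive, seconds
    g: float  # minimum gap between consecutive messages (unused here)
    G: float  # gap per byte for long messages, seconds per byte
    P: int    # number of processors (unused here)


def message_time(m: LogGP, nbytes: int) -> float:
    """End-to-end LogGP time for one message of nbytes: o + (k-1)G + L + o."""
    return m.o + (nbytes - 1) * m.G + m.L + m.o


def wavefront_sweep_time(m: LogGP, px: int, py: int, n_blocks: int,
                         t_block: float, nbytes: int) -> float:
    """Estimate one sweep across a px-by-py processor grid.

    Assumption (not from the source): every pipeline step costs one block of
    computation plus two messages, one to each downstream neighbour.
    """
    step = t_block + 2 * message_time(m, nbytes)
    # The last processor starts only after the wavefront has crossed the grid.
    fill_steps = (px - 1) + (py - 1)
    return (fill_steps + n_blocks) * step


if __name__ == "__main__":
    machine = LogGP(L=10e-6, o=2e-6, g=4e-6, G=5e-9, P=16)  # made-up values
    t = wavefront_sweep_time(machine, px=4, py=4, n_blocks=100,
                             t_block=1e-3, nbytes=8192)
    print(f"estimated sweep time: {t:.6f} s")
```

The pipeline-fill term `(px - 1) + (py - 1)` is what limits scalability in this simplified picture: as the processor grid grows, more steps are spent filling the pipeline relative to the `n_blocks` steps of useful work.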
This thesis presents the results of a simulation study of the performance of a message-passing multi...
The IBM ASCI Blue-Pacific System is a scalable, distributed/shared memory architecture designed to r...
This dissertation presents a parallel pipelined computational model for radar signal processing appl...
The authors introduced a performance model for parallel, multidimensional, wavefront calculations wi...
This paper develops a plug-and-play reusable LogGP model that can be used to predict the runtime and...
Pipelined wavefront computations are a ubiquitous class of parallel algorithm used for the solution ...
This paper develops a highly accurate LogGP model of a complex wavefront application that uses MPI c...
In this paper we investigate the use of distributed graphics processing unit (GPU)-based architectur...
We study, using analytic models and simulation, the performance of the multifrontal methods on distr...
The two key factors affecting the performance of tera-scale computations are the parallel efficiency...
In distributed and vectorized computing there is a large number of highly different supercomputing p...