Pipelined wavefront computations are a ubiquitous class of parallel algorithm used for the solution of a number of scientific and engineering applications. This paper investigates three optimisations to the generic pipelined wavefront algorithm, which are investigated through the use of predictive analytic models. The modelling of potential optimisations is supported by a recently developed reusable LogGP-based analytic performance model, which allows the speculative evaluation, of each optimisation within the context of an industry-strength pipelined wavefront benchmark, developed and maintained by the United Kingdom Atomic Weapons Establishment (AWE). The paper details the quantitative and qualitative benefits of: (1) parallelising comput...
This paper develops a highly accurate LogGP model of a complex wavefront application that uses MPI c...
The advent of modern High Performance Computing (HPC) has facilitated the use of powerful supercompu...
The cost of state-of-the-art supercomputing resources makes each individual purchase a length and ex...
Pipelined wavefront computations are a ubiquitous class of parallel algorithm used for the solution ...
Pipelined wavefront computations are an ubiquitous class of high performance parallel algorithms us...
This paper details the development and application of a model for predictive performance analysis of...
The authors introduced a performance model for parallel, multidimensional, wavefront calculations wi...
This paper develops a plug-and-play reusable LogGP model that can be used to predict the runtime and...
The authors develop a model for the parallel performance of algorithms that consist of concurrent, t...
The authors develop a model for the parallel performance of algorithms that consist of concurrent, t...
Pipelined wavefront applications form a large portion of the high performance scientific computing w...
The authors develop a model for the parallel performance of algorithms that consist of concurrent, t...
In this paper we investigate the use of distributed graphics processing unit (GPU)-based architectur...
Abstract. Wavefront computations are common in scientific applications. Although it is well understo...
In this paper we investigate the use of distributed graphics processing unit (GPU)-based architectur...
This paper develops a highly accurate LogGP model of a complex wavefront application that uses MPI c...
The advent of modern High Performance Computing (HPC) has facilitated the use of powerful supercompu...
The cost of state-of-the-art supercomputing resources makes each individual purchase a length and ex...
Pipelined wavefront computations are a ubiquitous class of parallel algorithm used for the solution ...
Pipelined wavefront computations are an ubiquitous class of high performance parallel algorithms us...
This paper details the development and application of a model for predictive performance analysis of...
The authors introduced a performance model for parallel, multidimensional, wavefront calculations wi...
This paper develops a plug-and-play reusable LogGP model that can be used to predict the runtime and...
The authors develop a model for the parallel performance of algorithms that consist of concurrent, t...
The authors develop a model for the parallel performance of algorithms that consist of concurrent, t...
Pipelined wavefront applications form a large portion of the high performance scientific computing w...
The authors develop a model for the parallel performance of algorithms that consist of concurrent, t...
In this paper we investigate the use of distributed graphics processing unit (GPU)-based architectur...
Abstract. Wavefront computations are common in scientific applications. Although it is well understo...
In this paper we investigate the use of distributed graphics processing unit (GPU)-based architectur...
This paper develops a highly accurate LogGP model of a complex wavefront application that uses MPI c...
The advent of modern High Performance Computing (HPC) has facilitated the use of powerful supercompu...
The cost of state-of-the-art supercomputing resources makes each individual purchase a length and ex...