Finite difference methods continue to provide an important and parallelisable approach to many numerical simulation problems. Iterative multigrid and multilevel algorithms can converge faster than ordinary finite difference methods but can be more difficult to parallelise. Data parallel paradigms lend themselves particularly well to solving regular mesh PDEs, where low latency communications and high compute-to-communication ratios can yield high levels of computational efficiency and raw performance. We report on some practical algorithmic and data layout approaches and on performance data on a range of Graphical Processing Units (GPUs) with CUDA. We focus on the use of multiple GPU devices with a single CPU host.
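To make the data-parallel structure concrete, here is a minimal sketch in plain Python rather than CUDA, and not the authors' implementation. It assumes a 2D Laplace problem with a 5-point Jacobi stencil. The function names `jacobi_sweep` and `split_sweep` and the two-block row decomposition are hypothetical, chosen only for illustration. Each interior point is updated from its four neighbours in the previous iterate, so all updates are independent. Splitting the mesh into row blocks requires only one extra "halo" row per block, which is the kind of exchange a multi-device layout would need between sweeps.

```python
def jacobi_sweep(u):
    """One Jacobi sweep of the 5-point Laplace stencil.

    The first/last rows and columns are treated as fixed boundary values.
    Every interior update reads only the old grid, so the updates are
    independent of one another (the data-parallel property).
    """
    n, m = len(u), len(u[0])
    new = [row[:] for row in u]
    for i in range(1, n - 1):
        for j in range(1, m - 1):
            new[i][j] = 0.25 * (u[i - 1][j] + u[i + 1][j]
                                + u[i][j - 1] + u[i][j + 1])
    return new


def split_sweep(u, cut):
    """Same sweep, with rows split into two blocks at row index `cut`.

    Hypothetical stand-in for two devices: each block carries one halo row
    copied from its neighbour, then sweeps independently.
    """
    top = jacobi_sweep(u[:cut + 1])     # rows 0..cut; row `cut` is the halo
    bottom = jacobi_sweep(u[cut - 1:])  # rows cut-1..end; row cut-1 is the halo
    return top[:cut] + bottom[1:]       # drop the halo rows and reassemble


# Small usage example: 6x6 mesh, top boundary held at 1.0, the rest at 0.0.
grid = [[1.0] * 6] + [[0.0] * 6 for _ in range(5)]
whole = jacobi_sweep(grid)
halved = split_sweep(grid, 3)
assert whole == halved
```

Because both versions perform the same arithmetic on the same operands, the split sweep reproduces the single-block result exactly. In an iterative solver the halo rows would have to be refreshed from the neighbouring block before every sweep.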
The Fast Multipole Method allows the rapid evaluation of sums of radial basis functions centered at ...
Graphics processor units (GPUs) that were originally designed for graphics rendering have emerged as m...
This book brings together research on numerical methods adapted for Graphics Processing Units (GPUs)...
The continued development of improved algorithms and architecture for numerical simulations is at th...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
A new high-performance general-purpose graphics processing unit (GPGPU) computational fluid dynamics...
Graphical processing Units (GPUs) are finding widespread use as accelerators in computer clusters. I...
Modern GPUs (graphical processing units) are a common source of processing power in many supercompute...
Abstract. Fast, robust and efficient multigrid solvers are a key numerical tool in the solution of ...
Processor technology is still dramatically advancing and promises enormous improvements in processin...
Abstract. Algebraic multigrid methods for large, sparse linear systems are a necessity in many compu...
A finite element code is developed in which all computationally expensive steps are performed on a gra...
This paper discusses the implementation of a numerical algorithm for simulating incompressible fluid...
This thesis spans several research areas, with the main topics being parallel programming based on ...
We investigate multi-level parallelism on GPU clusters with MPI-CUDA and hybrid MPI-OpenMP-CUDA para...