GPUs have become an important part of large-scale and high-performance physics simulations, due to their superior performance [11] and energy efficiency [23] compared with CPUs. This thesis examines how to use GPUs to accelerate an existing CPU stencil code that was originally parallelized through message passing. Our first research question is how to utilize the CPU cores alongside the GPUs when the bulk of the computation happens on the GPUs. Second, we investigate how to address the performance bottleneck of data movement between CPU and GPU when computational tasks originally intended for CPUs still have to be performed. Lastly, we investigate how the performance bottleneck of communication between processes can be alleviated t...
Today, it is possible to associate multiple CPUs and multiple GPUs in a single...
A high-productivity framework for multi-GPU and multi-CPU computation of stencil application...
GPUs are widely used in high performance computing, due to their high computational power and high p...
The programming of GPUs (Graphics Processing Units) is ready for practical applications; the largest...
Special Section on Parallel, Distributed, and Reconfigurable Computing, and Networking. Graphics proce...
The relentless demands for improvements in compute throughput and energy efficiency have driven...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
We present a new compiler framework for truly heterogeneous 3D stencil computation on GPU clusters. ...
Particle-in-cell (PIC) is a simulation technique for plasma physics. The large number of particles i...
Stencil computations arise in many scientific computing domains, and often represent time-critical ...
Using two full applications with different characteristics, this thesis explores the performance and...
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq); Fundação de Amparo à Pesquisa do...
Graphics processors are becoming faster and faster. Computational power within graphics processing uni...
Nowadays, the most powerful supercomputers in the world, needed for solving complex models and simu...
With processor clock speeds having stagnated, parallel computing architectures have achieved a break...