Special Section on Parallel, Distributed, and Reconfigurable Computing, and Networking
Graphics processing units (GPUs) are highly efficient architectures for parallel stencil code; however, the small device (i.e., GPU) memory capacity (several tens of GBs) necessitates out-of-core computation to process data that exceeds it. Implementing efficient out-of-core stencil code manually takes considerable programming effort. To relieve this burden, directive-based frameworks such as the pipelined accelerator (PACC) have emerged; however, they usually lack specific optimizations to reduce data transfer. In this paper, we extend PACC with two data-centric optimizations to address data transfer problems. The first is a direct-mapping scheme ...
Stencil computations form the basis for computer simulations across almost every field of science, s...
Hardware accelerators are classic scientific coprocessors in HPC machines. How...
Modern supercomputers rely on accelerators to speed up highly parallel workloads. Intricate programm...
We present a new compiler framework for truly heterogeneous 3D stencil computation on GPU clusters. ...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
A high-productivity framework for multi-GPU and multi-CPU computation of stencil application...
Stencil computations arise in many scientific computing domains, and often represent time-critical ...
GPUs have become an important part of large-scale and high-performance physics simulations, due to ...
PDE discretization schemes yielding stencil-like computing patterns are commonly used for seismic mo...
Stencil computations are widely used in many scientific domains, and are there...
Using two full applications with different characteristics, this thesis explores the performance and...
Graphics processing units (GPUs) have become prevalent in modern computing systems. While their high...
Chandrasekaran, Sunita. Processor architectures have been rapidly evolving for decades. From the intro...
The growth of data to be processed in the Oil & Gas industry matches the requirements imposed by evo...