International audienceIn this work, we investigate the global memory access mechanism on recent GPUs. For the purpose of this study, we created specific benchmark programs, which allowed us to explore the scheduling of global memory transactions. Thus, we formulate a model capable of estimating the execution time for a large class of applications. Our main goal is to facilitate optimisation of regular data-parallel applications on GPUs. As an example, we finally describe our CUDA implementations of LBM flow solvers on which our model was able to estimate performance with less than 5% relative error
Several efforts have been performed to improve LBM defects related to its computational performance....
Since Graphics Processing Units (CPUs) have increasingly gained popularity amoung non-graphic and co...
During the past two decades, the lattice Boltzmann method (LBM) has been increasingly acknowledged a...
International audienceIn this work, we investigate the global memory access mechanism on recent GPUs...
International audienceIn this work, we investigate the global memory access mech- anism on recent GP...
International audienceEmerging many-core processors, like CUDA capable nVidia GPUs, are promising pl...
AbstractEmerging many-core processors, like CUDA capable nVidia GPUs, are promising platforms for re...
Today, we are living a growing demand of larger and more efficient computational resources from the ...
The scientific community in its never-ending road of larger and more efficient computational resourc...
International audienceThe lattice Boltzmann method (LBM) is an innovative and promising approach in ...
National audienceThe popularization of graphic processing units (GPUs) has led to their extensive us...
AbstractGraphics Processing Units (GPUs), originally developed for computer games, now provide compu...
The Lattice Boltzmann method (LBM) for solving fluid flow is naturally well suited to an efficient i...
Lattice Boltzmann Method (LBM) is a powerful numerical simulation method of the fluid flow. With its...
International audienceIn this paper, we describe the implementation of a multi-graphical processing ...
Several efforts have been performed to improve LBM defects related to its computational performance....
Since Graphics Processing Units (CPUs) have increasingly gained popularity amoung non-graphic and co...
During the past two decades, the lattice Boltzmann method (LBM) has been increasingly acknowledged a...
International audienceIn this work, we investigate the global memory access mechanism on recent GPUs...
International audienceIn this work, we investigate the global memory access mech- anism on recent GP...
International audienceEmerging many-core processors, like CUDA capable nVidia GPUs, are promising pl...
AbstractEmerging many-core processors, like CUDA capable nVidia GPUs, are promising platforms for re...
Today, we are living a growing demand of larger and more efficient computational resources from the ...
The scientific community in its never-ending road of larger and more efficient computational resourc...
International audienceThe lattice Boltzmann method (LBM) is an innovative and promising approach in ...
National audienceThe popularization of graphic processing units (GPUs) has led to their extensive us...
AbstractGraphics Processing Units (GPUs), originally developed for computer games, now provide compu...
The Lattice Boltzmann method (LBM) for solving fluid flow is naturally well suited to an efficient i...
Lattice Boltzmann Method (LBM) is a powerful numerical simulation method of the fluid flow. With its...
International audienceIn this paper, we describe the implementation of a multi-graphical processing ...
Several efforts have been performed to improve LBM defects related to its computational performance....
Since Graphics Processing Units (CPUs) have increasingly gained popularity amoung non-graphic and co...
During the past two decades, the lattice Boltzmann method (LBM) has been increasingly acknowledged a...