International audienceIn this work, we investigate the global memory access mechanism on recent GPUs. For the purpose of this study, we created specific benchmark programs, which allowed us to explore the scheduling of global memory transactions. Thus, we formulate a model capable of estimating the execution time for a large class of applications. Our main goal is to facilitate optimisation of regular data-parallel applications on GPUs. As an example, we finally describe our CUDA implementations of LBM flow solvers on which our model was able to estimate performance with less than 5% relative error
Since Graphics Processing Units (CPUs) have increasingly gained popularity amoung non-graphic and co...
During the past two decades, the lattice Boltzmann method (LBM) has been increasingly acknowledged a...
In the last three years, GPUs are more and more being used for general purpose applications instead ...
International audienceIn this work, we investigate the global memory access mechanism on recent GPUs...
International audienceIn this work, we investigate the global memory access mech- anism on recent GP...
International audienceEmerging many-core processors, like CUDA capable nVidia GPUs, are promising pl...
AbstractEmerging many-core processors, like CUDA capable nVidia GPUs, are promising platforms for re...
Today, we are living a growing demand of larger and more efficient computational resources from the ...
The scientific community in its never-ending road of larger and more efficient computational resourc...
National audienceThe popularization of graphic processing units (GPUs) has led to their extensive us...
International audienceThe lattice Boltzmann method (LBM) is an innovative and promising approach in ...
The Lattice Boltzmann method (LBM) for solving fluid flow is naturally well suited to an efficient i...
AbstractGraphics Processing Units (GPUs), originally developed for computer games, now provide compu...
Lattice Boltzmann Method (LBM) is a powerful numerical simulation method of the fluid flow. With its...
Several efforts have been performed to improve LBM defects related to its computational performance....
Since Graphics Processing Units (CPUs) have increasingly gained popularity amoung non-graphic and co...
During the past two decades, the lattice Boltzmann method (LBM) has been increasingly acknowledged a...
In the last three years, GPUs are more and more being used for general purpose applications instead ...
International audienceIn this work, we investigate the global memory access mechanism on recent GPUs...
International audienceIn this work, we investigate the global memory access mech- anism on recent GP...
International audienceEmerging many-core processors, like CUDA capable nVidia GPUs, are promising pl...
AbstractEmerging many-core processors, like CUDA capable nVidia GPUs, are promising platforms for re...
Today, we are living a growing demand of larger and more efficient computational resources from the ...
The scientific community in its never-ending road of larger and more efficient computational resourc...
National audienceThe popularization of graphic processing units (GPUs) has led to their extensive us...
International audienceThe lattice Boltzmann method (LBM) is an innovative and promising approach in ...
The Lattice Boltzmann method (LBM) for solving fluid flow is naturally well suited to an efficient i...
AbstractGraphics Processing Units (GPUs), originally developed for computer games, now provide compu...
Lattice Boltzmann Method (LBM) is a powerful numerical simulation method of the fluid flow. With its...
Several efforts have been performed to improve LBM defects related to its computational performance....
Since Graphics Processing Units (CPUs) have increasingly gained popularity amoung non-graphic and co...
During the past two decades, the lattice Boltzmann method (LBM) has been increasingly acknowledged a...
In the last three years, GPUs are more and more being used for general purpose applications instead ...