The lattice Boltzmann method (LBM) is an innovative and promising approach in computational fluid dynamics. From an algorithmic standpoint it reduces to a regular data-parallel procedure and is therefore well suited to high-performance computations. Numerous works report efficient implementations of the LBM for the GPU, but very few mention multi-GPU versions and even fewer GPU cluster implementations. Yet, to be of practical interest, GPU LBM solvers need to be able to perform large-scale simulations. In the present contribution, we describe an efficient LBM implementation for CUDA GPU clusters. Our solver consists of a set of MPI communication routines and a CUDA kernel specifically designed to handle three-dimensional partitioning of t...
Heterogeneous clusters are a widely utilized class of supercomputers assembled from different types ...
Emerging many-core processors, like CUDA capable nVidia GPUs, are promising pl...
Today, there is a growing demand for larger and more efficient computational resources from the ...
The lattice Boltzmann method (LBM) is an innovative and promising approach in ...
Many-core processors, such as graphics processing units (GPUs), are promising platforms for intrinsic...
During the past two decades, the lattice Boltzmann method (LBM) has been increasingly acknowledged a...
The lattice Boltzmann method (LBM) is a powerful numerical method for simulating fluid flow. With its...
The Lattice Boltzmann method (LBM) for solving fluid flow is naturally well suited to an efficient i...
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluste...
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
Emerging many-core processors, like CUDA capable nVidia GPUs, are promising platforms for re...
GPUs deliver higher performance than traditional processors, offering remarkable energy efficiency, ...
Lattice Boltzmann (LB) methods are widely used today to describe the dynamics of fluids. Key adva...
In this paper, we describe the implementation of a multi-GPU fluid flow solver based on the lattice ...