We develop a Lattice Boltzmann code for computational fluid dynamics and optimize it for massively parallel systems based on multi-core processors. Our code describes 2D multi-phase compressible flows. We analyze the performance bottlenecks that emerge as we gradually expose a larger fraction of the available parallelism, and derive appropriate solutions. This ready-for-physics code sustains a large fraction of peak performance. Our results can be readily applied to most present (or planned) HPC architectures based on latest-generation multi-core Intel processors.
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
Lattice Boltzmann (LB) methods are widely used today to describe the dynamics of fluids. Key adva...
GPUs deliver higher performance than traditional processors, offering remarkable energy efficiency, ...
We describe the implementation and optimization of a state-of-the-art Lattice Boltzmann code for com...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluste...
When designing and implementing highly efficient scientific applications for parallel computers such...
In this paper we report on our early experience on porting, optimizing and benchmarking a Lattice Bo...