AbstractWe develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for massively parallel systems based on multi-core processors. Our code describes 2D multi-phase compressible flows. We analyze the performance bottlenecks that we find as we gradually expose a larger fraction of the available parallelism, and derive appropriate solutions. We obtain a sustained performance for this ready-for-physics code that is a large fraction of peak. Our results can be easily applied to most present (or planned) HPC architectures, based on latest generation multi-core Intel processor architectures
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
Accelerators are quickly emerging as the leading technology to further boost computing performances;...
GPUs deliver higher performance than traditional processors, offering remarkable energy efficiency, ...
We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for massively p...
We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for massively p...
Abstract We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for ma...
AbstractWe develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for mas...
We describe the implementation and optimization of a state-of-the-art Lattice Boltzmann code for com...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluste...
When designing and implementing highly efficient scientific applications for parallel computers such...
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluste...
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluste...
In this paper we report on our early experience on porting, optimizing and benchmarking a Lattice Bo...
AbstractIn this paper we report on our early experience on porting, optimizing and benchmarking a La...
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
Accelerators are quickly emerging as the leading technology to further boost computing performances;...
GPUs deliver higher performance than traditional processors, offering remarkable energy efficiency, ...
We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for massively p...
We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for massively p...
Abstract We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for ma...
AbstractWe develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for mas...
We describe the implementation and optimization of a state-of-the-art Lattice Boltzmann code for com...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluste...
When designing and implementing highly efficient scientific applications for parallel computers such...
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluste...
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluste...
In this paper we report on our early experience on porting, optimizing and benchmarking a Lattice Bo...
AbstractIn this paper we report on our early experience on porting, optimizing and benchmarking a La...
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
Accelerators are quickly emerging as the leading technology to further boost computing performances;...
GPUs deliver higher performance than traditional processors, offering remarkable energy efficiency, ...