Algorithms with low computational intensity show interesting per-formance and power consumption behavior on multicore proces-sors. We choose the lattice-Boltzmann method (LBM) as a pro-totype for this scenario in order to show if and how single-chip performance and power characteristics can be generalized to the highly parallel case. LBM is an algorithm for CFD simulations that has gained popularity due to its ease of implementation and suit-ability for complex geometries. In this paper we perform a thor-ough analysis of a sparse-lattice LBM implementation on the Intel Sandy Bridge processor. Starting from a single-core performance model we can describe the intra-chip saturation characteristics of the code and its optimal operating point in...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
We describe the implementation and optimization of a state-of-the-art Lattice Boltzmann code for com...
We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for massively p...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
This paper presents the performance analysis for both the computing performance and the energy effic...
Numerical analysts and programmers are currently facing a conceptual change in processor technology....
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
In this paper we report results of the analysis of computational performances and energy efficiency ...
With computer simulations real world phenomena can be analyzed in great detail. Computational fluid ...
GPUs deliver higher performance than traditional processors, offering remarkable energy efficiency, ...
The increased availability of modern embedded many-core architectures supporting floating-point oper...
Lattice Boltzmann method (LBM) is an important computational fluid dynamics (CFD) approach to solvin...
In this paper we report results of the analysis of computational performances and energy efficiency ...
Abstract We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for ma...
In this paper we report on our early experience on porting, optimizing and benchmarking a Lattice Bo...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
We describe the implementation and optimization of a state-of-the-art Lattice Boltzmann code for com...
We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for massively p...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
This paper presents the performance analysis for both the computing performance and the energy effic...
Numerical analysts and programmers are currently facing a conceptual change in processor technology....
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
In this paper we report results of the analysis of computational performances and energy efficiency ...
With computer simulations real world phenomena can be analyzed in great detail. Computational fluid ...
GPUs deliver higher performance than traditional processors, offering remarkable energy efficiency, ...
The increased availability of modern embedded many-core architectures supporting floating-point oper...
Lattice Boltzmann method (LBM) is an important computational fluid dynamics (CFD) approach to solvin...
In this paper we report results of the analysis of computational performances and energy efficiency ...
Abstract We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for ma...
In this paper we report on our early experience on porting, optimizing and benchmarking a Lattice Bo...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
We describe the implementation and optimization of a state-of-the-art Lattice Boltzmann code for com...
We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for massively p...