We present an auto-tuning approach to optimize application performance on emerging multicore architectures. The methodology extends the idea of search-based performance optimizations, popular in linear algebra and FFT libraries, to application-specific computational kernels. Our work applies this strategy to a lattice Boltzmann application (LBMHD) that historically has made poor use of scalar microprocessors due to its complex data structures and memory access patterns. We explore one of the broadest sets of multicore architectures in the HPC literature, including the Intel Xeon E5345 (Clovertown), AMD Opteron 2214 (Santa Rosa), AMD Opteron 2356 (Barcelona), Sun T5140 T2+ (Victoria Falls), as well as a QS20 IBM Cell Blade. Rather than hand-...
We develop a Lattice Boltzmann code for computational fluid-dynamics and optimize it for massively p...
The architecture of high performance computing systems is becoming more and more heterogene...
The Lattice Boltzmann Method (LBM) is a powerful numerical method for simulating fluid flow. With its...
We apply auto-tuning to a hybrid MPI-pthreads lattice Boltzmann computation running on the Cray XT4 ...
We are witnessing a rapid evolution of HPC node architectures and on-chip parallelism as power and c...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
GPUs deliver higher performance than traditional processors, offering remarkable energy efficiency, ...
When designing and implementing highly efficient scientific applications for parallel computers such a...
The Lattice Boltzmann method (LBM) for solving fluid flow is naturally well suited to an efficient i...
In this paper we report on our early experience on porting, optimizing and benchmarking a Lattice Bo...