A multi-GPU implementation of a D2Q37 lattice Boltzmann code

Biferale, L.
Mantovani, F.
Pivanti, M.
Pozzati, F.
Sbragaglia, M.
Scagliarini, Andrea
Schifano, S.F.
Toschi, F.
Tripiccione, R.

Open link

Publication date

January 2012

DOI

10.1007/978-3-642-31464-3_65

Publisher

Springer

Abstract

We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluster based on Nvidia Fermi processors. We analyze how to optimize the algorithm for GP-GPU architectures, describe the implementation choices that we have adopted and compare our performance results with an implementation optimized for latest generation multi-core CPUs. Our program runs at ˜¿30% of the double-precision peak performance of one GPU and shows almost linear scaling when run on the multi-GPU cluster. Keywords: Computational fluid-dynamics – Lattice Boltzmann methods – GP-GPUs computin

Extracted data

We use cookies to provide a better user experience.

Data Protection

A multi-GPU implementation of a D2Q37 lattice Boltzmann code

Abstract

Extracted data

A multi-GPU implementation of a D2Q37 lattice Boltzmann code

Abstract

Extracted data

Related items

Related items