Modern supercomputers rely on accelerators to speed up highly parallel workloads. Intricate programming models, limited device memory sizes and overheads of data transfers between CPU and accelerator memories are among the open challenges that restrict the widespread use of accelerators. First, this paper proposes a mechanism and an implementation to automatically pipeline the CPU-GPU memory channel so as to overlap the GPU computation with the memory copies, alleviating the data transfer overhead. Second, in doing so, the paper presents a technique called Computation Splitting, COSP, that caters to arbitrary device memory sizes and automatically manages to run out-of-card OpenMP-like applications on GPUs. Third, a novel adaptive runtime tu...
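The overlap described in this abstract can be illustrated with a small, hypothetical CUDA sketch (not the paper's COSP implementation): the data is split into chunks, and each chunk's host-to-device copy, kernel launch, and device-to-host copy are issued on a separate stream, so transfers for one chunk proceed while another chunk computes. The kernel name, chunk count, and array sizes below are illustrative assumptions.

// Hypothetical sketch of CPU-GPU pipelining with CUDA streams; not the paper's COSP runtime.
#include <cuda_runtime.h>

__global__ void scale(float *d, int n, float f) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) d[i] *= f;
}

int main() {
    const int N = 1 << 24, CHUNKS = 4, CHUNK = N / CHUNKS;
    float *h, *d;
    cudaMallocHost((void **)&h, N * sizeof(float));   // pinned host memory enables truly async copies
    cudaMalloc((void **)&d, N * sizeof(float));
    for (int i = 0; i < N; ++i) h[i] = 1.0f;

    cudaStream_t s[CHUNKS];
    for (int c = 0; c < CHUNKS; ++c) cudaStreamCreate(&s[c]);

    for (int c = 0; c < CHUNKS; ++c) {
        size_t off = (size_t)c * CHUNK;
        // Copy-in, compute, and copy-out for chunk c are ordered within stream s[c],
        // but overlap with the transfers and kernels of other chunks on other streams.
        cudaMemcpyAsync(d + off, h + off, CHUNK * sizeof(float),
                        cudaMemcpyHostToDevice, s[c]);
        scale<<<(CHUNK + 255) / 256, 256, 0, s[c]>>>(d + off, CHUNK, 2.0f);
        cudaMemcpyAsync(h + off, d + off, CHUNK * sizeof(float),
                        cudaMemcpyDeviceToHost, s[c]);
    }
    cudaDeviceSynchronize();

    for (int c = 0; c < CHUNKS; ++c) cudaStreamDestroy(s[c]);
    cudaFree(d);
    cudaFreeHost(h);
    return 0;
}

The same chunking structure is what allows out-of-card execution in principle: only one chunk needs to reside in device memory at a time, so the working set can exceed the GPU's capacity.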
The StreamIt programming model has been proposed to exploit parallelism in streaming applications ...
We identify and show how to overcome an OpenMP bottleneck in the administration of GPU memory. It ar...
Widespread heterogeneous parallelism is unavoidable given the emergence of General-Purpose computing...
Accelerators, such as GPUs and Intel Xeon Phis, have become the workhorses of high-performance compu...
Graphics Processing Units (GPUs) have been successfully used to accelerate scientific applications d...
A major shift in technology from maximizing single-core performance to integrating multiple cores ha...
In the past decade, accelerators, commonly Graphics Processing Units (GPUs), have played a key role ...
Heterogeneous supercomputers that incorporate computational accelerators such as GPUs are increasin...
Graphics Processing Units (GPUs) have been widely adopted to accelerate the execution of HPC workload...
OpenMP [13] is the dominant programming model for shared-memory parallelism in C, C++ and Fortran du...
Over the past two decades, microprocessor manufacturers have typically relied on wider issue widths ...
As the high-performance computing (HPC) community continues the push towards exascale ...
GPU devices are becoming a common element in current HPC platforms due to their high performance-per...
Over the past few years, GPUs have become ubiquitous in HPC installations around the world. Today, they provi...
Graphics processing units (GPUs) have become prevalent in modern computing systems. While their high...