Abstract—Accelerator awareness has become a pressing issue in data movement models, such as MPI, because of the rapid deployment of systems that utilize accelerators. In our previous work, we developed techniques to enhance MPI with accelerator awareness, thus allowing applications to easily and efficiently communicate data between accelerator memories. In this paper, we extend this work with techniques to perform efficient data movement between accelerators within the same node using a DMA-assisted, peer-to-peer intranode communication technique that was recently introduced for NVIDIA GPUs. We present a detailed design of our new approach to intranode communication and evaluate its improvement to communication and application performance u...
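The abstract above refers to a DMA-assisted, peer-to-peer intranode communication technique recently introduced for NVIDIA GPUs without spelling out the mechanism. Below is a minimal sketch, assuming the underlying primitive is CUDA's peer-to-peer copy path (cudaDeviceCanAccessPeer, cudaDeviceEnablePeerAccess, cudaMemcpyPeer); the buffer size and device indices are illustrative only, not taken from the paper.

#include <cuda_runtime.h>
#include <stdio.h>
#include <stdlib.h>

/* Illustrative only: copy a buffer from GPU 0 to GPU 1 within one node.
 * With peer access enabled, cudaMemcpyPeer can move the data in a single
 * DMA transfer over the interconnect instead of staging through host memory. */
int main(void)
{
    const size_t bytes = 1 << 20;   /* 1 MiB test buffer (arbitrary) */
    int can_access = 0;

    /* Ask the runtime whether device 0 may directly address device 1. */
    cudaDeviceCanAccessPeer(&can_access, 0, 1);
    if (!can_access) {
        fprintf(stderr, "peer access between GPU 0 and GPU 1 not supported\n");
        return EXIT_FAILURE;
    }

    void *src = NULL, *dst = NULL;

    cudaSetDevice(0);
    cudaDeviceEnablePeerAccess(1, 0);  /* let device 0 map device 1's memory */
    cudaMalloc(&src, bytes);

    cudaSetDevice(1);
    cudaMalloc(&dst, bytes);

    /* Device-to-device copy; the runtime issues the DMA between the GPUs. */
    cudaMemcpyPeer(dst, 1, src, 0, bytes);
    cudaDeviceSynchronize();

    cudaFree(dst);
    cudaSetDevice(0);
    cudaFree(src);
    return EXIT_SUCCESS;
}

In an MPI library, the same copy primitive would presumably be invoked inside the intranode channel once both ranks' device buffers are visible to each other (for example, via CUDA IPC handles); this is the kind of integration the abstract appears to describe.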
GPUs are widely used in high performance computing, due to their high computational power and high p...
Due to their massive parallelism and high performance per Watt, GPUs have gained high popularity in ...
Due to their massive parallelism and high performance per watt, GPUs have gained high popularity in high per...
Abstract—Current implementations of MPI are unaware of accelerator memory (i.e., GPU device memory) ...
Abstract—Data movement in high-performance computing systems accelerated by graphics processing unit...
Modern multi-core clusters are increasingly using GPUs to achieve higher performance and power effic...
Current trends in computing and system architecture point towards a need for accelerators such as GP...
Today, GPUs and other parallel accelerators are widely used in high performance computing, due to th...
After the introduction of CUDA by NVIDIA, GPUs became devices capable of accelerating any genera...
This paper explores the challenges in implementing a message passing interface usable on systems wit...
A steady increase in accelerator performance has driven demand for faster interconnects to avert the...
Network communication on GPU-based systems is a significant roadblock for many applications with sma...
Abstract—We present and analyze two new communication libraries, cudaMPI and glMPI, that provide an ...
GPUs are frequently used to accelerate data-parallel workloads across a wide variety of application ...
Graphics Processing Units (GPUs) are widely used in high performance computing, due to their high com...