Beyond the socket: NUMA-aware GPUs

Ugljesa, Milic
Villa, Oreste
Bolotin, Evgeny
Arunkumar, Akhil
Ebrahimi, Eiman
Jaleel, Aamer
Ramirez, Alex
Nellans, David

Open link

Publication date

October 2017

DOI

10.1145/3123939.3124534

Publisher

Association for Computing Machinery (ACM)

Abstract

GPUs achieve high throughput and power efficiency by employing many small single instruction multiple thread (SIMT) cores. To minimize scheduling logic and performance variance they utilize a uniform memory system and leverage strong data parallelism exposed via the programming model. With Moore's law slowing, for GPUs to continue scaling performance (which largely depends on SIMT core count) they are likely to embrace multi-socket designs where transistors are more readily available. However when moving to such designs, maintaining the illusion of a uniform memory system is increasingly difficult. In this work we investigate multi-socket non-uniform memory access (NUMA) GPU designs and show that significant changes are needed to both the G...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Beyond the socket: NUMA-aware GPUs

Abstract

Extracted data

Beyond the socket: NUMA-aware GPUs

Abstract

Extracted data

Related items

Related items