Selective replication in memory-side GPU caches

Zhao, Xia
Jahre, Magnus
Eeckhout, Lieven

Publication date

January 2020

DOI

10.1109/micro50266.2020.00082

Publisher

IEEE

Abstract

Data-intensive applications put immense strain on the memory systems of Graphics Processing Units (GPUs). To cater to this need, GPU memory systems distribute requests across independent units, providing high bandwidth by servicing requests (mostly) in parallel. We find that this strategy breaks down for shared data structures because the shared Last-Level Cache (LLC) organization used by contemporary GPUs stores shared data in a single LLC slice. Requests for shared data are hence serialized, so data-intensive applications do not receive the bandwidth they require. A private LLC organization can provide high bandwidth, but it is often undesirable since it significantly reduces the effective LLC capacity. In this work, we pr...
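The serialization effect described in the abstract can be illustrated with a minimal sketch. It assumes a simple line-interleaved address-to-slice mapping; the slice count, line size, and the function names `slice_of` and `busiest_slice_load` are hypothetical and are not taken from the paper.

```python
from collections import Counter

NUM_SLICES = 8      # hypothetical number of LLC slices
LINE_BYTES = 128    # hypothetical cache line size in bytes


def slice_of(addr: int) -> int:
    """Map a byte address to an LLC slice by interleaving cache lines (assumed mapping)."""
    return (addr // LINE_BYTES) % NUM_SLICES


def busiest_slice_load(addrs) -> int:
    """Number of requests queued at the most loaded slice, a rough proxy for serialization."""
    return max(Counter(slice_of(a) for a in addrs).values())


# 64 requesters each read a different cache line: requests spread over all slices.
private_reads = [i * LINE_BYTES for i in range(64)]
# 64 requesters all read the same shared line: every request lands on one slice.
shared_reads = [0x4000] * 64

print(busiest_slice_load(private_reads))  # 8
print(busiest_slice_load(shared_reads))   # 64
```

Under these assumptions, the distinct-line accesses put 8 requests on each slice, while the shared-line accesses put all 64 on a single slice, which is the bottleneck the abstract attributes to a shared LLC organization.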
