A Quantitative Study of Locality in GPU Caches

Lal, Sohan
Juurlink, Ben

Open PDF

Open link

Publication date

December 2020

DOI

10.14279/depositonce-10108.2

Journal

1611-3349

Abstract

Traditionally, GPUs only had programmer-managed caches. The advent of hardware-managed caches accelerated the use of GPUs for general-purpose computing. However, as GPU caches are shared by thousands of threads, they are usually a victim of contention and can suffer from thrashing and high miss rate, in particular, for memory-divergent workloads. As data locality is crucial for performance, there have been several efforts focusing on exploiting data locality in GPUs. However, there is a lack of quantitative analysis of data locality and data reuse in GPUs. In this paper, we quantitatively study the data locality and its limits in GPUs. We observe that data locality is much higher than exploited by current GPUs. We show that, on the one hand...

Extracted data

We use cookies to provide a better user experience.

Data Protection

A Quantitative Study of Locality in GPU Caches

Abstract

Extracted data

A Quantitative Study of Locality in GPU Caches

Abstract

Extracted data

Topics

Related items

Topics

Related items