Managing the memory hierarchy in GPUs

Dublish, Saumay Kumar

Publication date

July 2018

Publisher

The University of Edinburgh

Abstract

Pervasive use of GPUs across multiple disciplines is a result of continuous adaptation of the GPU architectures to address the needs of upcoming application domains. One such vital improvement is the introduction of the on-chip cache hierarchy, used primarily to filter the high bandwidth demand to the off-chip memory. However, in contrast to traditional CPUs, the cache hierarchy in GPUs is presented with significantly different challenges such as cache thrashing and bandwidth bottlenecks, arising due to small caches and high levels of memory traffic. These challenges lead to severe congestion across the memory hierarchy, resulting in high memory access latencies. In memory-intensive applications, such high memory access latencies of...
