This paper presents novel cache optimizations for massively parallel, throughput-oriented architectures like GPUs. L1 data caches (L1 D-caches) are critical resources for providing high-bandwidth and low-latency data accesses. However, the high number of simultaneous requests from single-instruction multiple-thread (SIMT) cores makes the limited capacity of L1 D-caches a performance and energy bottleneck, especially for memory-intensive applications. We observe that the memory access streams to L1 D-caches for many applications contain a significant number of requests with low reuse, which greatly reduce cache efficacy. Existing GPU cache management schemes are either based on conditional/reactive solutions or hit-rate based design...
To match the increasing computational demands of GPGPU applications and to improve peak compute thro...
The continued growth of the computational capability of throughput processors has made throughput...
General-purpose Graphics Processing Units (GPGPUs) have shown enormous promise in enabling high thro...
Abstract—With the SIMT execution model, GPUs can hide memory latency through massive multithreading ...
Abstract—On-chip caches are commonly used in computer systems to hide long off-chip memory access la...
Current GPU computing models support a mixture of coherent and incoherent classes of memory operatio...
The usage of Graphics Processing Units (GPUs) as an application accelerator has become increasingly ...
Minimizing power, increasing performance, and delivering effective memory bandwidth are today's prim...
Pervasive use of GPUs across multiple disciplines is a result of continuous adaptation of the GPU a...
Massively parallel, throughput-oriented systems such as graphics processing units (GPUs) offer high ...