Preserving memory locality is a major issue in highly multithreaded architectures such as GPUs. These architectures hide latency by maintaining a large number of threads in flight. As each thread needs to maintain a private working set, all threads collectively put tremendous pressure on on-chip memory arrays, at significant cost in area and power. We show that thread-private data in GPU-like implicit SIMD architectures can be compressed by a factor of up to 16 by taking advantage of correlations between values held by different threads. We propose the Affine Vector Cache, a compressed cache design that complements the first-level cache. Evaluation by simulation on the SDK and Rodinia benchmarks shows that a 32KB L1 cache assisted by a 16KB AV...
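The compression opportunity the abstract describes comes from the fact that, in implicit SIMD execution, the values held by the threads of a warp for a given variable are frequently affine in the lane index: v[i] = base + i * stride (addresses of per-thread array slots, loop indices, and uniform values with stride 0 all fit this pattern). The following C sketch illustrates the idea in software, assuming a 32-thread warp; encoding 32 one-word lane values as a (base, stride) pair yields the 16x ratio mentioned above. The names affine_t, affine_encode, and affine_decode are illustrative, not the paper's hardware design.

#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>

#define WARP_SIZE 32

/* Hypothetical compressed form of one warp-wide register:
   if v[i] == base + i*stride for every lane, 32 words shrink to 2. */
typedef struct {
    int32_t base;   /* value held by lane 0 */
    int32_t stride; /* per-lane increment (0 for uniform values) */
} affine_t;

/* Returns true and fills *out when the lane values form an affine vector. */
static bool affine_encode(const int32_t v[WARP_SIZE], affine_t *out) {
    int32_t stride = v[1] - v[0];
    for (int i = 2; i < WARP_SIZE; i++)
        if (v[i] - v[i - 1] != stride)
            return false;   /* irregular vector: store uncompressed instead */
    out->base = v[0];
    out->stride = stride;
    return true;
}

/* Expands a compressed pair back into the full per-lane vector. */
static void affine_decode(const affine_t *a, int32_t v[WARP_SIZE]) {
    for (int i = 0; i < WARP_SIZE; i++)
        v[i] = a->base + i * a->stride;
}

int main(void) {
    int32_t addr[WARP_SIZE];
    for (int i = 0; i < WARP_SIZE; i++)
        addr[i] = 0x1000 + 4 * i;   /* typical per-thread word addresses */

    affine_t a;
    if (affine_encode(addr, &a))
        printf("affine: base=%d stride=%d (32 words -> 2)\n",
               (int)a.base, (int)a.stride);

    int32_t back[WARP_SIZE];
    affine_decode(&a, back);        /* recovers the original lane values */
    return 0;
}

A compressed cache built on this encoding can hold many more warp-wide values per SRAM byte than a conventional cache, which is how a small side structure can enlarge the usable capacity of the L1 it complements.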