As modern GPUs rely partly on their on-chip memories to counter the imminent off-chip memory wall, the efficient use of their caches has become important for performance and energy. However, optimising cache locality systematically requires insight into and prediction of cache behaviour. On sequential processors, stack distance or reuse distance theory is a well-known means to model cache behaviour. However, it is not straightforward to apply this theory to GPUs, mainly because of the parallel execution model and fine-grained multi-threading. This work extends reuse distance to GPUs by modelling: 1) the GPU’s hierarchy of threads, warps, threadblocks, and sets of active threads, 2) conditional and non-uniform latencies, 3) cache associa...
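To make the notion of reuse distance concrete, the minimal Python sketch below computes classical sequential reuse distances over a cache-line address trace and derives the hit rate of a fully-associative LRU cache. It is an illustration only: the names and the example trace are hypothetical, and it deliberately omits the GPU-specific aspects (warps, threadblock scheduling, associativity, non-uniform latencies) that the model above addresses.

```python
from collections import OrderedDict

def reuse_distances(trace):
    """Reuse distance of every access in a sequential cache-line trace.

    The reuse distance of an access is the number of distinct lines touched
    since the previous access to the same line (infinity for a first access).
    For a fully-associative LRU cache of C lines, an access hits exactly
    when its reuse distance is smaller than C.
    """
    stack = OrderedDict()  # LRU order: most recently used line is last
    distances = []
    for line in trace:
        if line in stack:
            keys = list(stack)
            # number of distinct lines used more recently than `line`
            distances.append(len(keys) - 1 - keys.index(line))
            stack.move_to_end(line)
        else:
            distances.append(float("inf"))
            stack[line] = None
    return distances

def lru_hit_rate(distances, cache_lines):
    return sum(d < cache_lines for d in distances) / len(distances)

# Hypothetical single-thread trace of cache-line IDs.
trace = [0, 1, 2, 0, 3, 1, 2, 0]
dists = reuse_distances(trace)
print(dists)                   # [inf, inf, inf, 2, inf, 3, 3, 3]
print(lru_hit_rate(dists, 4))  # 0.5
```

A GPU-aware model, as the abstract describes, would instead interleave per-thread traces according to the warp and threadblock schedule and account for associativity and latency effects, which changes the distance observed by each access.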
Graphics Processing Units (GPUs) have been shown to be effective at achieving large speedups over co...
The usage of Graphics Processing Units (GPUs) as an application accelerator has become increasingly ...
GPUs have become popular due to their high computational power. Data scientists rely on GPUs to proc...
In the present paper, we propose RDGC, a reuse distance-based performance analysis approach for GPU ...
Traditionally, GPUs only had programmer-managed caches. The advent of hardware-managed caches accele...
Analytical performance models yield valuable architectural insight without incurring the excessive r...
As a throughput-oriented device, the Graphics Processing Unit (GPU) already integrates caches, wh...
With the SIMT execution model, GPUs can hide memory latency through massive multithreading ...
The diversity of workloads drives studies to use GPUs more effectively to overcome the limited memory...
Graphics processing units (GPUs) have become ubiquitous for general purpose applications due to thei...
In a GPU, all threads within a warp execute the same instruction in lockstep. For a memory ...
Analytical models enable architects to carry out early-stage design space exploration several orders...
In the last decade, GPUs have emerged to be widely adopted for general-purpose applications. To capt...