Cache hierarchies are increasingly non-uniform, so for systems to scale efficiently, data must be close to the threads that use it. Moreover, cache capacity is limited and contended among threads, introducing complex capacity/latency tradeoffs. Prior NUCA schemes have focused on managing data to reduce access latency, but have ignored thread placement, and applying prior NUMA thread placement schemes to NUCA is inefficient, as capacity, not bandwidth, is the main constraint. We present CDCS, a technique to jointly place threads and data in multicores with distributed shared caches. We develop novel monitoring hardware that enables fine-grained space allocation on large caches, and data movement support to allow frequent full-chip reconfigurations.
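To make the idea of joint thread and data placement concrete, the sketch below shows a simple greedy heuristic on a mesh of tiles: threads with the largest working sets pick home tiles first, and each thread's data then spills into the nearest banks with free capacity. This is only an illustrative toy, not the CDCS algorithm; the mesh size, bank capacity, and working-set numbers are made-up assumptions.

```python
# Illustrative sketch of joint thread/data placement on a tiled mesh.
# NOT the CDCS algorithm; all parameters below are hypothetical.
from itertools import product

MESH = 4            # assumed 4x4 mesh, one core + one LLC bank per tile
BANK_LINES = 512    # assumed bank capacity, in cache lines

def dist(a, b):
    """Manhattan hop count between two tiles (x, y)."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])

def place(threads):
    """threads: dict thread_id -> working-set size in lines.
    Greedy: biggest working sets choose home tiles first, then their
    data fills the closest banks that still have free capacity."""
    tiles = list(product(range(MESH), repeat=2))
    free = {t: BANK_LINES for t in tiles}
    unused_tiles = set(tiles)
    thread_tile, data_alloc = {}, {}

    for tid, ws in sorted(threads.items(), key=lambda kv: -kv[1]):
        # Pick the free tile whose immediate neighborhood has the most capacity.
        home = max(unused_tiles,
                   key=lambda t: sum(free[u] for u in tiles if dist(t, u) <= 1))
        unused_tiles.remove(home)
        thread_tile[tid] = home

        # Allocate the working set in banks ordered by distance from the thread.
        alloc, remaining = [], ws
        for bank in sorted(tiles, key=lambda t: dist(home, t)):
            if remaining == 0:
                break
            take = min(free[bank], remaining)
            if take:
                free[bank] -= take
                remaining -= take
                alloc.append((bank, take))
        data_alloc[tid] = alloc
    return thread_tile, data_alloc

if __name__ == "__main__":
    threads = {"A": 1500, "B": 300, "C": 900}   # hypothetical working sets
    tt, da = place(threads)
    for tid in threads:
        hops = sum(dist(tt[tid], bank) * lines for bank, lines in da[tid])
        print(tid, "on tile", tt[tid], "avg hops/line =", hops / threads[tid])
```

The point of the toy is that placing threads and data together lets a large working set claim nearby banks before an unrelated thread is scheduled next to them, which a data-only NUCA policy cannot do.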
Current architectural trends of rising on-chip core counts and worsening power-performance penalties...
The scaling of semiconductor technologies is leading to processors with increasing numbers of cores....
Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provid...
Chip-multiprocessors (CMPs) have become the mainstream chip design in recent years; for scalability ...
Increases in on-chip communication delay and the large working sets of server and scientific workloa...
Shared last-level caches, widely used in chip-multi-processors (CMPs), face two fundamental limitati...
In current multi-core systems with the shared last level cache (LLC) physically distributed ...
In future multi-cores, large amounts of delay and power will be spent accessing data...
We introduce the concept of deadlock-free migration-based coherent shared memory to the NUCA family ...
The last level on-chip cache (LLC) is becoming bigger and more complex to effectively support the va...
As transistor density continues to grow geometrically, processor manufacturers are already able to p...
Wire delays continue to grow as the dominant component of latency for large caches. A recent work pr...
Chip multiprocessors have the potential to exploit thread level parallelism, particularly attractive...
The effectiveness of the last-level shared cache is crucial to the performance of a multi-core syste...