Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provide for scaling core count and memory capacity. Also, the flat memory address space they offer considerably improves programmability. However, ccNUMA architectures require sophisticated and expensive cache coherence protocols to enforce correctness during parallel executions, which trigger a significant amount of on- and off-chip traffic in the system. This paper analyses how coherence traffic may be best constrained in a large, real ccNUMA platform comprising 288 cores through the use of a joint hardware/software approach. For several benchmarks, we study coherence traffic in detail under the influence of an added hierarchical cache layer in t...
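As a generic point of reference for the software side of such approaches (and not the mechanism studied above), the sketch below shows a common locality technique on ccNUMA machines: first-touch page placement combined with a matching static OpenMP schedule, so that data stays on the node of the threads that use it and remote coherence traffic is reduced. The array size and the saxpy-style kernel are assumptions chosen purely for illustration.

    /* Illustrative sketch only: generic NUMA first-touch placement with a
     * static OpenMP schedule; not the paper's joint hardware/software scheme. */
    #include <stdlib.h>
    #include <omp.h>

    #define N (1L << 24)   /* example size, an assumption */

    int main(void)
    {
        double *x = malloc(N * sizeof *x);
        double *y = malloc(N * sizeof *y);

        /* First-touch initialisation: each thread touches the pages it will
         * later use, so the OS places them on that thread's local NUMA node. */
        #pragma omp parallel for schedule(static)
        for (long i = 0; i < N; i++) {
            x[i] = 1.0;
            y[i] = 2.0;
        }

        /* The same static schedule in the compute phase keeps accesses
         * node-local, limiting cross-node coherence and memory traffic. */
        #pragma omp parallel for schedule(static)
        for (long i = 0; i < N; i++)
            y[i] += 3.0 * x[i];

        free(x);
        free(y);
        return 0;
    }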
Two interesting variations of large-scale shared-memory machines that have recently emerged are cac...
Modern parallel programming frameworks like OpenMP often rely on shared memory concepts to h...
The implementation of multiple processors on a single chip has been made possible with advancements ...
Designing an efficient memory system is a big challenge for future multicore systems. In particular,...
We introduce the concept of deadlock-free migration-based coherent shared memory to the NUCA family ...
Cache Coherent Non-Uniform Memory Access (CC-NUMA) architectures have received strong interest from...
Processor speeds improve much faster than memory access times. This makes memory accesse...
We argue that OS-provided data coherence on non-cache-coherent NUMA multiprocessors (machines with a...
Cache hierarchies are increasingly non-uniform, so for systems to scale efficiently, data must be cl...
Shared memory provides an attractive and intuitive programming model that makes good use of programm...
Cache depot is a performance enhancement technique on cache-coherent non-uniform memory ...
Memory access latency is hence non-uniform, because it depends on where the request ori...