Prefetching has proven to be a useful technique for reducing cache misses in multiprocessors at the cost of increased coherence traffic. This is especially troublesome for snoop-based systems, where the available coherence bandwidth is often the scalability bottleneck. The bundling technique presented in this paper reduces the overhead caused by prefetching in two ways: piggy-backing prefetches with normal requests, and requiring only one device to perform the snoop lookup for each prefetch transaction. This can reduce both the address bandwidth and the number of snoop lookups compared with a non-prefetching system. We describe bundling implementations for two important transaction types: reads and upgrades. While bundling could reduce t...
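The entry above only sketches the mechanism, so the following is a minimal, hypothetical C illustration of what "piggy-backing" prefetches onto a demand read might look like as a single bundled transaction that occupies one address-bus slot; the struct layout, field names, and 64-byte line size are assumptions made for illustration, not the paper's actual protocol format.

/* Illustrative sketch only: a hypothetical coherence request that "bundles"
 * up to 8 adjacent prefetch targets with a demand read, so that they share
 * one address-bus slot and can be snooped together. */
#include <stdint.h>
#include <stdio.h>

#define LINE_BYTES 64u

typedef struct {
    uint64_t demand_addr;   /* cache line the CPU actually missed on          */
    uint8_t  prefetch_mask; /* bit i set => also fetch line demand_addr +     */
                            /* (i+1)*LINE_BYTES, looked up by one device only */
} bundled_read_t;

/* Build one bundled transaction instead of 1 demand + N separate prefetches. */
static bundled_read_t make_bundle(uint64_t miss_addr, unsigned n_prefetch)
{
    bundled_read_t t;
    t.demand_addr   = miss_addr & ~(uint64_t)(LINE_BYTES - 1);
    t.prefetch_mask = (uint8_t)((1u << n_prefetch) - 1u);
    return t;
}

int main(void)
{
    bundled_read_t t = make_bundle(0x1000, 3);
    printf("demand line 0x%llx, prefetch mask 0x%x\n",
           (unsigned long long)t.demand_addr, t.prefetch_mask);
    return 0;
}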
A major performance limiter in modern processors is the long latencies caused by data cache misses. ...
The speed of processors increases much faster than the memory access time. This makes memory accesse...
In multi-core systems, an application's prefetcher can interfere with the memo...
Prefetching is an important technique for reducing the average latency of memory accesses in scalabl...
High performance processors employ hardware data prefetching to reduce the negative performance impa...
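Since the entry above is cut off right after naming hardware data prefetching, here is a generic, minimal C sketch of one common form, a stride prefetcher trained on the load PC; the table size and confidence threshold are arbitrary assumptions, not any specific processor's design.

/* Minimal sketch of a classic stride prefetcher table: each entry remembers
 * the last address and stride seen for a load PC and issues a prefetch once
 * the same stride repeats often enough. */
#include <stdint.h>
#include <stdio.h>

#define TABLE_ENTRIES  256u
#define CONF_THRESHOLD 2

typedef struct {
    uint64_t last_addr;
    int64_t  stride;
    int      confidence;
} stride_entry_t;

static stride_entry_t table[TABLE_ENTRIES];

/* Called on every load; returns a prefetch address, or 0 if none is issued. */
static uint64_t stride_train(uint64_t pc, uint64_t addr)
{
    stride_entry_t *e = &table[pc % TABLE_ENTRIES];
    int64_t stride = (int64_t)(addr - e->last_addr);

    if (stride == e->stride && stride != 0) {
        e->confidence++;
    } else {
        e->stride = stride;
        e->confidence = 0;
    }
    e->last_addr = addr;

    return (e->confidence >= CONF_THRESHOLD) ? addr + (uint64_t)e->stride : 0;
}

int main(void)
{
    /* A stream of loads with a fixed 64-byte stride trains the entry. */
    for (uint64_t a = 0x2000; a < 0x2200; a += 64) {
        uint64_t pf = stride_train(0x400123, a);
        if (pf)
            printf("prefetch 0x%llx\n", (unsigned long long)pf);
    }
    return 0;
}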
Compiler-directed cache prefetching has the potential to hide much of the high memory latency seen ...
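As a concrete picture of what compiler-directed prefetching produces, the sketch below writes by hand what a compiler would insert: prefetch instructions placed a fixed distance ahead of the loads in a loop, using GCC/Clang's __builtin_prefetch; the distance of 16 elements is an arbitrary choice, and real compilers pick it from the loop's memory access pattern and estimated latency.

/* Hand-written equivalent of compiler-inserted prefetching in a reduction loop. */
#include <stddef.h>

double sum_with_prefetch(const double *a, size_t n)
{
    const size_t dist = 16;          /* prefetch distance, in elements */
    double sum = 0.0;

    for (size_t i = 0; i < n; i++) {
        if (i + dist < n)
            __builtin_prefetch(&a[i + dist], 0 /* read */, 3 /* high locality */);
        sum += a[i];
    }
    return sum;
}

Too small a distance and the data still arrives late; too large and the prefetched lines may be evicted before use, which is why the distance is normally chosen per loop rather than fixed.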
Prefetching in shared-memory multiprocessor systems is an increasingly difficult problem. As system ...
Memory latency has always been a major issue in shared-memory multiprocessors and high-speed systems...
Chip Multiprocessors (CMP) are an increasingly popular architecture and increasing numbers of vendor...
A well known performance bottleneck in computer architecture is the so-called memory wall. This term...
In this paper, we examine the way in which prefetching can exploit parallelism. Prefetching has been st...
Given the increasing gap between processors and memory, prefetching data into cache become...
Memory access latency is the primary performance bottleneck in modern computer systems. Prefetching...