Data prefetching is an effective technique for hiding memory latency and thus bridging the widening processor-memory performance gap. Our previous work presented guided region prefetching (GRP), a hardware/software cooperative prefetching technique that cost-effectively tolerates L2 latencies. Compiler hints improve L2 prefetching accuracy and reduce bus bandwidth consumption compared to hardware-only prefetching. However, some useless prefetches remain and degrade memory performance. This paper first explores a more aggressive GRP scheme that pushes L2 prefetches into the L1 cache, similar to the IBM POWER4 and POWER5 cache designs. This approach yields some additional performance improvement. This work then combines GRP with evict-me,...
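GRP's actual mechanism is compiler-generated region hints consumed by a hardware prefetch engine, which the abstract does not detail. As a minimal sketch of the general idea of compiler-assisted software prefetching, the fragment below uses GCC/Clang's `__builtin_prefetch` to request data a fixed distance ahead of the current access; the function name and the prefetch distance are illustrative assumptions, not part of GRP.

```c
#include <stddef.h>

/* Illustrative sketch only: prefetch a fixed distance ahead while
 * streaming through an array, so the load of a[i] is likely to hit
 * in cache by the time the loop reaches it. */
long sum_with_prefetch(const long *a, size_t n) {
    const size_t dist = 16; /* assumed prefetch distance, in elements */
    long sum = 0;
    for (size_t i = 0; i < n; i++) {
        if (i + dist < n) {
            /* args: address, rw=0 (read), locality=1 (low temporal reuse) */
            __builtin_prefetch(&a[i + dist], 0, 1);
        }
        sum += a[i];
    }
    return sum;
}
```

The prefetch is a hint: if the line is already resident or the request is useless, the instruction has no architectural effect, which is exactly why inaccurate hints waste bandwidth rather than cause incorrect results.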
A major performance limiter in modern processors is the long latencies caused by data cache misses. ...
As process-scaling trends make the memory system an even more critical bottleneck, the importance of ...
Many modern data processing and HPC workloads are heavily memory-latency bound. A tempting propositi...
Despite large caches, main-memory access latencies still cause significant performance losses in man...
The memory system remains a major performance bottleneck in modern and future architectures. In this...
Ever increasing memory latencies and deeper pipelines push memory farther from the processor. Prefet...
Instruction cache miss latency is becoming an increasingly important performance bottleneck, especia...
In this dissertation, we provide hardware solutions to increase the efficiency of the cache hierarch...
The growing performance gap caused by high processor clock rates and slow DRAM accesses makes cache ...
Compiler-directed cache prefetching has the potential to hide much of the high memory latency seen ...
Processor performance has increased far faster than memories have been able to keep up with, forcing...
Prefetching, i.e., exploiting the overlap of processor computations with data accesses, is one of s...
This paper describes a new hardware approach to data and instruction prefetching for superscalar pr...
A well known performance bottleneck in computer architecture is the so-called memory wall. This term...