Performance loss due to long-latency memory accesses can be reduced by servicing multiple memory accesses concurrently. The notion of generating and servicing long-latency cache misses in parallel is called Memory Level Parallelism (MLP). MLP is not uniform across cache misses: some misses occur in isolation while others occur in parallel with other misses. Isolated misses are more costly to performance than parallel misses. However, traditional cache replacement is not aware of the MLP-dependent cost differential between misses. Cache replacement, if made MLP-aware, can improve performance by reducing the number of performance-critical isolated misses. This paper makes two key contributions. First, it proposes a framework for MLP...
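To make the cost differential concrete, the following is a minimal sketch (in Python, with illustrative names such as mlp_cost and outstanding_per_cycle that are assumptions, not taken from the abstract above): every cycle a miss is outstanding contributes a cost shared equally among all misses outstanding in that cycle, so an isolated miss bears its full latency while overlapped misses split it.

def mlp_cost(miss_cycles, outstanding_per_cycle):
    # Each cycle the miss is outstanding contributes 1 / (number of misses
    # concurrently outstanding that cycle), so latency cost is shared.
    return sum(1.0 / outstanding_per_cycle[c] for c in miss_cycles)

# An isolated miss outstanding for 4 cycles bears the full cost (4.0),
# while the same miss overlapped with three others costs only 1.0.
isolated = mlp_cost(range(4), {c: 1 for c in range(4)})
parallel = mlp_cost(range(4), {c: 4 for c in range(4)})
print(isolated, parallel)  # 4.0 1.0

Under this accounting, a replacement policy that retains blocks whose misses would be isolated (high cost) and preferentially evicts blocks whose misses overlap with others (low cost) reduces the number of performance-critical isolated misses.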
Despite extensive developments in improving cache hit rates, designing an optimal cache replacement ...
Caches mitigate the long memory latency that limits the performance of modern processors. However, c...
As the performance gap between the processor cores and the memory subsystem increases, designers are...
Dynamic partitioning of shared caches has been proposed to improve performance of traditi...
Recently-proposed processor microarchitectures for high Memory Level Parallelism (MLP) promise subst...
The performance loss resulting from different cache misses is variable in modern systems for two rea...
The limitation imposed by instruction-level parallelism (ILP) has motivated the use of thread-level ...
The ever-increasing computational power of contemporary microprocessors reduces the execution time s...
The increasing speed-gap between processor and memory and the limited memory bandwidth make last-lev...
A limit to computer system performance is the miss penalty for fetching data and instructions from l...
Memory latency has become an important performance bottleneck in current microprocessors. This probl...
One of the major limiters to computer system performance has been the access to main memory, wh...
Classic cache replacement policies assume that miss costs are uniform. However, the correlation betw...