DRAM row-buffers have become a critical level of cache in the memory hierarchy to exploit spatial locality in the cache miss stream. Row-buffer conflicts occur when a sequence of requests on different pages goes to the same memory bank, causing higher memory access latency than requests to the same row or to different banks. In this study, we first show that the address mapping symmetry between the cache and DRAM is the inherent source of row-buffer conflicts. Breaking the symmetry to reduce the conflicts and to retain the spatial locality, we propose and evaluate a permutation-based page interleaving scheme. We have also evaluated and compared two representative cache mapping schemes that break the symmetry at the cache level. We show that...
The memory system is often the weakest link in the performance of today’s computers. Cache design ha...
Read and write requests from a processor contend for the main memory data bus. System performance de...
Modern cache designs exploit spatial locality by fetching large blocks of data called cache lines on...
In computer systems with large, physically-indexed, direct-mapped caches, a poor mapping from virtua...
This paper proposes an optimization by an alternative approach to memory mapping. Caches with low se...
The cache interference is found to play a critical role in optimizing cache allocation among concurr...
Applications often under-utilize cache space and there are no software locality optimization techniq...
Memory (cache, DRAM, and disk) is in charge of providing data and instructions to a computer\u27s pr...
This paper describes a method for improving the performance of a large direct-mapped cache by reduci...
This thesis studies the use of software methods to improve memory performance in a heterogeneous cac...
The cache interference is found to play a critical role in optimizing cache allocation among concurr...
Hardware trends have produced an increasing disparity between processor speeds and memory access tim...
textContemporary DRAM systems have maintained impressive scaling by managing a careful balance betwe...
There is a lack of flexibility in a fixed page location system because every location within a page ...
The increasing capacity of NAND flash memory leads to large RAM footprint on address mapping in the ...
The memory system is often the weakest link in the performance of today’s computers. Cache design ha...
Read and write requests from a processor contend for the main memory data bus. System performance de...
Modern cache designs exploit spatial locality by fetching large blocks of data called cache lines on...
In computer systems with large, physically-indexed, direct-mapped caches, a poor mapping from virtua...
This paper proposes an optimization by an alternative approach to memory mapping. Caches with low se...
The cache interference is found to play a critical role in optimizing cache allocation among concurr...
Applications often under-utilize cache space and there are no software locality optimization techniq...
Memory (cache, DRAM, and disk) is in charge of providing data and instructions to a computer\u27s pr...
This paper describes a method for improving the performance of a large direct-mapped cache by reduci...
This thesis studies the use of software methods to improve memory performance in a heterogeneous cac...
The cache interference is found to play a critical role in optimizing cache allocation among concurr...
Hardware trends have produced an increasing disparity between processor speeds and memory access tim...
textContemporary DRAM systems have maintained impressive scaling by managing a careful balance betwe...
There is a lack of flexibility in a fixed page location system because every location within a page ...
The increasing capacity of NAND flash memory leads to large RAM footprint on address mapping in the ...
The memory system is often the weakest link in the performance of today’s computers. Cache design ha...
Read and write requests from a processor contend for the main memory data bus. System performance de...
Modern cache designs exploit spatial locality by fetching large blocks of data called cache lines on...