This paper describes Cache Equalizer (CE), a novel distributed cache management scheme for large scale chip multiprocessors (CMPs). Our work is motivated by large asymmetry in cache sets usages. CE decouples the physical locations of cache blocks from their addresses for the sake of reducing misses caused by destructive interferences. Temporal pressure at the on-chip last-level cache, is continuously collected at a group (comprised of cache sets) granularity, and periodically recorded at the memory controller to guide the placement process. An incoming block is consequently placed at a cache group that exhibits the minimum pressure. CE provides Quality of Service (QoS) by robustly offering better performance than the baseline shared NUCA ca...
As the number of cores on Chip Multi-Processor (CMP) increases, the need for effective utilization (...
The increasing speed-gap between processor and memory and the limited memory bandwidth make last-lev...
In this thesis we present a comparative analysis of shared cache management techniquesfor chip multi...
This paper describes Cache Equalizer (CE), a novel distributed cache management scheme for large sca...
One of the key requirements to obtaining high performance from chip multiprocessors (CMPs) is to eff...
This paper describes Constrained Associative-Mapping-of-Tracking-Entries (C-AMTE), a scalable mechan...
As the momentum behind Chip Multi-Processors (CMPs) continues to grow, Last Level Cache (LLC) manage...
Increases in on-chip communication delay and the large working sets of server and scientific workloa...
Chip multiprocessors have the potential to exploit thread level parallelism, particularly attractive...
Microprocessor industry has converged on chip multiprocessor (CMP) as the architecture of choice to ...
This paper describes Constrained Associative-Mapping-of-Tracking-Entries (C-AMTE), a scalable mechan...
The number of processor cores and on-chip cache size has been increasing on chip multiprocessors (CM...
The number of processor cores and on-chip cache size has been increasing on chip multiprocessors (CM...
Abstract— Chip Multiprocessor (CMP) systems have become the reference architecture for designing mi...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
As the number of cores on Chip Multi-Processor (CMP) increases, the need for effective utilization (...
The increasing speed-gap between processor and memory and the limited memory bandwidth make last-lev...
In this thesis we present a comparative analysis of shared cache management techniquesfor chip multi...
This paper describes Cache Equalizer (CE), a novel distributed cache management scheme for large sca...
One of the key requirements to obtaining high performance from chip multiprocessors (CMPs) is to eff...
This paper describes Constrained Associative-Mapping-of-Tracking-Entries (C-AMTE), a scalable mechan...
As the momentum behind Chip Multi-Processors (CMPs) continues to grow, Last Level Cache (LLC) manage...
Increases in on-chip communication delay and the large working sets of server and scientific workloa...
Chip multiprocessors have the potential to exploit thread level parallelism, particularly attractive...
Microprocessor industry has converged on chip multiprocessor (CMP) as the architecture of choice to ...
This paper describes Constrained Associative-Mapping-of-Tracking-Entries (C-AMTE), a scalable mechan...
The number of processor cores and on-chip cache size has been increasing on chip multiprocessors (CM...
The number of processor cores and on-chip cache size has been increasing on chip multiprocessors (CM...
Abstract— Chip Multiprocessor (CMP) systems have become the reference architecture for designing mi...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
As the number of cores on Chip Multi-Processor (CMP) increases, the need for effective utilization (...
The increasing speed-gap between processor and memory and the limited memory bandwidth make last-lev...
In this thesis we present a comparative analysis of shared cache management techniquesfor chip multi...