Destination-set prediction can improve the latency/bandwidth tradeoff in shared-memory multiprocessors. The destination set is the collection of processors that receive a particular coherence request. Snooping protocols send requests to the maximal destination set (i.e., all processors), reducing latency for cache-to-cache misses at the expense of increased traffic. Directory protocols send requests to the minimal destination set, reducing bandwidth at the expense of an indirection through the directory for cache-to-cache misses. Recently proposed hybrid protocols trade off latency against bandwidth by sending requests directly to a predicted destination set. This paper explores the destination-set predictor design space, focusing on a collecti...
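The three kinds of destination set described above can be illustrated with a minimal Python sketch. It is not the paper's predictor. The function names, the rule that a read needs only the owner while a write needs the owner plus all sharers, and the rule that a prediction missing a needed processor costs an indirection are all assumptions made for illustration.

```python
from typing import FrozenSet, Tuple


def snooping_set(num_procs: int) -> FrozenSet[int]:
    """Maximal destination set: every processor receives the request."""
    return frozenset(range(num_procs))


def minimal_set(requester: int, owner: int,
                sharers: FrozenSet[int], is_write: bool) -> FrozenSet[int]:
    """Assumed minimal set: the owner for a read, owner plus sharers for a write.

    The requester is excluded because it does not need its own request.
    """
    needed = {owner}
    if is_write:
        needed |= sharers
    return frozenset(needed) - {requester}


def prediction_outcome(predicted: FrozenSet[int],
                       needed: FrozenSet[int]) -> Tuple[int, bool]:
    """Return (request messages sent, whether an indirection is still needed).

    Assumption: a predicted set suffices only if it covers every needed
    processor; otherwise the request falls back to the directory.
    """
    return len(predicted), not needed <= predicted


# Hypothetical 4-processor system: processor 0 reads a block owned by 2.
needed = minimal_set(requester=0, owner=2,
                     sharers=frozenset({1, 3}), is_write=False)
broadcast = prediction_outcome(snooping_set(4), needed)   # many messages, no indirection
good_guess = prediction_outcome(frozenset({2}), needed)   # one message, no indirection
bad_guess = prediction_outcome(frozenset({1}), needed)    # one message, indirection
```

Under these assumptions, broadcast and an exact prediction both avoid the indirection, but the exact prediction sends far fewer messages. A predictor's quality is how often it lands near that second case.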
This paper proposes and evaluates Sharing/Timing Adaptive Push (STAP), a dynamic scheme for preempti...
Coherence misses in shared-memory multiprocessors account for a substantial fraction of execution ti...
Thesis for the degree of Master of Science. Recent research indicates that prediction-based coherence optim...
This work explores the possibility of using speculation at the directories in a cache coherent non-u...
This dissertation explores techniques for reducing the costs of inter-processor communication i...
Efficient data supply to the processor is one of the keys to achieving high performance. However, ...
This paper advocates that cache coherence protocols use a bandwidth adaptive approach to adjust to v...
One common cause of poor performance in large-scale shared-memory multiprocessors is limited memory ...
Recent research advocates using general message predictors to learn and predict the coherence activi...
The goal of this paper is to gain insight into the relative performance of communication mechanisms ...
The transition to multi-core architectures can be attributed mainly to fundamental limitations in cl...
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 2010. CMOS scaling trends allow ...
Design complexity and limited power budget are causing the number of cores on the same chip to grow ...