Abstract—Researchers have proposed numerous directory techniques to address multicore scalability whose behavior de-pends on the CPU’s particular configuration, e.g. core count and cache size. As CPUs continue to scale, it is essential to explore the directory’s architecture dependences. However, this is challenging using detailed simulation given the large number of CPU configurations that are possible. This paper proposes to use multicore reuse distance analysis to study coherence directories. We develop a framework to extract the directory access stream from parallel LRU stacks, enabling rapid analysis of the directory’s accesses and contents across both core count and cache size scaling. We also implement our framework in a profiler, an...
Emerging computer architectures will feature drastically decreased flops/byte (ratio of peak process...
Conventional directory coherence operates at the finest granularity possible, that of a cache block....
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 2010.CMOS scaling trends allow ...
Directories are one key part of a processor's cache coherence hardware, and constitute one of the ma...
The trend for multicore CPUs is towards increasing core count. One of the key limiters to scaling wi...
A key challenge in architecting a multicore processor is efficiently maintaining cache coherence. Di...
Multicore Reuse Distance (RD) analysis is a powerful tool that can potentially provide a parallel pr...
Understanding multicore memory behavior is crucial, but can be challenging due to the complex cache ...
This paper presents and validates methods to extend reuse distance analysis of application locality ...
The performance and energy efficiency of multicore systems are increasingly dominated by the costs o...
Performance on multicore processors is determined largely by on-chip cache. Computer architects hav...
Understanding multicore memory behavior is crucial, but can be challenging due to the cache hierarc...
As multicore processors implementing shared-memory programming models have become commonplace, analy...
Recent research shows that the occupancy of the coherence controllers is a major performance bottlen...
As the number of cores increases on chip multiprocessors, coherence is fast becoming a central issue...
Emerging computer architectures will feature drastically decreased flops/byte (ratio of peak process...
Conventional directory coherence operates at the finest granularity possible, that of a cache block....
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 2010.CMOS scaling trends allow ...
Directories are one key part of a processor's cache coherence hardware, and constitute one of the ma...
The trend for multicore CPUs is towards increasing core count. One of the key limiters to scaling wi...
A key challenge in architecting a multicore processor is efficiently maintaining cache coherence. Di...
Multicore Reuse Distance (RD) analysis is a powerful tool that can potentially provide a parallel pr...
Understanding multicore memory behavior is crucial, but can be challenging due to the complex cache ...
This paper presents and validates methods to extend reuse distance analysis of application locality ...
The performance and energy efficiency of multicore systems are increasingly dominated by the costs o...
Performance on multicore processors is determined largely by on-chip cache. Computer architects hav...
Understanding multicore memory behavior is crucial, but can be challenging due to the cache hierarc...
As multicore processors implementing shared-memory programming models have become commonplace, analy...
Recent research shows that the occupancy of the coherence controllers is a major performance bottlen...
As the number of cores increases on chip multiprocessors, coherence is fast becoming a central issue...
Emerging computer architectures will feature drastically decreased flops/byte (ratio of peak process...
Conventional directory coherence operates at the finest granularity possible, that of a cache block....
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 2010.CMOS scaling trends allow ...