Recent research suggests that there are large variations in a cache's spatial usage, both within and across programs. Unfortunately, conventional caches typically employ fixed cache line sizes to balance the exploitation of spatial and temporal locality, and to avoid prohibitive cache fill bandwidth demands. The resulting inability of conventional caches to exploit spatial variations leads to sub-optimal performance and unnecessary cache power dissipation. This paper describes the Spatial Pattern Predictor (SPP), a cost-effective hardware mechanism that accurately predicts reference patterns within a spatial group (i.e., a contiguous region of data in memory) at runtime. The key observation enabling an accurate, yet low-cost, SPP design is ...
The increasing speed gap between processor microarchitectures and memory technologies can potentiall...
This paper studies the theory of caching and temporal and spatial locality. We show the following re...
Cache performance analysis is becoming increasingly important in microprocessor design. This work ex...
Modern cache designs exploit spatial locality by fetching large blocks of data called cache lines on...
Cache memories are commonly implemented through multiple memory banks to improve bandwidth and laten...
Journal ArticleThe speed gap between processors and memory system is becoming the performance bottle...
Cache replacement and branch prediction are two important microarchitectural prediction techniques f...
Efficient data supply to the processor is the one of the keys to achieve high performance. However, ...
Modern processors rely heavily on speculation to provide performance. Techniques such as branch pred...
Improving cache performance requires understanding cache behavior. However, measuring cache performa...
Improving cache performance requires understanding cache behavior. However, measuring cache performa...
As the gap between memory and processor performance continues to grow, more and more programs will ...
Low-latency data access is essential for performance. To achieve this, processors use fast first-lev...
With the increasing performance gap between the processor and the memory, the importance of caches i...
The central data structures for many applications in scientific computing are large multidimensional...
The increasing speed gap between processor microarchitectures and memory technologies can potentiall...
This paper studies the theory of caching and temporal and spatial locality. We show the following re...
Cache performance analysis is becoming increasingly important in microprocessor design. This work ex...
Modern cache designs exploit spatial locality by fetching large blocks of data called cache lines on...
Cache memories are commonly implemented through multiple memory banks to improve bandwidth and laten...
Journal ArticleThe speed gap between processors and memory system is becoming the performance bottle...
Cache replacement and branch prediction are two important microarchitectural prediction techniques f...
Efficient data supply to the processor is the one of the keys to achieve high performance. However, ...
Modern processors rely heavily on speculation to provide performance. Techniques such as branch pred...
Improving cache performance requires understanding cache behavior. However, measuring cache performa...
Improving cache performance requires understanding cache behavior. However, measuring cache performa...
As the gap between memory and processor performance continues to grow, more and more programs will ...
Low-latency data access is essential for performance. To achieve this, processors use fast first-lev...
With the increasing performance gap between the processor and the memory, the importance of caches i...
The central data structures for many applications in scientific computing are large multidimensional...
The increasing speed gap between processor microarchitectures and memory technologies can potentiall...
This paper studies the theory of caching and temporal and spatial locality. We show the following re...
Cache performance analysis is becoming increasingly important in microprocessor design. This work ex...