The increasing gap between processor and main memory speeds has become a serious bottleneck towards further improvement in system performance. Data prefetching techniques have been proposed to hide the performance impact of such long memory latencies. But most of the currently proposed data prefetchers predict future memory accesses based on current memory misses. This limits the opportunity that can be exploited to guide prefetching. In this thesis, we propose a branch-directed data prefetcher that uses the high prediction accuracies of current-generation branch predictors to predict a future basic block trace that the program will execute and issues prefetches for all the identified memory instructions contained therein. We also propose a...
Hardware predictors are widely used to improve the performance of modern processors. These predictor...
The “Memory Wall”, the vast gulf between processor execution speed and memory latency, has led to th...
General purpose processors were once designed with the major goal of maximizing performance. As powe...
Modern superscalar pipelines have tremendous capacity to consume the instruction stream. This has be...
The increasing gap between processor and main memory speeds has become a serious bottleneck towards ...
Microprocessor performance has been increasing at an exponential rate while memory system performanc...
Hardware prefetching is an effective technique for hiding cache miss latencies in modern processor d...
The “Memory Wall” [1], is the gap in performance between the processor and the main memory. Over the...
Effective data prefetching requires accurate mechanisms to predict both “which” cache blocks to pref...
The large number of cache misses of current applications coupled with the increasing cache miss late...
It is well known that memory latency is a major deterrent to achieving the maximum possible performa...
Cache performance analysis is becoming increasingly important in microprocessor design. This work ex...
Abstract—Computer architecture is beset by two opposing trends. Technology scaling and deep pipelini...
CPU speeds double approximately every eighteen months, while main memory speeds double only about ev...
Memory latency becoming an increasing important performance bottleneck as the gap between processor ...
Hardware predictors are widely used to improve the performance of modern processors. These predictor...
The “Memory Wall”, the vast gulf between processor execution speed and memory latency, has led to th...
General purpose processors were once designed with the major goal of maximizing performance. As powe...
Modern superscalar pipelines have tremendous capacity to consume the instruction stream. This has be...
The increasing gap between processor and main memory speeds has become a serious bottleneck towards ...
Microprocessor performance has been increasing at an exponential rate while memory system performanc...
Hardware prefetching is an effective technique for hiding cache miss latencies in modern processor d...
The “Memory Wall” [1], is the gap in performance between the processor and the main memory. Over the...
Effective data prefetching requires accurate mechanisms to predict both “which” cache blocks to pref...
The large number of cache misses of current applications coupled with the increasing cache miss late...
It is well known that memory latency is a major deterrent to achieving the maximum possible performa...
Cache performance analysis is becoming increasingly important in microprocessor design. This work ex...
Abstract—Computer architecture is beset by two opposing trends. Technology scaling and deep pipelini...
CPU speeds double approximately every eighteen months, while main memory speeds double only about ev...
Memory latency becoming an increasing important performance bottleneck as the gap between processor ...
Hardware predictors are widely used to improve the performance of modern processors. These predictor...
The “Memory Wall”, the vast gulf between processor execution speed and memory latency, has led to th...
General purpose processors were once designed with the major goal of maximizing performance. As powe...