Cache performance analysis is becoming increasingly important in microprocessor design. This work examines the effect of mispredicted-path memory references on cache performance and proposes a new algorithm to prefetch instruction cache lines. The first cache study describes the effects of deep speculation on data cache miss rates and memory traffic. The major result was that traffic is only slightly increased by speculation--less than 15% for most SPEC benchmarks even when executing 50 instructions down a mispredicted path. Cache misses are not significantly increased because the mispredicted paths references act as prefetch operations for later correct path references. The latter result led to the development of a new instruction cache pr...
Data prefetching effectively reduces the negative effects of long load latencies on the performance ...
Prefetching is an important technique for reducing the average latency of memory accesses in scalabl...
Modern superscalar pipelines have tremendous capacity to consume the instruction stream. This has be...
As the degree of instruction-level parallelism in superscalar architectures increases, the gap betwe...
It is well known that memory latency is a major deterrent to achieving the maximum possible performa...
Instruction cache misses can severely limit the performance of both superscalar processors and high ...
The speculated execution of threads in a multithreaded architecture plus the branch prediction used ...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
Compiler-directed cache prefetching has the poten-tial to hide much of the high memory latency seen ...
CPU speeds double approximately every eighteen months, while main memory speeds double only about ev...
Instruction prefetching is an important aspect of contemporary high performance computer architectur...
High-performance multiprocessor systems built around out-of-order processors with aggressive branch ...
The large number of cache misses of current applications coupled with the increasing cache miss late...
The full text of this article is not available on SOAR. WSU users can access the article via IEEE Xp...
Data prefetching effectively reduces the negative effects of long load latencies on the performance ...
Prefetching is an important technique for reducing the average latency of memory accesses in scalabl...
Modern superscalar pipelines have tremendous capacity to consume the instruction stream. This has be...
As the degree of instruction-level parallelism in superscalar architectures increases, the gap betwe...
It is well known that memory latency is a major deterrent to achieving the maximum possible performa...
Instruction cache misses can severely limit the performance of both superscalar processors and high ...
The speculated execution of threads in a multithreaded architecture plus the branch prediction used ...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
Compiler-directed cache prefetching has the poten-tial to hide much of the high memory latency seen ...
CPU speeds double approximately every eighteen months, while main memory speeds double only about ev...
Instruction prefetching is an important aspect of contemporary high performance computer architectur...
High-performance multiprocessor systems built around out-of-order processors with aggressive branch ...
The large number of cache misses of current applications coupled with the increasing cache miss late...
The full text of this article is not available on SOAR. WSU users can access the article via IEEE Xp...
Data prefetching effectively reduces the negative effects of long load latencies on the performance ...
Prefetching is an important technique for reducing the average latency of memory accesses in scalabl...
Modern superscalar pipelines have tremendous capacity to consume the instruction stream. This has be...