Abstract-Design parameters interact in complex ways in modern processors, especially because out-of-order issue and decoupling buffers allow latencies to be overlapped. Trade-offs among instruction-window size, branch-prediction accuracy, and instruction- and data-cache size can change as these parameters move through different domains. For example, modeling unrealistic caches can under- or overstate the benefits of better prediction or a larger instruction window. Avoiding such pitfalls requires understanding how all these parameters interact. Because such methodological mistakes are common, this paper provides a comprehensive set of SimpleScalar simulation results from SPECint95 programs, showing the interactions among these major structu...
Despite years of study, branch mispredictions remain as a significant performance impediment in pipe...
Instruction cache performance is critical to instruction fetch efficiency and overall processor perf...
Processor architectures will increasingly rely on issuing multiple instructions to make full use of ...
Detailed micro-architecture studies often require time-consuming, cycle-accurate simulation. Unfortu...
Achieving high instruction issue rates depends on the ability to dynamically predict branches. We co...
Branch prediction accuracy is a very important factor for superscalar processor performance. The abi...
Due to cost, time, and flexibility constraints, simulators are needed to explore the design space wh...
For many applications, branch mispredictions and cache misses limit a processor’s performance to a l...
Cache performance analysis is becoming increasingly important in microprocessor design. This work ex...
Accurate branch prediction is critical to performance; mispredicted branches mean that ten’s of cycl...
As the gap between memory and processor performance continues to grow, more and more programs will ...
It has been claimed that the execution time of a program can often be predicted more accurately on a...
The use of caches poses a difficult tradeoff for architects of real-time systems. While caches provi...
Application-specific system-on-chip platforms create the opportunity to customize the cache configur...
Accurate cache and branch predictor simulation is a crucial factor when evaluating the performance a...
Despite years of study, branch mispredictions remain as a significant performance impediment in pipe...
Instruction cache performance is critical to instruction fetch efficiency and overall processor perf...
Processor architectures will increasingly rely on issuing multiple instructions to make full use of ...
Detailed micro-architecture studies often require time-consuming, cycle-accurate simulation. Unfortu...
Achieving high instruction issue rates depends on the ability to dynamically predict branches. We co...
Branch prediction accuracy is a very important factor for superscalar processor performance. The abi...
Due to cost, time, and flexibility constraints, simulators are needed to explore the design space wh...
For many applications, branch mispredictions and cache misses limit a processor’s performance to a l...
Cache performance analysis is becoming increasingly important in microprocessor design. This work ex...
Accurate branch prediction is critical to performance; mispredicted branches mean that ten’s of cycl...
As the gap between memory and processor performance continues to grow, more and more programs will ...
It has been claimed that the execution time of a program can often be predicted more accurately on a...
The use of caches poses a difficult tradeoff for architects of real-time systems. While caches provi...
Application-specific system-on-chip platforms create the opportunity to customize the cache configur...
Accurate cache and branch predictor simulation is a crucial factor when evaluating the performance a...
Despite years of study, branch mispredictions remain as a significant performance impediment in pipe...
Instruction cache performance is critical to instruction fetch efficiency and overall processor perf...
Processor architectures will increasingly rely on issuing multiple instructions to make full use of ...