Contrary to existing work that demonstrate significant improvements in performance with larger reorder buffers, the work presented in this dissertation shows that larger instruction windows do not necessarily provide the significant improvements in performance. By using detailed models of the DRAM system and the memory subsystem, we show that increasing out-of-order aggressiveness by increasing reorder buffer sizes beyond 128 entries no longer buys any improvement in processor performance. In fact we observe that it can actually degrade processor performance. Additionally, this dissertation demonstrates a non-intuitive problem associated with the out-of-order execution of memory instructions: the reordering of memory instructions can cause ...
The memory wall is the predicted situation where improvements to processor speed will be masked by t...
High-performance multiprocessor systems built around out-of-order processors with aggressive branch ...
Abstract-Design parameters interact in complex ways in modern processors, especially because out-of-...
The use of large instruction windows coupled with aggressive out-of-order and prefetching capabiliti...
The use of large instruction windows coupled with aggressive out-of order and prefetching capabiliti...
textMain memory system performance is crucial for high performance microprocessors. Even though the...
Processor efficiency can be described with the help of a number of desirable effects or metrics, f...
PhD ThesisCurrent microprocessors improve performance by exploiting instruction-level parallelism (I...
Current microprocessors improve performance by exploiting instruction-level parallelism (ILP). ILP h...
Given a fixed CPU architecture and a fixed DRAM timing specification, there is still a large design ...
The RAMpage memory hierarchy addresses the growing concern about the memory wall -- the possibility ...
textHigh-performance processors tolerate latency using out-of-order execution. Unfortunately, today...
The performance characteristics of modern DRAM memory systems are impacted by two primary attributes...
The design and implementation of the commodity memory architecture has resulted in significant limit...
For decades, main memory has enjoyed the continuous scaling of its physical substrate: DRAM (Dynamic...
The memory wall is the predicted situation where improvements to processor speed will be masked by t...
High-performance multiprocessor systems built around out-of-order processors with aggressive branch ...
Abstract-Design parameters interact in complex ways in modern processors, especially because out-of-...
The use of large instruction windows coupled with aggressive out-of-order and prefetching capabiliti...
The use of large instruction windows coupled with aggressive out-of order and prefetching capabiliti...
textMain memory system performance is crucial for high performance microprocessors. Even though the...
Processor efficiency can be described with the help of a number of desirable effects or metrics, f...
PhD ThesisCurrent microprocessors improve performance by exploiting instruction-level parallelism (I...
Current microprocessors improve performance by exploiting instruction-level parallelism (ILP). ILP h...
Given a fixed CPU architecture and a fixed DRAM timing specification, there is still a large design ...
The RAMpage memory hierarchy addresses the growing concern about the memory wall -- the possibility ...
textHigh-performance processors tolerate latency using out-of-order execution. Unfortunately, today...
The performance characteristics of modern DRAM memory systems are impacted by two primary attributes...
The design and implementation of the commodity memory architecture has resulted in significant limit...
For decades, main memory has enjoyed the continuous scaling of its physical substrate: DRAM (Dynamic...
The memory wall is the predicted situation where improvements to processor speed will be masked by t...
High-performance multiprocessor systems built around out-of-order processors with aggressive branch ...
Abstract-Design parameters interact in complex ways in modern processors, especially because out-of-...