Processor power constraints have come to the forefront over the last decade, heralded by the stagnation of clock frequency scaling. High-performance core and cache designs often utilize power-hungry techniques to increase parallelism. Conversely, the most energy-efficient designs opt for a serial execution to avoid unnecessary overheads. While both of these extremes constitute one-size-fits-all approaches, a judicious mix of parallel and serial execution has the potential to achieve the best of both high-performing and energy-efficient designs. This dissertation examines such hybrid designs for cores and caches. Firstly, we introduce a novel, hybrid out-of-order/in-order core microarchitecture. Instructions that are steered towards in-order...
The current trend in a processor design has moved from multicore to manycore (tens to hundreds, or m...
abstract: As the number of cores per chip increases, maintaining cache coherence becomes prohibitive...
Out-of-order engines are the basis for nearly every high performance general purpose processor today...
This dissertation addresses two sets of challenges facing processor design as the industry enters th...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
textOne of the major limiters to computer system performance has been the access to main memory, wh...
2018 Fall.Includes bibliographical references.As the performance gap between CPU and main memory con...
This thesis proposes a buffered dual access mode cache to reduce power consumption in multicore cach...
This work describes a cache architecture and memory model for 1000+ core microprocessors. Our appro...
textModern microprocessors devote a large portion of their chip area to caches in order to bridge t...
Multi-core processors have become the dominant processor architecture with 2, 4, and 8 cores on a ch...
The increasing gap between processor and main memory speeds has become a serious bottleneck towards ...
Real-time computing is not just fast computing but time-predictable computing. Many tasks in safety-...
University of Minnesota Master of Science thesis. October 2014. Major: Electrical Engineering. Advis...
Modern superscalar pipelines have tremendous capacity to consume the instruction stream. This has be...
The current trend in a processor design has moved from multicore to manycore (tens to hundreds, or m...
abstract: As the number of cores per chip increases, maintaining cache coherence becomes prohibitive...
Out-of-order engines are the basis for nearly every high performance general purpose processor today...
This dissertation addresses two sets of challenges facing processor design as the industry enters th...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
textOne of the major limiters to computer system performance has been the access to main memory, wh...
2018 Fall.Includes bibliographical references.As the performance gap between CPU and main memory con...
This thesis proposes a buffered dual access mode cache to reduce power consumption in multicore cach...
This work describes a cache architecture and memory model for 1000+ core microprocessors. Our appro...
textModern microprocessors devote a large portion of their chip area to caches in order to bridge t...
Multi-core processors have become the dominant processor architecture with 2, 4, and 8 cores on a ch...
The increasing gap between processor and main memory speeds has become a serious bottleneck towards ...
Real-time computing is not just fast computing but time-predictable computing. Many tasks in safety-...
University of Minnesota Master of Science thesis. October 2014. Major: Electrical Engineering. Advis...
Modern superscalar pipelines have tremendous capacity to consume the instruction stream. This has be...
The current trend in a processor design has moved from multicore to manycore (tens to hundreds, or m...
abstract: As the number of cores per chip increases, maintaining cache coherence becomes prohibitive...
Out-of-order engines are the basis for nearly every high performance general purpose processor today...