Microprocessors are traditionally designed to provide “best overall” performance across a wide range of applications and operating environments. Several groups have proposed hardware techniques that save energy by “downsizing” hardware resources that are underutilized by particular applications. We explore the converse: “upsizing” hardware resources in order to improve performance relative to an aggressively clocked baseline processor. Our proposal depends critically on the ability to change frequencies independently in separate domains of a globally asynchronous, locally synchronous (GALS) microprocessor. We use a variant of our multiple clock domain (MCD) processor, with four independently clocked domains. Each domain is streamlined with...
Journal ArticleConventional microarchitectures choose a single memory hierarchy design point target...
Journal ArticleAlthough microprocessor performance continues to increase at a rapid pace, the growin...
Modern microprocessors integrate a growing number of compo-nents on a single chip, such as processor...
Microprocessors are traditionally designed to provide "best overall" performance across a ...
Journal ArticleAs clock frequency increases and feature size decreases, clock distribution and wire...
We analyze an Alpha 21264-like Globally–Asynchronous, Locally–Synchronous (GALS) processor organized...
Multiple clock domains is one solution to the increasing problem of propagating the clock signal acr...
Abstract — This paper presents methods for addressing two sources of variability in the context of m...
In this paper, we present a clustered, multiple-clock domain (CMCD) microarchitecture that combines ...
The energy demands of modern mobile devices have driven a trend towards heterogeneous multi-core sys...
Journal ArticleThe ever increasing demand for high clock speeds and the desire to exploit abundant ...
The continuing advances in VLSI technology have fueled dramatic performance gains for general-purpo...
This thesis is concerned with hardware approaches for maximizing the number of independent instructi...
Heterogeneous processors such as Arm’s big.LITTLE have become popular as they offer a choice betwee...
Journal ArticleClustered microarchitectures are an attractive alternative to large monolithic super...
Journal ArticleConventional microarchitectures choose a single memory hierarchy design point target...
Journal ArticleAlthough microprocessor performance continues to increase at a rapid pace, the growin...
Modern microprocessors integrate a growing number of compo-nents on a single chip, such as processor...
Microprocessors are traditionally designed to provide "best overall" performance across a ...
Journal ArticleAs clock frequency increases and feature size decreases, clock distribution and wire...
We analyze an Alpha 21264-like Globally–Asynchronous, Locally–Synchronous (GALS) processor organized...
Multiple clock domains is one solution to the increasing problem of propagating the clock signal acr...
Abstract — This paper presents methods for addressing two sources of variability in the context of m...
In this paper, we present a clustered, multiple-clock domain (CMCD) microarchitecture that combines ...
The energy demands of modern mobile devices have driven a trend towards heterogeneous multi-core sys...
Journal ArticleThe ever increasing demand for high clock speeds and the desire to exploit abundant ...
The continuing advances in VLSI technology have fueled dramatic performance gains for general-purpo...
This thesis is concerned with hardware approaches for maximizing the number of independent instructi...
Heterogeneous processors such as Arm’s big.LITTLE have become popular as they offer a choice betwee...
Journal ArticleClustered microarchitectures are an attractive alternative to large monolithic super...
Journal ArticleConventional microarchitectures choose a single memory hierarchy design point target...
Journal ArticleAlthough microprocessor performance continues to increase at a rapid pace, the growin...
Modern microprocessors integrate a growing number of compo-nents on a single chip, such as processor...