Conventional CAD methodologies optimize a processor module for correct operation and prohibit timing violations during nominal operation. We propose recovery-driven design, a design approach that optimizes a processor module for a target timing error rate instead of correct operation. The target error rate is chosen based on how many errors can be gainfully tolerated by a hardware or software error resilience mechanism. We show that significant power bene ts are possible from a recovery-driven design approach that deliberately allows errors caused by voltage overscaling to occur during nominal operation, while relying on an error resilience technique to tolerate these errors. We present a detailed evaluation and analysis of such a design-le...
Resilient design techniques are used to (i) ensure correct operation under dynamic variations and to...
Recently, supply voltage has been reduced for low power applications, and near threshold computing (...
As transistor technology scales ever further, hardware reliability is becoming harder to manage. Th...
Conventional computer-aided design (CAD) methodologies optimize a processor module for correct opera...
Conventional CAD methodologies optimize a processor module for correct operation and prohibit timing...
Conventional CAD methodologies optimize a processor module for correct operation, and prohibit timin...
Current processor designs have a critical operating point that sets a hard limit on voltage scaling....
Timing speculation has been proposed as a technique for maximizing energy efficiency of processors w...
Technology scaling has progressed to enable integrated circuits with extremely high density enabling...
Many programs exhibit application level error resilience which allows certain subcomputations to exe...
As traditional approaches for reducing power in microprocessors are being exhausted, extreme power c...
In recent years, circuit reliability in modern high-performance processors has become increasingly i...
In recent years, circuit reliability in modern high-performance processors has become increasingly i...
Over two decades of research has led to numerous low-power design techniques being reported. Two pop...
Soft errors (or Transient faults) are temporary faults that arise in a circuit due to a variety of i...
Resilient design techniques are used to (i) ensure correct operation under dynamic variations and to...
Recently, supply voltage has been reduced for low power applications, and near threshold computing (...
As transistor technology scales ever further, hardware reliability is becoming harder to manage. Th...
Conventional computer-aided design (CAD) methodologies optimize a processor module for correct opera...
Conventional CAD methodologies optimize a processor module for correct operation and prohibit timing...
Conventional CAD methodologies optimize a processor module for correct operation, and prohibit timin...
Current processor designs have a critical operating point that sets a hard limit on voltage scaling....
Timing speculation has been proposed as a technique for maximizing energy efficiency of processors w...
Technology scaling has progressed to enable integrated circuits with extremely high density enabling...
Many programs exhibit application level error resilience which allows certain subcomputations to exe...
As traditional approaches for reducing power in microprocessors are being exhausted, extreme power c...
In recent years, circuit reliability in modern high-performance processors has become increasingly i...
In recent years, circuit reliability in modern high-performance processors has become increasingly i...
Over two decades of research has led to numerous low-power design techniques being reported. Two pop...
Soft errors (or Transient faults) are temporary faults that arise in a circuit due to a variety of i...
Resilient design techniques are used to (i) ensure correct operation under dynamic variations and to...
Recently, supply voltage has been reduced for low power applications, and near threshold computing (...
As transistor technology scales ever further, hardware reliability is becoming harder to manage. Th...