Future microprocessors will be highly susceptible to transient errors as the sizes of transistors decrease due to CMOS scaling. Prior techniques advocated full scale structural or temporal redundancy to achieve fault tolerance. Though they can provide complete fault coverage, they incur significant area and/or performance overhead. It is desirable to have a mechanism that can provide, incomplete, but still sufficiently high fault coverage with negligible area and/or performance cost. To achieve this goal, in this paper, we examine exploiting speculative structures that already exist in modern processors to provide partial fault coverage. We start by quantifying how much the faulty program deviates from the correct program execution in term...
Over three decades of continuous scaling in CMOS technology has led to tremendous improvements in pr...
To meet an insatiable consumer demand for greater performance at less power, silicon technology has ...
Redundant threading architectures duplicate all instructions to detect and possibly recover from tra...
Abstract—Symptom-based fault-tolerant methods are attrac-tive, since they replicate a minor part of ...
Abstract—Continuous circuit and wire miniaturization in-creasingly exert more pressure on the comput...
Speculative processors include performance oriented modules that improve the processor performance b...
CMOS scaling increases susceptibility of microprocessors to transient faults. Most current proposals...
Continuous circuit and wire miniaturization increasingly exert more pressure on the computer designe...
textSilicon reliability has reemerged as a very important problem in digital system design. As volta...
Abstract—As silicon technology continues to scale down and validation expenses continue to increase,...
With continued CMOS scaling, future shipped hardware will be increasingly vulnerable to in-the-field...
Embedded systems are increasingly deployed in harsh environments that their components were not nece...
Increasing design complexity for current and future generations of microelectronic technologies lead...
Intermittent hardware faults are hard to diagnose as they occur non-deterministically. Hardware-only...
Branch prediction feeds a speculative execution processor core with instructions. Branch mispredicti...
Over three decades of continuous scaling in CMOS technology has led to tremendous improvements in pr...
To meet an insatiable consumer demand for greater performance at less power, silicon technology has ...
Redundant threading architectures duplicate all instructions to detect and possibly recover from tra...
Abstract—Symptom-based fault-tolerant methods are attrac-tive, since they replicate a minor part of ...
Abstract—Continuous circuit and wire miniaturization in-creasingly exert more pressure on the comput...
Speculative processors include performance oriented modules that improve the processor performance b...
CMOS scaling increases susceptibility of microprocessors to transient faults. Most current proposals...
Continuous circuit and wire miniaturization increasingly exert more pressure on the computer designe...
textSilicon reliability has reemerged as a very important problem in digital system design. As volta...
Abstract—As silicon technology continues to scale down and validation expenses continue to increase,...
With continued CMOS scaling, future shipped hardware will be increasingly vulnerable to in-the-field...
Embedded systems are increasingly deployed in harsh environments that their components were not nece...
Increasing design complexity for current and future generations of microelectronic technologies lead...
Intermittent hardware faults are hard to diagnose as they occur non-deterministically. Hardware-only...
Branch prediction feeds a speculative execution processor core with instructions. Branch mispredicti...
Over three decades of continuous scaling in CMOS technology has led to tremendous improvements in pr...
To meet an insatiable consumer demand for greater performance at less power, silicon technology has ...
Redundant threading architectures duplicate all instructions to detect and possibly recover from tra...