Intermittent hardware faults are hard to diagnose as they occur non-deterministically. Hardware-only diagnosis techniques incur significant power and area overheads. On the other hand, software-only diagnosis techniques have low power and area overheads, but have limited visibility into many micro-architectural structures and hence cannot diagnose faults in them. To overcome these limitations, we propose a hardware-software integrated framework for diagnosing intermittent faults. The hardware part of our framework, called SCRIBE continuously records the resource usage information of every instruction in the processor, and exposes it to the software layer. SCRIBE has 0.95% on-chip area overhead, incurs a performance overhead of 12% and pow...
There is broad consensus among academic and industrial researchers in computer architecture that har...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
Abstract. Intermittent failures and nondeterministic behavior complicate and compromise the effectiv...
Abstract—Intermittent hardware faults are hard to diagnose as they occur non-deterministically at th...
Over three decades of continuous scaling in CMOS technology has led to tremendous improvements in pr...
It is a great challenge to build reliable computer systems with unreliable hardware and buggy softwa...
Abstract—This work proposes a new, software-based, defect detection and diagnosis technique. We intr...
Abstract—Intermittent hardware faults are bursts of errors that last from a few CPU cycles to a few ...
Abstract—As silicon technology continues to scale down and validation expenses continue to increase,...
With continued CMOS scaling, future shipped hardware will be increasingly vulnerable to in-the-field...
With continued CMOS scaling, future shipped hardware will be increasingly vulnerable to in-the-field...
Hardware errors are projected to increase in modern computer systems due to shrinking feature sizes ...
Aggressive scaling of CMOS transistors has enabled extensive system integration and building faster ...
Unpredictable hardware faults and software bugs lead to application crashes, incorrect computations,...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
There is broad consensus among academic and industrial researchers in computer architecture that har...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
Abstract. Intermittent failures and nondeterministic behavior complicate and compromise the effectiv...
Abstract—Intermittent hardware faults are hard to diagnose as they occur non-deterministically at th...
Over three decades of continuous scaling in CMOS technology has led to tremendous improvements in pr...
It is a great challenge to build reliable computer systems with unreliable hardware and buggy softwa...
Abstract—This work proposes a new, software-based, defect detection and diagnosis technique. We intr...
Abstract—Intermittent hardware faults are bursts of errors that last from a few CPU cycles to a few ...
Abstract—As silicon technology continues to scale down and validation expenses continue to increase,...
With continued CMOS scaling, future shipped hardware will be increasingly vulnerable to in-the-field...
With continued CMOS scaling, future shipped hardware will be increasingly vulnerable to in-the-field...
Hardware errors are projected to increase in modern computer systems due to shrinking feature sizes ...
Aggressive scaling of CMOS transistors has enabled extensive system integration and building faster ...
Unpredictable hardware faults and software bugs lead to application crashes, incorrect computations,...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
There is broad consensus among academic and industrial researchers in computer architecture that har...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
Abstract. Intermittent failures and nondeterministic behavior complicate and compromise the effectiv...