A new ABFT architecture is proposed to tolerate multiple soft-errors with low overheads. It memorizes operands on a stack upon error detection and corrects errors by recomputing. This allows uninterrupted input data streams to be processed without data loss. 1: Introduction Several methods based on physical, electrical or logical principles have been proposed to reduce the susceptibility to error in digital systems. A system-level method, called Algorithmic-Based Fault Tolerance (ABFT), was originally proposed to obtain fault-tolerant processor array systems [1]. ABFT is a concurrent error detection/correction technique with relatively low hardware and time redundancy. This method operates on encoded input data sets and produces encoded ou...
Abstract In this brief an approach is proposed to achieve energy savings from reduced voltage opera...
As chip technology keeps on shrinking towards higher densities and lower operating vol- tages, memo...
With few exceptions, the two issues of algorithm design and fault tolerance for processor arrays hav...
An important consideration in the design of high performance multiprocessor systems is to ensure the...
In Algorithm-based fault tolerance (ABFT), fault tolerance is tailored to the algorithm performed. M...
In Algorithm-based fault tolerance (ABFT), fault tolerance is tailored to the algorithm performed. M...
A reduction in the minimum attainable feature size in integrated circuits h.s lead to the possibilit...
We present a new approach to fault tolerance for High Performance Computing system. Our approach is ...
Abstract- The rapid progress in VLSI technology has reduced the cost of hardware, allowing multiple ...
There is broad consensus among academic and industrial researchers in computer architecture that har...
Soft error tolerance is a hot research topic for modern microprocessors. We have been investigating ...
As late-CMOS process scaling leads to increasingly variable circuits/logic and as most post-CMOS tec...
Emerging high-performance computing platforms, with large component counts and lower power margins, ...
This paper speculates that technology trends pose new challenges for fault tolerance in microprocess...
Extensive researches have been done on developing and optimizing algorithm-based fault tolerance (AB...
Abstract In this brief an approach is proposed to achieve energy savings from reduced voltage opera...
As chip technology keeps on shrinking towards higher densities and lower operating vol- tages, memo...
With few exceptions, the two issues of algorithm design and fault tolerance for processor arrays hav...
An important consideration in the design of high performance multiprocessor systems is to ensure the...
In Algorithm-based fault tolerance (ABFT), fault tolerance is tailored to the algorithm performed. M...
In Algorithm-based fault tolerance (ABFT), fault tolerance is tailored to the algorithm performed. M...
A reduction in the minimum attainable feature size in integrated circuits h.s lead to the possibilit...
We present a new approach to fault tolerance for High Performance Computing system. Our approach is ...
Abstract- The rapid progress in VLSI technology has reduced the cost of hardware, allowing multiple ...
There is broad consensus among academic and industrial researchers in computer architecture that har...
Soft error tolerance is a hot research topic for modern microprocessors. We have been investigating ...
As late-CMOS process scaling leads to increasingly variable circuits/logic and as most post-CMOS tec...
Emerging high-performance computing platforms, with large component counts and lower power margins, ...
This paper speculates that technology trends pose new challenges for fault tolerance in microprocess...
Extensive researches have been done on developing and optimizing algorithm-based fault tolerance (AB...
Abstract In this brief an approach is proposed to achieve energy savings from reduced voltage opera...
As chip technology keeps on shrinking towards higher densities and lower operating vol- tages, memo...
With few exceptions, the two issues of algorithm design and fault tolerance for processor arrays hav...