Reliability is emerging as an important design criterion in modern systems due to increasing transient fault rates. Hardware fault-tolerance techniques, commonly used to address this, introduce high design costs. As an alternative, software Signature-Monitoring (SM) schemes based on compiler assertions are an efficient method for control-flow-error detection. Existing SM techniques do not consider application-specific information, which causes unnecessary overheads. In this paper, compile-time Control-Flow-Graph (CFG) topology analysis is used to place the best-suited assertions at optimal locations in the assembly code to reduce overheads. Our evaluation with representative workloads shows an increase in fault coverage with overheads close to Assertion-base...
This paper describes a general technique to identify control flow errors in parallel programs, which...
As modern supercomputing systems reach the peta-flop performance range, they grow in both ...
As machines increase in scale, it is predicted that failure rates of supercomputers will correspondi...
Due to harsher working environments, soft errors or erroneous bit-flips occur more frequently in mic...
A variety of applications have arisen where it is worthwhile to apply code optimizations directly to...
ISBN: 4930813670. This paper addresses the detection of permanent and transient faults in complex VLSI...
This paper evaluates the concurrent error detection capabilities of system-level checks, us...
Soft-error detection in FPGAs typically requires replication, doubling the required area. We propose...
Software-based methods for the detection of control-flow errors caused by transient faults usually co...
Software-based fault tolerance techniques are a low-cost way to protect processors against soft erro...
This paper presents a technique to derive and implement error detectors to protect an application fr...
A common requirement of embedded software in charge of safety tasks is to guarantee the identificati...
The recent increase in transient fault rates has made processor reliability a major concern. Moreover, pe...
Optimization outside of traditional frameworks is emerging as a new research focus in the compiler c...