This paper presents two control-flow error (CFE) recovery techniques: CFE Recovery using Data-flow graph Consideration and CFE Recovery using Macro block-level Check pointing. Both are designed with regard to thread interactions in programs, and both aim to moderate the high memory and performance overheads of conventional control-flow checking techniques. Each technique comprises two phases: control-flow error detection and recovery. Both phases are implemented by inserting additional instructions into the program at compile time, guided by a dependency graph extracted from the control-flow and data-flow dependencies among basic blocks and from the thread interactions in the program. In order to evaluate...
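The two-phase idea described above (detect an illegal control-flow transfer, then recover) can be illustrated with a small sketch. This is not the paper's implementation: it is a single-threaded toy that assumes a checkpoint is taken at the entry of every basic block and that detection is a simple check of each transfer against the legal edges of a control-flow graph. All names (`CFG`, `BLOCKS`, `run_with_recovery`, `make_one_shot_fault`) are hypothetical and chosen for illustration only.

```python
import copy

# Hypothetical control-flow graph: block id -> set of legal successor ids.
CFG = {"B0": {"B1"}, "B1": {"B2", "B3"}, "B2": {"B3"}, "B3": set()}

def b0(state):
    state["x"] = 1
    return "B1"

def b1(state):
    state["x"] += 1
    return "B2" if state["x"] % 2 == 0 else "B3"

def b2(state):
    state["x"] *= 10
    return "B3"

def b3(state):
    return None  # program exit

BLOCKS = {"B0": b0, "B1": b1, "B2": b2, "B3": b3}

def make_one_shot_fault(at_block, wrong_target):
    """Simulate a transient fault: corrupt one branch target, exactly once."""
    fired = [False]
    def corrupt(current, target):
        if current == at_block and not fired[0]:
            fired[0] = True
            return wrong_target
        return target
    return corrupt

def run_with_recovery(blocks, cfg, entry, corrupt=None, max_retries=3):
    """Run blocks from `entry`; detect illegal transfers and roll back."""
    state, current, recoveries = {}, entry, 0
    while current is not None:
        checkpoint = copy.deepcopy(state)       # checkpoint at block entry
        target = blocks[current](state)         # execute the block
        if corrupt is not None:
            target = corrupt(current, target)   # possible injected fault
        # Detection phase: the transfer must follow an edge of the CFG
        # (or be a program exit from a block that has no successors).
        legal = (target in cfg[current]) or (target is None and not cfg[current])
        if not legal:
            # Recovery phase: restore the checkpoint, re-execute the block.
            recoveries += 1
            if recoveries > max_retries:
                raise RuntimeError("unrecoverable control-flow error")
            state.clear()
            state.update(checkpoint)
            continue
        current = target
    return state, recoveries

clean_state, clean_recoveries = run_with_recovery(BLOCKS, CFG, "B0")
fault = make_one_shot_fault("B1", "B0")  # B1 -> B0 is not an edge of CFG
faulty_state, faulty_recoveries = run_with_recovery(BLOCKS, CFG, "B0", corrupt=fault)
```

In this sketch the faulty run ends in the same state as the fault-free run because the block's side effects are undone before it is re-executed; without the rollback, `B1` would increment `x` twice. Checkpointing every block is the simplest choice and also the most expensive one, which is the kind of overhead that coarser, macro-block-level checkpointing is meant to reduce.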
Software-based methods for the detection of control-flow errors caused by transient faults usually co...
We propose a scheme for transient-fault recovery called Simultaneously and Redundantly Threaded proc...
This study focuses on how to confine error recovery to the immediate environment of a failed computa...
This paper presents a software-based technique to mitigate Control-flow Errors (CFEs) in mult...
In this paper, a software behavior-based technique is presented to detect control-flow errors in mul...
Shrinking microprocessor feature size and growing transistor density may increase the soft-error rat...
This work presents a new Dual-Core LockStep approach to enhance fault tolerance in microprocessors. ...
The improvement of dependability in computing systems requires the evaluation of fault tolerance mec...
In modern safety-critical embedded systems reliability and performance are two important criteria. I...
This thesis addresses three important steps in the selection of error detection mechanisms for micro...
This paper evaluates the concurrent error detection capabilities of system-level checks, us...
Despite the intense efforts to prevent programmers from writing code with memory errors, memory corr...
This paper describes a general technique to identify control flow errors in parallel programs, which...
© 2016 ACM. Relentless technology scaling has made transistors more vulnerable to soft, or transient...
Faults are commonplace and inevitable in complex applications. Hence, automated techniques are nece...