A method for fault tolerance in concurrently executing computer programs is presented. The present invention controls the re-execution of concurrent programs in order to avoid a recurrence of synchronization failure. The invention (i) traces an execution, (ii) detects a synchronization failure, (iii) determines a control strategy, and (iv) re-executes under control. Control is achieved by tracing information during an execution and using this information to add synchronizations during the re-execution.Board of Regents, University of Texas Syste
Byzantine fault tolerance typically is achieved via state-machine replication, which requires the ex...
We present a formal approach to implement fault-tolerance in real-time embedded systems. The initial...
A method is presented for programming correct and efficient cooperation in a set of sequential modul...
A method for fault tolerance in concurrently executing computer programs is presented. The present i...
Abstract. Concurrent programs often encounter failures, such as races, owing to the presence of sync...
Faults in computer control systems cause great economic losses and endanger human beings. In order t...
Faults in computer control systems cause great economic losses and endanger human beings. In order t...
A new approach to software fault tolerance in concurrent programs modeled as reactive systems is pro...
A system architecture called the recovery metaprogram (RMP) is proposed. It separates the applicatio...
Software Development and Management Lab., Dept. of ComputingRefereed conference paper2001-2002 > Aca...
Debugging, which entails locating program faults responsible for a program failure, is more difficul...
The monitor concept provides a structured and flexible high-level programming construct to control c...
In this paper, we describe new protocols augmenting traditional cache coherency mechanisms to implem...
) Anish ARORA 1 Department of Computer Science The Ohio State University anish@cis.ohio-state.edu...
A new approach to software fault tolerance in concurrent programs modeled as reactive systems k prop...
Byzantine fault tolerance typically is achieved via state-machine replication, which requires the ex...
We present a formal approach to implement fault-tolerance in real-time embedded systems. The initial...
A method is presented for programming correct and efficient cooperation in a set of sequential modul...
A method for fault tolerance in concurrently executing computer programs is presented. The present i...
Abstract. Concurrent programs often encounter failures, such as races, owing to the presence of sync...
Faults in computer control systems cause great economic losses and endanger human beings. In order t...
Faults in computer control systems cause great economic losses and endanger human beings. In order t...
A new approach to software fault tolerance in concurrent programs modeled as reactive systems is pro...
A system architecture called the recovery metaprogram (RMP) is proposed. It separates the applicatio...
Software Development and Management Lab., Dept. of ComputingRefereed conference paper2001-2002 > Aca...
Debugging, which entails locating program faults responsible for a program failure, is more difficul...
The monitor concept provides a structured and flexible high-level programming construct to control c...
In this paper, we describe new protocols augmenting traditional cache coherency mechanisms to implem...
) Anish ARORA 1 Department of Computer Science The Ohio State University anish@cis.ohio-state.edu...
A new approach to software fault tolerance in concurrent programs modeled as reactive systems k prop...
Byzantine fault tolerance typically is achieved via state-machine replication, which requires the ex...
We present a formal approach to implement fault-tolerance in real-time embedded systems. The initial...
A method is presented for programming correct and efficient cooperation in a set of sequential modul...