A system is described for sharing a memory in a fault-tolerant computer. The memory is under the direct control and monitoring of error detecting and error diagnostic units in the fault-tolerant computer. This computer verifies that data to and from the memory is legally encoded and verifies that words read from the memory at a desired address are, in fact, actually delivered from that desired address. The means are provided for a second processor, which is independent of the direct control and monitoring of the error checking and diagnostic units of the fault-tolerant computer, and to share the memory of the fault-tolerant computer. Circuitry is included to verify that: (1) the processor has properly accessed a desired memory location in ...
Fault tolerant systems require the ability to detect and recover from physical damage caused by the ...
This thesis studies the relationship by creating a tool (FTAPE) that integrates a high stress worklo...
In this paper, we focus on the problem of recovering processor failures in shared memory multiproces...
A set of building block circuits is described which can be used with commercially available micropro...
A methodology for the design of a tightly coupled, highly reliable microprocessor based computer sys...
This thesis focuses on the issue of reliability and fault tolerance in Distributed Shared Memory Mul...
Fault-tolerant computing began between 1965 and 1970, probably with the highly reliable ...
Described in detail are: (1) results obtained in modifying the onboard data management system softwa...
We develop an availability solution, called SafetyNet, that uses a unified, lightweight checkpoint/r...
The concept of backward recovery is now well established as a means of restoring a consistent state ...
Fault tolerance in distributed shared memory through replication has yet to be explored. This resear...
[[abstract]]© 1997 Institute of Electrical and Electronics Engineers - Highly reliable interleaved m...
International audienceDue to the increasing number of their components, Scalable Shared Memory Multi...
This paper describes a new method for providing transparent fault tolerance for parallel application...
A fault-tolerant multiprocessor architecture is reported. This architecture, together with a compreh...
Fault tolerant systems require the ability to detect and recover from physical damage caused by the ...
This thesis studies the relationship by creating a tool (FTAPE) that integrates a high stress worklo...
In this paper, we focus on the problem of recovering processor failures in shared memory multiproces...
A set of building block circuits is described which can be used with commercially available micropro...
A methodology for the design of a tightly coupled, highly reliable microprocessor based computer sys...
This thesis focuses on the issue of reliability and fault tolerance in Distributed Shared Memory Mul...
Fault-tolerant computing began between 1965 and 1970, probably with the highly reliable ...
Described in detail are: (1) results obtained in modifying the onboard data management system softwa...
We develop an availability solution, called SafetyNet, that uses a unified, lightweight checkpoint/r...
The concept of backward recovery is now well established as a means of restoring a consistent state ...
Fault tolerance in distributed shared memory through replication has yet to be explored. This resear...
[[abstract]]© 1997 Institute of Electrical and Electronics Engineers - Highly reliable interleaved m...
International audienceDue to the increasing number of their components, Scalable Shared Memory Multi...
This paper describes a new method for providing transparent fault tolerance for parallel application...
A fault-tolerant multiprocessor architecture is reported. This architecture, together with a compreh...
Fault tolerant systems require the ability to detect and recover from physical damage caused by the ...
This thesis studies the relationship by creating a tool (FTAPE) that integrates a high stress worklo...
In this paper, we focus on the problem of recovering processor failures in shared memory multiproces...