This thesis deals with principles and techniques of fault tolerance for distributed embedded systems. A layered approach is taken to achieve high dependability by structuring error detection and recovery mechanisms into three layers. The first layer consists of mechanisms implemented in hardware, either at the circuit or the micro-architectural level. Many integrated circuits, especially microprocessors, are provided with such mechanisms in order to mask transient hardware faults and to detect permanent ones. To prevent software faults and hardware faults not captured at the hardware layer from causing node failures, it is desirable to introduce node-layer mechanisms. While they may depend on hardware support such as memory protection, they...
AbstractThe increasing failure rate in High Performance Computing encourages the investigation of fa...
Clusters of message-passing computing nodes provide high-performance platforms for distributed appli...
Software Implementation of Multi-Processor Fault Tolerance for Real-Time processing is addressed in ...
This thesis addresses issues in building fault-tolerant distributed real-time systems. Such systems ...
This thesis deals with cost-effective design and validation of fault tolerant distributed real-time ...
Due to the character of the original source materials and the nature of batch digitization, quality ...
A general framework for the design and analysis of distributed fault-tolerant systems is proposed in...
Critical real-time embedded systems need to apply fault tolerance strategies to deal with operation ...
We present a new software architecture in which all concepts necessary to achieve fault tolerance ca...
This paper presents an experimental dependability evaluation of a small real-time kernel called Artk...
Distributed computers systems are increasingly being embedded in complex products such as automobile...
This paper proposes a membership protocol for fault-tolerant distributed systems and describes the u...
The increasing failure rate in High Performance Computing encourages the investigation of fault tole...
Critical embedded systems need a dependable operating system and application. Despite all efforts to...
Real-time embedded systems for safety-critical applications have to introduce fault tolerance mechan...
AbstractThe increasing failure rate in High Performance Computing encourages the investigation of fa...
Clusters of message-passing computing nodes provide high-performance platforms for distributed appli...
Software Implementation of Multi-Processor Fault Tolerance for Real-Time processing is addressed in ...
This thesis addresses issues in building fault-tolerant distributed real-time systems. Such systems ...
This thesis deals with cost-effective design and validation of fault tolerant distributed real-time ...
Due to the character of the original source materials and the nature of batch digitization, quality ...
A general framework for the design and analysis of distributed fault-tolerant systems is proposed in...
Critical real-time embedded systems need to apply fault tolerance strategies to deal with operation ...
We present a new software architecture in which all concepts necessary to achieve fault tolerance ca...
This paper presents an experimental dependability evaluation of a small real-time kernel called Artk...
Distributed computers systems are increasingly being embedded in complex products such as automobile...
This paper proposes a membership protocol for fault-tolerant distributed systems and describes the u...
The increasing failure rate in High Performance Computing encourages the investigation of fault tole...
Critical embedded systems need a dependable operating system and application. Despite all efforts to...
Real-time embedded systems for safety-critical applications have to introduce fault tolerance mechan...
AbstractThe increasing failure rate in High Performance Computing encourages the investigation of fa...
Clusters of message-passing computing nodes provide high-performance platforms for distributed appli...
Software Implementation of Multi-Processor Fault Tolerance for Real-Time processing is addressed in ...