A distributed system is a set of autonomous computation units endowed with communication abilities in order to solve a global task. This model is general enough to describe any kind of network (LAN, sensor network, ...). When a distributed system grows larger or is deployed in hazardous environments, the possibility that some of its elements are subject to faults (failure, memory corruption, hacking, ...) becomes impossible to ignore. Faults can be classified according to duration, span, or nature. In this thesis, we focus on distributed systems that simultaneously tolerate several kinds of faults, using three classical problems as case studies. We first present a distributed protocol simulating a...
We consider issues of fault tolerance for distributed computing systems at two levels of system desi...
This thesis concentrates on the design of new algorithms for fault-tolerant systems based on system-...
Unreliable fault detectors can be used to solve the consensus problem in asynchronous distributed sy...
When the desired reliability of a computing system exceeds that of its individual hardware componen...
This book covers the most essential techniques for designing and building dependable distributed sys...
This book presents the most important fault-tolerant distributed programming a...
Distributed systems ranging from small local area networks to large wide area networks like the Inte...
A Byzantine fault is an arbitrary behavior on the part of a hardware component, a software module or...
Developing fault-tolerant distributed protocols is a difficult task. The difficulty of this task in...
Reliable communication is a fundamental primitive in distributed systems prone to Byzantine (i.e. ar...
Designing a distributed fault tolerance algorithm requires careful analysis of both fault models and...
This thesis addresses issues in building fault-tolerant distributed real-time systems. Such systems ...
PhD Thesis. Many fault-tolerant group communication middleware systems have been implemented assuming ...