This report describes an approach enabling automatic structural reconfigurations of distributed applications based on configuration management in order to compensate for node and network failures. The major goal of the approach is to maintain the relevant application functionality after failures automatically.This goalis achieved by a dedicated system model and by a decentralized reconfiguration algorithm based on it. The system model provides support for redundant application object storage and for application-level consistency based on distributed checkpoints. The reconfiguration algorithm detects failures, computes a compensating configuration, and realizes this new configuration. ...
In this paper, we present a holistic approach to provide fault tolerance for distributed reactive pr...
This paper introduces a network fault model for distributed applications developed with the Mozart p...
Migration is a powerful technique in distributed systems providing many benefits. The granularity o...
We present a new software architecture in which all concepts necessary to achieve fault tolerance ca...
International audienceDistributed computing infrastructures support system and network fault-toleran...
This paper presents a novel architectural approach to support fault tolerance in component-based dis...
This work deals with high performance computing on large scale platforms like computing grids. Compu...
This thesis addresses issues in building fault-tolerant distributed real-time systems. Such systems ...
The increasing failure rate in High Performance Computing encourages the investigation of fault tole...
Fault tolerance is essential to any soft real-time distributed system; besides correctness and timel...
Ubiquitous connectivity of web, mobile, and IoT computing platforms has fostered a variety of distri...
An important emerging requirement for distributed systems is adaptive QoS, which necessitates more f...
AbstractThe increasing failure rate in High Performance Computing encourages the investigation of fa...
This paper presents a method to design reliable applications composed of communicating entities. Thi...
Clusters of message-passing computing nodes provide high-performance platforms for distributed appli...
In this paper, we present a holistic approach to provide fault tolerance for distributed reactive pr...
This paper introduces a network fault model for distributed applications developed with the Mozart p...
Migration is a powerful technique in distributed systems providing many benefits. The granularity o...
We present a new software architecture in which all concepts necessary to achieve fault tolerance ca...
International audienceDistributed computing infrastructures support system and network fault-toleran...
This paper presents a novel architectural approach to support fault tolerance in component-based dis...
This work deals with high performance computing on large scale platforms like computing grids. Compu...
This thesis addresses issues in building fault-tolerant distributed real-time systems. Such systems ...
The increasing failure rate in High Performance Computing encourages the investigation of fault tole...
Fault tolerance is essential to any soft real-time distributed system; besides correctness and timel...
Ubiquitous connectivity of web, mobile, and IoT computing platforms has fostered a variety of distri...
An important emerging requirement for distributed systems is adaptive QoS, which necessitates more f...
AbstractThe increasing failure rate in High Performance Computing encourages the investigation of fa...
This paper presents a method to design reliable applications composed of communicating entities. Thi...
Clusters of message-passing computing nodes provide high-performance platforms for distributed appli...
In this paper, we present a holistic approach to provide fault tolerance for distributed reactive pr...
This paper introduces a network fault model for distributed applications developed with the Mozart p...
Migration is a powerful technique in distributed systems providing many benefits. The granularity o...