This paper presents an approach towards specifying and verifying adaptive distributed systems. We here take fault-handling as an example of adaptive behavior and propose a modeling language Sandal for describing fault-prone message-passing systems. One of the unique mechanisms of the language is a linguistic support for abstracting typical faults such as unexpected termination of processes and random loss of messages. The Sandal compiler translates a model into a set of NuSMV modules. During the compilation process, faults specified in the model will be woven into the output. One can thus enjoy full-automatic exhaustive fault-injection without writing faulty behaviors explicitly. We demonstrate the advantage of the language by verifying a m...
To optimize performance in a fault-tolerant distributed system, it is often necessary to enforce dif...
Dependability is a qualitative term referring to a system's ability to meet its service requirements...
This paper describes a method for automated analysis of fault-tolerance properties of distributed sy...
. An adaptive computing system is one that modifies its behavior based on changes in the environment...
Fault-tolerance is an important requirement in distributed computing systems. However, designing ap...
International audienceThis book presents the most important fault-tolerant distributed programming a...
This dissertation focuses on the area of improving programming language support for constructing fau...
Motivated by the close relation between real-time and fault tolerance, we investigate the foundation...
Distributed programs are hard to get right because they are required to be open, scalable, long-runn...
Distributed systems can be subject to various kinds of partial failures, and building fault-toleranc...
Ubiquitous connectivity of web, mobile, and IoT computing platforms has fostered a variety of distri...
In this document, we discuss application requirements for fault tolerance in the framework of distri...
Fault-tolerant distributed algorithms play an important role in many critical/high-availability appl...
Distributed programs are hard to get right because they are required to be open, scalable, long-runn...
International audienceIn a network consisting of several thousands computers, the occurrence of faul...
To optimize performance in a fault-tolerant distributed system, it is often necessary to enforce dif...
Dependability is a qualitative term referring to a system's ability to meet its service requirements...
This paper describes a method for automated analysis of fault-tolerance properties of distributed sy...
. An adaptive computing system is one that modifies its behavior based on changes in the environment...
Fault-tolerance is an important requirement in distributed computing systems. However, designing ap...
International audienceThis book presents the most important fault-tolerant distributed programming a...
This dissertation focuses on the area of improving programming language support for constructing fau...
Motivated by the close relation between real-time and fault tolerance, we investigate the foundation...
Distributed programs are hard to get right because they are required to be open, scalable, long-runn...
Distributed systems can be subject to various kinds of partial failures, and building fault-toleranc...
Ubiquitous connectivity of web, mobile, and IoT computing platforms has fostered a variety of distri...
In this document, we discuss application requirements for fault tolerance in the framework of distri...
Fault-tolerant distributed algorithms play an important role in many critical/high-availability appl...
Distributed programs are hard to get right because they are required to be open, scalable, long-runn...
International audienceIn a network consisting of several thousands computers, the occurrence of faul...
To optimize performance in a fault-tolerant distributed system, it is often necessary to enforce dif...
Dependability is a qualitative term referring to a system's ability to meet its service requirements...
This paper describes a method for automated analysis of fault-tolerance properties of distributed sy...