It is notoriously hard to develop dependable distributed systems. This is partly due to the difficulties in foreseeing various corner cases and failure scenarios while implementing a system that will be deployed over an asynchronous network. In contrast, reasoning about the desired distributed system behavior and the corresponding invariants is easier than reasoning about the code itself. Further, the invariants can be used for testing, theorem proving, and runtime enforcement. In this paper, we propose an approach to observe the system behavior and automatically infer invariants which reveal implementation bugs. Using our tool, Avenger, we automatically generate a large number of potentially relevant properties, check them within the time ...
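The idea of observing system behavior and inferring invariants from it can be illustrated with a minimal sketch. It is not Avenger's algorithm, which the abstract does not detail; it only assumes a common scheme in which simple candidate properties are enumerated and any candidate violated by an observed state is discarded. The trace, the variable names `appended` and `committed`, and the candidate templates are all hypothetical.

```python
from itertools import combinations
from typing import Callable, Dict, List, Tuple

State = Dict[str, int]  # one observed snapshot: variable name -> value
Candidate = Tuple[str, Callable[[State], bool]]


def candidate_invariants(variables: List[str]) -> List[Candidate]:
    """Enumerate simple candidate properties over the observed variables."""
    candidates: List[Candidate] = []
    for v in variables:
        candidates.append((f"{v} >= 0", lambda s, v=v: s[v] >= 0))
    for a, b in combinations(variables, 2):
        candidates.append((f"{a} <= {b}", lambda s, a=a, b=b: s[a] <= s[b]))
        candidates.append((f"{a} >= {b}", lambda s, a=a, b=b: s[a] >= s[b]))
        candidates.append((f"{a} == {b}", lambda s, a=a, b=b: s[a] == s[b]))
    return candidates


def infer_likely_invariants(trace: List[State]) -> List[str]:
    """Keep only the candidates that hold in every observed state."""
    if not trace:
        return []
    variables = sorted(trace[0])
    return [name for name, check in candidate_invariants(variables)
            if all(check(state) for state in trace)]


# Hypothetical trace of a replicated log: entries appended vs. entries committed.
trace = [
    {"appended": 0, "committed": 0},
    {"appended": 3, "committed": 1},
    {"appended": 5, "committed": 5},
]
likely = infer_likely_invariants(trace)
print(likely)
```

Properties that survive are only likely invariants: they hold on the observed executions and may still fail on behavior that was never exercised, so a surviving property is a candidate for further checking, not a proof.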
We consider the problem of provably verifying that an asynchronous message-passing system satisfies...
This paper describes a method for automated analysis of fault-tolerance properties of distributed sy...
Observation of global properties of a distributed program is required in many applications such as d...
System monitoring can help to detect abnormalities and avoid failures. Crafting monitors for today’s...
Robust distributed systems commonly employ high-level recovery mechanisms enabling the system to re...
We propose a new approach for developing and deploying distributed systems, in which nodes predict d...
Distributed systems are difficult to debug and understand. A key reason for this is distributed sta...
In today’s distributed information systems, a large amount of monitoring data such as log f...
Failures in computing systems are unavoidable. Therefore, it is important to detect and diagnose fai...
Invariants are stable relationships among system metrics expected to hold during normal operating co...
Developing correct and efficient software for large scale systems is a challenging task. Developers ...
When evaluated to true, a stable property remains true forever. Such a stable property may cha...
Today's software systems often have poor reliability. In addition to losses of billions, software de...