With the increasing functionality and complexity of distributed systems, resource failures are inevitable. While numerous models and algorithms for dealing with failures exist, the lack of public trace data sets and tools has prevented meaningful comparisons. To facilitate the design, validation, and comparison of fault-tolerant models and algorithms, we have created the Failure Trace Archive (FTA) as an online public repository of availability traces taken from diverse parallel and distributed systems. Our main contributions in this study are the following. First, we describe the design of the archive, in particular the rationale of the standard FTA format, and the design of a toolbox that facilitates automated analysis of trace data sets....
Distributed software systems have become the backbone of Internet services. Failures in pro-duction ...
Abstract. Distributed systems such as grids, peer-to-peer systems, and even Internet DNS servers hav...
The level of trust on log-based dependability characterization of complex distributed systems, is bi...
With the increasing functionality and complexity of distributed systems, resource failures are inevi...
International audienceWith the increasing functionality and complexity of distributed systems, resou...
With the increasing presence, scale, and complexity of distributed systems, resource failures are be...
With the increasing presence, scale, and complexity of distributed systems, resource failures are be...
International audienceAbstract With the increasing presence, scale, and complexity of distributed sy...
With the increasing presence, scale, and complexity of distributed systems, resource failures are be...
Distributed systems such as grids, peer-to-peer systems, and even Internet DNS servers have grown si...
International audienceDistributed systems such as grids, peer-to-peer systems, and even Internet DNS...
Part 1: Full Research PapersInternational audienceEvery large multi-site infrastructure such as Grid...
Today’s distributed system infrastructures usually consist of multiple systems that cooperate to del...
Distributed software systems have become the backbone of Internet services. Failures in pro-duction ...
Abstract. Distributed systems such as grids, peer-to-peer systems, and even Internet DNS servers hav...
The level of trust on log-based dependability characterization of complex distributed systems, is bi...
With the increasing functionality and complexity of distributed systems, resource failures are inevi...
International audienceWith the increasing functionality and complexity of distributed systems, resou...
With the increasing presence, scale, and complexity of distributed systems, resource failures are be...
With the increasing presence, scale, and complexity of distributed systems, resource failures are be...
International audienceAbstract With the increasing presence, scale, and complexity of distributed sy...
With the increasing presence, scale, and complexity of distributed systems, resource failures are be...
Distributed systems such as grids, peer-to-peer systems, and even Internet DNS servers have grown si...
International audienceDistributed systems such as grids, peer-to-peer systems, and even Internet DNS...
Part 1: Full Research PapersInternational audienceEvery large multi-site infrastructure such as Grid...
Today’s distributed system infrastructures usually consist of multiple systems that cooperate to del...
Distributed software systems have become the backbone of Internet services. Failures in pro-duction ...
Abstract. Distributed systems such as grids, peer-to-peer systems, and even Internet DNS servers hav...
The level of trust on log-based dependability characterization of complex distributed systems, is bi...