A distributed system is a collection of nodes connected by a network, an ideal platform to provide high reliable computing due to the redundancy supplied by a great number of nodes. Node faults and network connection faults can be masked reconfiguring the system. However, sequential faults, that affect multiple nodes can decrease the performance of the system affecting the system reliability and availability. To avoid this, failed nodes should be reintegrated as soon as possible. This paper details the problem of reintegration of failed nodes in a replicated UNIX file system. We built a prototype with the recovery protocols required by the reintegration procedure
This report compares two strategies for crash fault tolerance of nodes in distributed systems: activ...
A new method for redo logging and recovery is described. It is designed to work in a data sharing sy...
Abstract — The paper1 focuses on distributed file systems in P2P networks. We introduce a novel file...
Sistemas distribuídos representam uma plataforma ideal para implementação de sistemas computacionais...
Replicated systems are a kind of distributed systems whose main goal is to ensure that computer sys...
Nixdorf Computer The initial design for a distributed, fault-tolerant version of UNIX based on three...
Periodically, researchers have been sharing their constant attempts to improve the existing methods ...
This thesis studies the problem of file replication in distributed systems. File replication is desi...
AbstractFailure recovery is a nontrivial property for current distributed systems. An autonomous fai...
Networked computer systems are prevalent in most aspects of modern society, and we have become depen...
Distributed systems provide the opportunity for fault tolerance through replication. This dissertati...
[[abstract]]In this paper, we propose a new fault-tolerant model for replication in distributed-file...
The problem of recovery in large-scale transaction-based distributed systems with replicated data is...
Due to the use of commodity software and hardware, crash-stop and Byzantine failures are likely to b...
Traditionally, fault-tolerant systems assume that failures are independent, often expressed as a thr...
This report compares two strategies for crash fault tolerance of nodes in distributed systems: activ...
A new method for redo logging and recovery is described. It is designed to work in a data sharing sy...
Abstract — The paper1 focuses on distributed file systems in P2P networks. We introduce a novel file...
Sistemas distribuídos representam uma plataforma ideal para implementação de sistemas computacionais...
Replicated systems are a kind of distributed systems whose main goal is to ensure that computer sys...
Nixdorf Computer The initial design for a distributed, fault-tolerant version of UNIX based on three...
Periodically, researchers have been sharing their constant attempts to improve the existing methods ...
This thesis studies the problem of file replication in distributed systems. File replication is desi...
AbstractFailure recovery is a nontrivial property for current distributed systems. An autonomous fai...
Networked computer systems are prevalent in most aspects of modern society, and we have become depen...
Distributed systems provide the opportunity for fault tolerance through replication. This dissertati...
[[abstract]]In this paper, we propose a new fault-tolerant model for replication in distributed-file...
The problem of recovery in large-scale transaction-based distributed systems with replicated data is...
Due to the use of commodity software and hardware, crash-stop and Byzantine failures are likely to b...
Traditionally, fault-tolerant systems assume that failures are independent, often expressed as a thr...
This report compares two strategies for crash fault tolerance of nodes in distributed systems: activ...
A new method for redo logging and recovery is described. It is designed to work in a data sharing sy...
Abstract — The paper1 focuses on distributed file systems in P2P networks. We introduce a novel file...