Rollback-recovery in distributed systems is important for fault-tolerant computing. Without fault tolerance mechanisms, an application running on a system has to be restarted from scratch if a fault happens in the middle of its execution, resulting in loss of useful computation. To provide efficient rollback-recovery for fault-tolerance in distributed systems, it is significant to reduce the number of checkpoints under the existence of consistent global checkpoints in index-based distributed checkpointing algorithms. Because of the dependencies among the processes states that induced by inter-process communication in distributed systems, asynchronous checkpointing may suffer from the domino effect. Therefore, a consistent global checkpoint ...
Checkpoint and recovery protocols are commonly used in distributed applications for providing fault ...
Consistent checkpointing provides transparent fault tol erance for longrunning distributed applica...
Mobile Distributed Systems (MDS) are susceptible to faults. It is not easy to predict whether the sy...
A distributed system is composed of multiple independent machines that communicate using messages. F...
This paper presents an index-based checkpointing algorithm for distributed systems with the aim of r...
Checkpointing is a very well known mechanism to achieve fault tolerance. In distributed applications...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
A transaction-consistent global checkpoint of a database records a state of the database which refle...
Checkpointing is a very well known mechanism to achieve fault tolerance. In distributed applications...
As we move to large manycores, the hardware-based global checkpointing schemes that have been propo...
Abstract:- Checkpoint is defined as a designated place in a program at which normal processing is in...
Due to the character of the original source materials and the nature of batch digitization, quality ...
Communication-induced checkpointing protocols that ensure rollback-dependency trackability (RDT) gua...
2003-2004 > Academic research: refereed > Refereed conference paperVersion of RecordPublishe
This article proposes an original approach that applies the Rollback-Dependency Trackability (RDT) p...
Checkpoint and recovery protocols are commonly used in distributed applications for providing fault ...
Consistent checkpointing provides transparent fault tol erance for longrunning distributed applica...
Mobile Distributed Systems (MDS) are susceptible to faults. It is not easy to predict whether the sy...
A distributed system is composed of multiple independent machines that communicate using messages. F...
This paper presents an index-based checkpointing algorithm for distributed systems with the aim of r...
Checkpointing is a very well known mechanism to achieve fault tolerance. In distributed applications...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
A transaction-consistent global checkpoint of a database records a state of the database which refle...
Checkpointing is a very well known mechanism to achieve fault tolerance. In distributed applications...
As we move to large manycores, the hardware-based global checkpointing schemes that have been propo...
Abstract:- Checkpoint is defined as a designated place in a program at which normal processing is in...
Due to the character of the original source materials and the nature of batch digitization, quality ...
Communication-induced checkpointing protocols that ensure rollback-dependency trackability (RDT) gua...
2003-2004 > Academic research: refereed > Refereed conference paperVersion of RecordPublishe
This article proposes an original approach that applies the Rollback-Dependency Trackability (RDT) p...
Checkpoint and recovery protocols are commonly used in distributed applications for providing fault ...
Consistent checkpointing provides transparent fault tol erance for longrunning distributed applica...
Mobile Distributed Systems (MDS) are susceptible to faults. It is not easy to predict whether the sy...