This study explores a recovery strategy using checkpointing in a distributed shared virtual memory (DVM) system. DVM shares virtual memory in a loosely-coupled multi-computer system and is implemented at the software-level. The goal of this recovery strategy is to obtain a consistent recovery line that is close to the time of failure. Therefore the system could be rolled back from the time of failure to the closest possible state of normal execution.In order to achieve the objective, this thesis proposes a checkpointing strategy that utilizes virtual memory (VM) as transient checkpoint storage in addition to commonly-used stable storage. In controllable checkpoint intervals, these additional checkpoints make checkpoint intervals shorter; in...
Relaxed memory consistency models tolerate increased memory access latency in both hardware and soft...
Checkpointing, i.e., recording the volatile state of a virtual machine (VM) running as a guest in a ...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
This study explores a recovery strategy using checkpointing in a distributed shared virtual memory (...
This thesis examines memory management and rollback recovery in parallel architectures. Three memory...
Memory system design is important for providing high reliability and availability. This dissertation...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
Large-scale distributed systems are very attractive for the execution of parallel applications requi...
We consider the problem of bringing a distributed system to a consistent state after transient fail...
This paper proposes an approach for adding fault tolerance, based on consistent checkpointing, to di...
Transparent hypervisor-level checkpoint-restart mechanisms for virtual clusters (VCs) or clusters of...
Checkpointing in a distributed system is essential for recovery to a globally consistent state after...
Checkpointing techniques in parallel systems use dependency tracking and/or message logging to ensur...
. The distributed shared memory(DSM) system transforms an existing network of workstations to a powe...
Most recovery schemes that have been proposed for Distributed Shared Memory (DSM) systems require un...
Relaxed memory consistency models tolerate increased memory access latency in both hardware and soft...
Checkpointing, i.e., recording the volatile state of a virtual machine (VM) running as a guest in a ...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
This study explores a recovery strategy using checkpointing in a distributed shared virtual memory (...
This thesis examines memory management and rollback recovery in parallel architectures. Three memory...
Memory system design is important for providing high reliability and availability. This dissertation...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
Large-scale distributed systems are very attractive for the execution of parallel applications requi...
We consider the problem of bringing a distributed system to a consistent state after transient fail...
This paper proposes an approach for adding fault tolerance, based on consistent checkpointing, to di...
Transparent hypervisor-level checkpoint-restart mechanisms for virtual clusters (VCs) or clusters of...
Checkpointing in a distributed system is essential for recovery to a globally consistent state after...
Checkpointing techniques in parallel systems use dependency tracking and/or message logging to ensur...
. The distributed shared memory(DSM) system transforms an existing network of workstations to a powe...
Most recovery schemes that have been proposed for Distributed Shared Memory (DSM) systems require un...
Relaxed memory consistency models tolerate increased memory access latency in both hardware and soft...
Checkpointing, i.e., recording the volatile state of a virtual machine (VM) running as a guest in a ...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...