This thesis examines memory management and rollback recovery in parallel architectures. Three memory management strategies for rapid rollback recovery are studied in this thesis. The first is a cache-based error recovery scheme for shared memory multiprocessors. The second is a design for recoverable distributed shared virtual memory. The third is a twin-page disk technique for incremental checkpointing and rapid database transaction undo recovery.In Chapter 2, a user-transparent checkpointing and recovery scheme using private caches is developed for recovering from processor transient faults. New implementation techniques using checkpoint identifiers and recovery stacks are examined as a means to reducing performance degradation. This mult...
Checkpointing in a distributed system is essential for recovery to a globally consistent state after...
Checkpointing techniques in parallel systems use dependency tracking and/or message logging to ensur...
A split-cache memory architecture is presented which provides efficient architectural support for ch...
This thesis examines memory management and rollback recovery in parallel architectures. Three memory...
This study explores a recovery strategy using checkpointing in a distributed shared virtual memory (...
Large-scale distributed systems are very attractive for the execution of parallel applications requi...
In this paper, we describe new protocols augmenting traditional cache coherency mechanisms to implem...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
Most recovery schemes that have been proposed for Distributed Shared Memory (DSM) systems require un...
This paper presents ReVive, a novel general-purpose rollback recovery mechanism for shared-memory mu...
Memory system design is important for providing high reliability and availability. This dissertation...
Distributed shared memory (DSM) implemented on a cluster of workstations is an increasingly attracti...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
In this paper we present recovery techniques for distributed main-memory databases, specically for c...
In this paper we present recovery techniques for distributed main-memory databases, specifically for...
Checkpointing in a distributed system is essential for recovery to a globally consistent state after...
Checkpointing techniques in parallel systems use dependency tracking and/or message logging to ensur...
A split-cache memory architecture is presented which provides efficient architectural support for ch...
This thesis examines memory management and rollback recovery in parallel architectures. Three memory...
This study explores a recovery strategy using checkpointing in a distributed shared virtual memory (...
Large-scale distributed systems are very attractive for the execution of parallel applications requi...
In this paper, we describe new protocols augmenting traditional cache coherency mechanisms to implem...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
Most recovery schemes that have been proposed for Distributed Shared Memory (DSM) systems require un...
This paper presents ReVive, a novel general-purpose rollback recovery mechanism for shared-memory mu...
Memory system design is important for providing high reliability and availability. This dissertation...
Distributed shared memory (DSM) implemented on a cluster of workstations is an increasingly attracti...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
In this paper we present recovery techniques for distributed main-memory databases, specically for c...
In this paper we present recovery techniques for distributed main-memory databases, specifically for...
Checkpointing in a distributed system is essential for recovery to a globally consistent state after...
Checkpointing techniques in parallel systems use dependency tracking and/or message logging to ensur...
A split-cache memory architecture is presented which provides efficient architectural support for ch...