Protocols to implement a fault-tolerant computing system are described. These protocols augment the hypervisor of a virtual machine manager to coordinate a primary virtual ma-chine and its backup. The result is a fault-tolerant computing system that does not require modifying the hardware, operating system, or applications programs. A prototype system was constructed for HP’s PA-RISC instruction-set architecture. Using this prototype, engi-neering issues and performance implications of the approach were explored
Massively parallel computers, using thousands of processors, will be the future trend for producing ...
Due to the character of the original source materials and the nature of batch digitization, quality ...
Virtualization is a key piece of modern data center design. Virtualization provides the possibility ...
Protocols to implement a fault-tolerant computing system are described. These protocols augment the ...
We have implemented a commercial enterprise-grade system for providing fault-tolerant virtual machin...
Abstract. Large-scale computing platforms provide tremendous capabilities for scientific discovery. ...
Abstract- In this work, we present the design of the Checkpointing-Enabled Virtual Machine (CEVM) ar...
This paper offers an introduction to a research effort in fault tolerant computer architecture whic...
Hypervisor-based fault tolerance (HBFT), which synchronizes the state between the primary VM and the...
Virtualization is often used as a tool for resource consolidation in the server market. Virtualizati...
Large-scale parallel computing is relying increasingly on clusters with thousands of processors. At ...
Although rare in absolute terms, undetected CPU, memory, and disk errors occur often enough at datac...
Multiprocessor systems which afford a high degree of\ud parallelism are used in a variety of applica...
To familiarize the reader with the field of fault tolerance, this report discusses the most importan...
Many organizations are moving their systems to the cloud, where providers consolidate multiple clie...
Massively parallel computers, using thousands of processors, will be the future trend for producing ...
Due to the character of the original source materials and the nature of batch digitization, quality ...
Virtualization is a key piece of modern data center design. Virtualization provides the possibility ...
Protocols to implement a fault-tolerant computing system are described. These protocols augment the ...
We have implemented a commercial enterprise-grade system for providing fault-tolerant virtual machin...
Abstract. Large-scale computing platforms provide tremendous capabilities for scientific discovery. ...
Abstract- In this work, we present the design of the Checkpointing-Enabled Virtual Machine (CEVM) ar...
This paper offers an introduction to a research effort in fault tolerant computer architecture whic...
Hypervisor-based fault tolerance (HBFT), which synchronizes the state between the primary VM and the...
Virtualization is often used as a tool for resource consolidation in the server market. Virtualizati...
Large-scale parallel computing is relying increasingly on clusters with thousands of processors. At ...
Although rare in absolute terms, undetected CPU, memory, and disk errors occur often enough at datac...
Multiprocessor systems which afford a high degree of\ud parallelism are used in a variety of applica...
To familiarize the reader with the field of fault tolerance, this report discusses the most importan...
Many organizations are moving their systems to the cloud, where providers consolidate multiple clie...
Massively parallel computers, using thousands of processors, will be the future trend for producing ...
Due to the character of the original source materials and the nature of batch digitization, quality ...
Virtualization is a key piece of modern data center design. Virtualization provides the possibility ...