102 p. Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2001. A key problem besetting distributed applications is how to provide them with reliability guarantees when they run on off-the-shelf hardware and software components. Chameleon is a Software Implemented Fault Tolerance (SIFT) middleware that provides adaptive fault tolerance in a COTS (components-off-the-shelf) environment and can adapt to changing runtime and application requirements. The thesis presents the architecture and implementation of a hierarchy of error detection techniques that can be applied in a distributed SIFT environment. The error detection framework is implemented and demonstrated on the Chameleon testbed, th...
Distributed systems form an integral part of human life—from ATMs to the Domain Name Service. Typica...
We consider issues of fault tolerance for distributed computing systems at two levels of system desi...
Many current approaches to software-implemented fault tolerance (SIFT) rely on process replication, ...
This paper proposes a hierarchical error detection framework for a Software Implemented Fault Tolera...
This paper presents a new error detection technique called software implemented error detection (SIE...
A general framework for the design and analysis of distributed fault-tolerant systems is proposed in...
This thesis deals with principles and techniques of fault tolerance for distributed embedded systems...
Software is being used for building applications requiring extreme dependability. In many cases, sys...
This paper presents the performance evaluation of a software fault manager for distributed applicati...
Computing grids consist of a large-scale, highly-distributed hardware architecture, often built in a...
Clusters of message-passing computing nodes provide high-performance platforms for distributed appli...
Failures in computing systems are unavoidable. Therefore, it is important to detect and diagnose fai...
Systems that operate in extremely volatile environments, such as orbiting satellites, must be design...