This report describes an experiment in the design of a general purpose fault tolerant system, FTM. The main objective of the FTM design was to implement a "low-cost" fault tolerant system that could be used on standard workstations. At the operating system level, our goal was to provide a methodology for the design of modular reliable operating systems, while offering fault tolerance transparency to user applications. In other words, porting an application to FTM should require only recompiling the source code, without modifying it. These objectives were achieved using the Mach micro-kernel and a modular set of reliable servers which implement application checkpoints and provide continuous system functions despite machine crashes. At th...
Failing hardware is a fact and trends in microprocessor design indicate that the fraction of hardwar...
This paper offers an introduction to a research effort in fault tolerant computer architecture whic...
Fault Tolerance Middleware (FTM) provides a framework to run on a dedicated core of a multi-core sys...
Fault-tolerance has become an essential concern for processor designers due to increasing soft-error...
As processor manufacturers keep pushing the limits of the transistor, the reliability of computer sy...
We present a new approach for building fault-tolerant distributed systems based on distributed trans...
Experiences with computer systems indicate an inconvenient truth: computers fail and they fail i...
To meet an insatiable consumer demand for greater performance at less power, silicon technology has ...
Abstract: Soft errors are emerging with the ongoing reduction of structure sizes in current and futu...
The FTMPS-project provides a solution to the need for fault-tolerance in large systems. A complete ...
Abstract. The FTMPS-project provides a solution to the need for fault-tolerance in large systems. A ...
In this paper an innovative fault tolerant solid state mass memory (FTSSMM) architecture is describe...
Tandem builds single-fault-tolerant computer systems. At the hardware level, the system is designed ...
Fault-tolerant computing began between 1965 and 1970, probably with the highly reliable ...