The modeling and design of a fault-tolerant multiprocessor system is addressed. In particular, the behavior of the system during recovery and restoration after a fault has occurred is investigated. Given that a multicomputer system is designed using the Algorithm to Architecture to Mapping Model (ATAMM), and that a fault (death of a computing resource) occurs during its normal steady-state operation, a model is presented as a viable research tool for predicting the performance bounds of the system during its recovery and restoration phases. Furthermore, the bounds of the performance behavior of the system during this transient mode can be assessed. These bounds include: time to recover from the fault (t(sub rec)), time to restore the system...
As computers become more widely used, and in particular as they become used in more safety critical ...
Modern high-end computers are unprecedentedly complex. Occurrence of faults is an inevitable fact in...
The use of dynamic reconfiguration has been proposed to tolerate faults in large-scale partitionable...
The modeling and design of a fault-tolerant multiprocessor system is addressed in this dissertation....
Various aspects of reliable computing are formalized and quantified with emphasis on efficient fault...
A fault-tolerant multiprocessor with a rollback recovery mechanism is discussed. The rollback mechan...
Traditional reliability-related models for fault-tolerant systems are used to predict system reliabi...
AbstractSystem reliability is an important aspect of real-time systems, because the result of a real...
This paper demonstrates a methodology to model and evaluate the fault tolerance characteristics of o...
With the new generation of very fast microprocessors and support chips, it is now possible to consid...
Research on dependable computing is undergoing a shift from traditional fault tolerance towards tech...
Three experiments on fault tolerant multiprocessors (FTMP) were begun. They are: (1) measurement of ...
Reliability modeling for fault tolerant avionic computing systems was developed. The modeling of lar...
A validation methodology for testing the performance of fault-tolerant computer systems was develope...
Performability is an attribute of a system which combines reliability and performance. Recovery proc...
As computers become more widely used, and in particular as they become used in more safety critical ...
Modern high-end computers are unprecedentedly complex. Occurrence of faults is an inevitable fact in...
The use of dynamic reconfiguration has been proposed to tolerate faults in large-scale partitionable...
The modeling and design of a fault-tolerant multiprocessor system is addressed in this dissertation....
Various aspects of reliable computing are formalized and quantified with emphasis on efficient fault...
A fault-tolerant multiprocessor with a rollback recovery mechanism is discussed. The rollback mechan...
Traditional reliability-related models for fault-tolerant systems are used to predict system reliabi...
AbstractSystem reliability is an important aspect of real-time systems, because the result of a real...
This paper demonstrates a methodology to model and evaluate the fault tolerance characteristics of o...
With the new generation of very fast microprocessors and support chips, it is now possible to consid...
Research on dependable computing is undergoing a shift from traditional fault tolerance towards tech...
Three experiments on fault tolerant multiprocessors (FTMP) were begun. They are: (1) measurement of ...
Reliability modeling for fault tolerant avionic computing systems was developed. The modeling of lar...
A validation methodology for testing the performance of fault-tolerant computer systems was develope...
Performability is an attribute of a system which combines reliability and performance. Recovery proc...
As computers become more widely used, and in particular as they become used in more safety critical ...
Modern high-end computers are unprecedentedly complex. Occurrence of faults is an inevitable fact in...
The use of dynamic reconfiguration has been proposed to tolerate faults in large-scale partitionable...