Criticul infrastructure applications pmvide services upon which society depends heavily; such applications require survivabiliry in the face of faults that mighr cause a loss of service. These applications are lhemselves dependent on distributed information systems for all aspects of their operation ond so survivability of rhe information systems is an important issue. Fault tolerance is U key mechanism by which survivnbiliry can be achieved in these information systems. Much of the liternrum on fault-toleranr distribicted sysrems focuses on local error recovery by masking rhe effects of faults. We describe a direction for error recov-ery in rheface of catastrophic faults where the effects of the faulrs cannot be masked using available reso...
Large scale distributed computing systems have been extensively utilized to host critical applicatio...
In this paper we examine the problem of failures within network communications and telecom systems a...
International audienceDistributed computing infrastructures support system and network fault-toleran...
This paper deals with human error resistance. In the first part of it, a short state-of-the-art of h...
Operating systems often manage critical infrastructures where failures can have serious consequences...
A Thesis Submitted to the Faculty 0/ Engineering, University 0/ Lite Witwatersrand, Johannesburg in...
This book covers the most essential techniques for designing and building dependable distributed sys...
Fault-tolerant computing encompasses the methods that let computers perform their intended function ...
The aim of this paper is to take advantage of distributed systems for fault-tolerance, but keeping i...
The Web services architecture is expected to play a prominent role in developing next generation dis...
Failure of IT systems often causes a major loss of service. Thus their dependability has become an i...
Abstract. Traditionally, it is common to distinguish between three broad families of methods for dea...
Traditional reliability-related models for fault-tolerant systems are used to predict system reliabi...
Dependability is a qualitative term referring to a system's ability to meet its service requirements...
In distributed systems, if a hardware fault corrupts the state of a process, this error might propag...
Large scale distributed computing systems have been extensively utilized to host critical applicatio...
In this paper we examine the problem of failures within network communications and telecom systems a...
International audienceDistributed computing infrastructures support system and network fault-toleran...
This paper deals with human error resistance. In the first part of it, a short state-of-the-art of h...
Operating systems often manage critical infrastructures where failures can have serious consequences...
A Thesis Submitted to the Faculty 0/ Engineering, University 0/ Lite Witwatersrand, Johannesburg in...
This book covers the most essential techniques for designing and building dependable distributed sys...
Fault-tolerant computing encompasses the methods that let computers perform their intended function ...
The aim of this paper is to take advantage of distributed systems for fault-tolerance, but keeping i...
The Web services architecture is expected to play a prominent role in developing next generation dis...
Failure of IT systems often causes a major loss of service. Thus their dependability has become an i...
Abstract. Traditionally, it is common to distinguish between three broad families of methods for dea...
Traditional reliability-related models for fault-tolerant systems are used to predict system reliabi...
Dependability is a qualitative term referring to a system's ability to meet its service requirements...
In distributed systems, if a hardware fault corrupts the state of a process, this error might propag...
Large scale distributed computing systems have been extensively utilized to host critical applicatio...
In this paper we examine the problem of failures within network communications and telecom systems a...
International audienceDistributed computing infrastructures support system and network fault-toleran...