This paper presents a fault-tolerant scheme applicable to any decentralized load balancing algorithm used in soft real-time distributed systems. Using the theory of distance-transitive graphs to represent the topologies of these systems, the proposed strategy partitions them into independent symmetric regions (spheres) centered at some control points. These central points, called fault-control points, provide two-level task redundancy and efficiently redistribute the load of failed nodes within their spheres. Using the algebraic characteristics of these topologies, it is shown that the identification of spheres and fault-control points is, in general, an NP-complete problem. An efficient solution for this problem is presente...
Fault-tolerance is an important requirement for real-time distributed systems, which are designed to p...
This paper designs a fault-tolerant soft real-time distributed system (FTRTDS). This system is...
A general framework for the design and analysis of distributed fault-tolerant systems is proposed in...
Fault-tolerance becomes an important key to establish dependability in Real Time Distributed Systems...
This thesis addresses issues in building fault-tolerant distributed real-time systems. Such systems ...
A distributed computing system is a software system in which components are located on different attached com...
This paper presents a semi-distributed approach for load balancing in large parallel and distribute...
Fault tolerance can be defined as a concept of recovery that keeps a computer system operational by ...
Solutions to resource allocation problems and other related synchronization problems in distributed ...
A Thesis Submitted to the Faculty of Engineering, University of the Witwatersrand, Johannesburg in...
Clusters of workstations, connected by a fast network, are emerging as a viable architecture for bui...
Fault tolerant distributed systems must be able to continue operation in the presence of hardware fa...
Networking involves every aspect in the design of the network infrastructure from the selection/synt...
A monitoring approach to the problem of constructing fault-tolerant and adaptive real-time systems, ...