With the advent of large networks and the demand for uninterrupted service, there is a pressing need for computer systems to be more robust and fault tolerant. There are numerous ways to implement fault tolerance and recovery [5, 50]. Yet a central concept in all these methods is the requirement for replicated data, leading to high data availability. We believe that a protocol must not only provide data replication but also do so at low operational overhead. Further, the protocol must provide mechanisms for varying the level of replication (so that the system may be operated at a desired overhead cost), and must scale well. At the University of California, Riverside, we have developed a program-driven ..
The author mainly concentrates on transactional distributed systems. Most previous research on repli...
Fault-tolerant protocols are currently unscalable for large clusters. After a brief explanation of ...
This paper shows how a state-of-the-art software distributed shared-memory (DSM) protocol can be eff...
Distributed Shared Memory (DSM) systems are becoming increasingly significant as a result of be...
Distributed Shared Memory (DSM) architectures are attractive to execute high performance parallel ...
Distributed software systems are the basis for innovative applications (e.g. pervasive computing, te...
We present a new coherence protocol class for DSM systems whose instances offer highly available acc...
This research proposes an algorithm for fault-tolerance in a home-based lazy release consistent dist...
This thesis focuses on the issue of reliability and fault tolerance in Distributed Shared Memory Mul...
This paper presents a new technique for efficiently controlling replicas in distributed systems. Con...
Backward error recovery involving checkpointing and restart of tasks is an important component of an...
Software systems fail; distributed systems fail in worse ways [20]. The causes of failures can be va...
The pervasiveness of cloud-based services has significantly increased the demand for highly dependab...
In data-intensive distributed systems, replication is the most widely used approach to offer high da...