This paper reports on orchestra, a portable fault injection environment for testing implementations of distributed protocols. The paper focuses on architectural features of orchestra that provide portability, minimize intrusiveness on target protocols, and support testing of real-time systems. orchestra is based on a simple yet powerful framework, called script-driven probing and fault injection, for the evaluation and validation of the fault-tolerance and timing characteristics of distributed protocols. orchestra was initially developed on the Real-Time Mach operating system and later ported to other platforms including Solaris and SunOS, and has been used to conduct extensive experiments on several protocol implementations. A novel feat...
This document describes the research performed on fault isolation in dynamic distributed systems at ...
We describe a centralized approach to testing that distributed fault-tolerant protocols satisfy thei...
This thesis addresses issues in building fault-tolerant distributed real-time systems. Such systems ...
As software for distributed systems becomes more complex, ensuring that a system meets its prescribe...
This paper describes a set of experiments performed on six different vendor TCP implementations usin...
Ensuring that a system meets its prescribed specification is a growing challenge that confronts soft...
We present a case study on fault injection testing at the interface level between components of a di...
TCP, the de facto standard transport protocol in today’s operating systems, is a very robust protoco...
Software is being used for building applications requiring extreme dependability. In many cases, sys...
This paper describes an environment for fault injection based testing of protocols that implement fa...
PhD ThesisOne way of gaining confidence in the adequacy of fault tolerance mechanisms of a system...
Having robust systems that behave properly even in presence of faults is becoming increasingly impor...
There is trend of increasing demand for highly dependable software systems. The factors that influen...
email fsjhan rosen kgshingeecsumichedu This paper presents an integrateD sO ftware fault injeC T i...
TCP, the de facto standard transport protocol in today's operating systems, is a very robust proto...
This document describes the research performed on fault isolation in dynamic distributed systems at ...
We describe a centralized approach to testing that distributed fault-tolerant protocols satisfy thei...
This thesis addresses issues in building fault-tolerant distributed real-time systems. Such systems ...
As software for distributed systems becomes more complex, ensuring that a system meets its prescribe...
This paper describes a set of experiments performed on six different vendor TCP implementations usin...
Ensuring that a system meets its prescribed specification is a growing challenge that confronts soft...
We present a case study on fault injection testing at the interface level between components of a di...
TCP, the de facto standard transport protocol in today’s operating systems, is a very robust protoco...
Software is being used for building applications requiring extreme dependability. In many cases, sys...
This paper describes an environment for fault injection based testing of protocols that implement fa...
PhD ThesisOne way of gaining confidence in the adequacy of fault tolerance mechanisms of a system...
Having robust systems that behave properly even in presence of faults is becoming increasingly impor...
There is trend of increasing demand for highly dependable software systems. The factors that influen...
email fsjhan rosen kgshingeecsumichedu This paper presents an integrateD sO ftware fault injeC T i...
TCP, the de facto standard transport protocol in today's operating systems, is a very robust proto...
This document describes the research performed on fault isolation in dynamic distributed systems at ...
We describe a centralized approach to testing that distributed fault-tolerant protocols satisfy thei...
This thesis addresses issues in building fault-tolerant distributed real-time systems. Such systems ...