Today’s distributed system infrastructures usually consist of multiple systems that cooperate to deliver computing and storage services. The reliability of such an infrastructure depends not only on the reliability of each individual system, but also on the reliability of the interactions among them. In fact, infrastructure failures are often caused by failed interactions across multiple systems. We term such failures cross-system failures. In this thesis, we conduct a pilot study of cross-system failures in seven real-world systems that are commonly used to build distributed system infrastructures. We analyze the characteristics of real-world cross-system failures in four different dimension...
Distributed systems such as grids, peer-to-peer systems, and even Internet DNS servers have grown si...
Many technical systems continue to increase in size and complexity, with more interactions and inter...
Software Systems permeate just about every aspect of life throughout developed and industrialised na...
Every large multi-site infrastructure such as Grids and Clouds must implement ...
With the increasing functionality and complexity of distributed systems, resou...
Every large multi-site infrastructure such as Grid...
Distributed software systems have become the backbone of Internet services. Failures in production ...
With the increasing functionality and complexity of distributed systems, resource failures are inevi...
Understanding the origin of infrastructure failures and their propagation patterns in critical infr...
With the increasing presence, scale, and complexity of distributed systems, resource failures are be...
With the increasing presence, scale, and complexity of distributed systems, resource failures are be...
Advances in information and communications technology (ICT) encourage the interconnection of ICT sy...
With the increasing presence, scale, and complexity of distributed sy...
With the increasing presence, scale, and complexity of distributed systems, resource failures are be...