A growing number of Internet applications adopt the microservice architecture for its flexibility and clear logic. The stability of microservices is therefore vital to these applications' quality of service. Accurate failure root cause localization helps operators recover quickly from microservice failures and mitigate loss. Although cross-microservice failure root cause localization has been well studied, localizing failure root causes within a single microservice, so that the failing microservice can be mitigated quickly, has not yet been studied. In this work, we propose a framework, MicroCause, to accurately localize the root cause monitoring indicators in a microservice. MicroCause combines a simple yet effective path condition time series (P...
Diagnosing problems in Internet-scale services remains particularly difficult ...
We present a method to locate faults in service-based software systems hosted on mobile ad hoc netwo...
With the growth of system size and complexity, reliability has become a major concern for large-scal...
Software architecture is undergoing a transition from monolithic architectures...
The microservice architecture has been commonly adopted by large scale software systems exemplified ...
In this work, we present Graph Based Liability Analysis Framework (GRALAF) for root cause analysis (...
Here are the data used in our paper published at ICSE 2023: "Eadro: An End-to-End Troubleshooting ...
Cascading failures can severely affect the correct functioning of large enterprise applications cons...
Microservices are popular for web applications as they offer better scalability and reliability than...
IT infrastructure is a crucial part in most of today's business operations. High availability and re...
Context: With an increasing number of applications running on a microservices-based cloud system (su...
Large-scale data center networks are complex - comprising several thousand network devices and sever...
In this thesis, we have focused on applying Spectrum-based Fault Localization (SFL) to diagnose Serv...
For dependability outages in distributed internet infrastructures, it is often not enough ...