When operating large cloud computing infrastructures, ensuring healthiness of physical resources and software components is of paramount importance to meet the demanding service levels expected by customers. This is only possible using automations that can detect anomalies and alert the on-call personnel, or trigger healing procedures. In production-grade deployments, such automations are generally based on static thresholds or predefined pattern-matching rules, checked against relevant metrics and logs. Defining and maintaining them is cumbersome and, as the infrastructure grows, they need continuous adjustments. To tackle this problem, we propose an intelligent automation system for cloud operations that learns, from what operators have d...
Cloud is one of the emerging technologies in the field of computer science and is extremely popular ...
The method for ensuring availability in an existing cloud environment is primarily a metric-based fa...
Nowadays, the development of virtualization technologies as well as the development of the Internet ...
When operating large cloud computing infrastructures, ensuring healthiness of physical resources and...
International audienceThe dependability of cloud computing services is a major concern of cloud prov...
In recent years, microservices have gained popularity due to their benefits such as increased mainta...
Cloud computing systems provide the facilities to make application services resilient against failur...
The increasingly popular cloud-computing paradigm provides on-demand access to computing and storage...
Failures in computer systems can be often tracked down to software anomalies of various kinds. In ma...
Software anomalies are recognized as a major problem affecting the performance and availability of m...
Context: With an increasing number of applications running on a microservices-based cloud system (su...
Abstract. Cloud computing is now on the verge of being embraced as a serious usage-model. However, w...
Various literature studies demonstrated that the cloud computing paradigm can help to improve availa...
Various literature studies demonstrated that the cloud computing paradigm can help to improve availa...
Diagnosing IT issues is a challenging problem for large-scale distributed cloud environments due to ...
Cloud is one of the emerging technologies in the field of computer science and is extremely popular ...
The method for ensuring availability in an existing cloud environment is primarily a metric-based fa...
Nowadays, the development of virtualization technologies as well as the development of the Internet ...
When operating large cloud computing infrastructures, ensuring healthiness of physical resources and...
International audienceThe dependability of cloud computing services is a major concern of cloud prov...
In recent years, microservices have gained popularity due to their benefits such as increased mainta...
Cloud computing systems provide the facilities to make application services resilient against failur...
The increasingly popular cloud-computing paradigm provides on-demand access to computing and storage...
Failures in computer systems can be often tracked down to software anomalies of various kinds. In ma...
Software anomalies are recognized as a major problem affecting the performance and availability of m...
Context: With an increasing number of applications running on a microservices-based cloud system (su...
Abstract. Cloud computing is now on the verge of being embraced as a serious usage-model. However, w...
Various literature studies demonstrated that the cloud computing paradigm can help to improve availa...
Various literature studies demonstrated that the cloud computing paradigm can help to improve availa...
Diagnosing IT issues is a challenging problem for large-scale distributed cloud environments due to ...
Cloud is one of the emerging technologies in the field of computer science and is extremely popular ...
The method for ensuring availability in an existing cloud environment is primarily a metric-based fa...
Nowadays, the development of virtualization technologies as well as the development of the Internet ...