Cloud computing systems provide the facilities to make application services resilient against failures of individual computing resources. However, resiliency is typically limited by a cloud consumer's use and operation of cloud resources. In particular, system operations have been reported as one of the leading causes of system-wide outages. This applies specifically to DevOps operations, such as backup, redeployment, upgrade, customized scaling, and migration, which are executed at much higher frequencies now than a decade ago. We address this problem by proposing a novel approach to detect errors in the execution of these kinds of operations, in particular for rolling upgrade operations. Our regression-based approach leverages the cor...
Most cloud computing clusters are built from unreliable, commercial off-the-shelf components compar...
The dependability of cloud computing services is a major concern of cloud prov...
We address several problems in intelligent log management of distributed cloud computing application...
Failure of application operations is one of the main causes of system-wide outages in cloud environm...
Failure of application operations is one of the main causes of system-wide outages in cloud...
When operating large cloud computing infrastructures, ensuring healthiness of physical resources and...
In recent years, microservices have gained popularity due to their benefits such as increased mainta...
© Springer International Publishing Switzerland 2016. Cloud data centres are implemented as large-sc...
The increasingly popular cloud-computing paradigm provides on-demand access to computing and storage...
Likely system invariants model properties that hold in operating conditions of a computing system. I...
Cloud computing is gaining enormous popularity every day. But with the growing demand of cloud comp...
Infrastructure-as-a-Service (IaaS) Cloud is a popular platform for providing virtual computing and s...
Cloud datacenters comprise hundreds or thousands of disparate application services, each having stri...
Cloud data centres are critical business infrastructures and the fastest growing service providers. ...
Cloud computing is a model for on-demand access to shared resources based on the pay-per-use policy....