International audienceModern component frameworks support continuous deployment and simultaneous execution of multiple software components on top of the same virtual machine. However, isolation between the various components is limited. A faulty version of any one of the software components can compromise the whole system by consuming all available resources. In this paper, we address the problem of efficiently identifying faulty software components running simultaneously in a single virtual machine. Current solutions that perform permanent and extensive monitoring to detect anomalies induce high overhead on the system, and can, by themselves, make the system unstable. In this paper we present an optimistic adaptive monitoring system to det...
Modern cloud data centers indicate the frequent existence of complex failure manifestation. Failures...
OSGi has become one of the most promising frameworks for managing service-oriented and component-bas...
We propose a new approach for reacting to a wide variety of software failures, ranging from remotely...
International audienceModern component frameworks support continuous deployment and simultaneous exe...
International audienceModern component frameworks support continuous deployment and simultaneous exe...
Programa doutoral em Informática - MAP-iInternet services play a critical role day-by-day in persona...
Effective monitoring of a software system’s runtime behavior is necessary to evaluate the compliance...
Software systems need monitors for observing them and respond to changes through adaptation. Any fau...
Software anomalies are recognized as a major problem affecting the performance and availability of m...
As Moore's law has been continuously improving the microprocessor's speed, performance is no longer ...
One of the important design criteria for distributed systems and their applications is their reliabi...
On-line failure detection is an essential means to control and assess the dependability of complex a...
Off-The-Shelf (COTS) software components have been extensively used by applications over the world. ...
What do our computer systems do all day? How do we make sure they continue doing it when failures oc...
Availability and reliability are often important features of key software appliances such as firewa...
Modern cloud data centers indicate the frequent existence of complex failure manifestation. Failures...
OSGi has become one of the most promising frameworks for managing service-oriented and component-bas...
We propose a new approach for reacting to a wide variety of software failures, ranging from remotely...
International audienceModern component frameworks support continuous deployment and simultaneous exe...
International audienceModern component frameworks support continuous deployment and simultaneous exe...
Programa doutoral em Informática - MAP-iInternet services play a critical role day-by-day in persona...
Effective monitoring of a software system’s runtime behavior is necessary to evaluate the compliance...
Software systems need monitors for observing them and respond to changes through adaptation. Any fau...
Software anomalies are recognized as a major problem affecting the performance and availability of m...
As Moore's law has been continuously improving the microprocessor's speed, performance is no longer ...
One of the important design criteria for distributed systems and their applications is their reliabi...
On-line failure detection is an essential means to control and assess the dependability of complex a...
Off-The-Shelf (COTS) software components have been extensively used by applications over the world. ...
What do our computer systems do all day? How do we make sure they continue doing it when failures oc...
Availability and reliability are often important features of key software appliances such as firewa...
Modern cloud data centers indicate the frequent existence of complex failure manifestation. Failures...
OSGi has become one of the most promising frameworks for managing service-oriented and component-bas...
We propose a new approach for reacting to a wide variety of software failures, ranging from remotely...