Abstract—In this paper, we present CLUE, a system event analytics tool for black-box performance diagnosis in production Cloud Computing systems. CLUE provides an unified and extensi-ble means of profiling service transactional behaviors, and builds structured data called event sketches. CLUE further offers a set of analytic tools for summarizing and analyzing event sketches by integrating data mining and statistical analysis. CLUE has been developed in NEC as an internal tool and applied in diagnosing a diverse set of real performance problems for multi-tiered IT applications running on multi-core servers of major platforms including Linux (Redhat, Fedora), Unix (HP-UX), and Windows (Windows Server 2008). We demonstrated the evaluation of ...
Abstract—In this paper, we present an automated on-line ser-vice for troubleshooting performance pro...
Processing heavy data is a tough job for small devices, such as smartphones, smartwatches and someti...
Cloud computing systems provide the facilities to make application services resilient against failur...
Abstract—In this paper, we present CLUE, a system event analytics tool for black-box performance dia...
Cloud datacenters comprise hundreds or thousands of disparate application services, each having stri...
Automated root cause analysis of performance problems in modern cloud computing infrastructures is o...
<p>Large production systems are susceptible to chronic performance problems where the system still w...
The increasingly popular cloud-computing paradigm provides on-demand access to computing and storage...
Distributed computing environments are increasingly deployed over geographically spanning data cente...
Modern IT infrastructures are constructed by large scale computing systems and administered by IT se...
More than ever, businesses heavily rely on IT service delivery to meet their current and frequently ...
The main goal of this research is to contribute to automated performance anomaly detection for large...
Diagnosing IT issues is a challenging problem for large-scale distributed cloud environments due to ...
Cloud-based solutions are increasingly being used to implement large-scale dynamic data driven appli...
In recent years, microservices have gained popularity due to their benefits such as increased mainta...
Abstract—In this paper, we present an automated on-line ser-vice for troubleshooting performance pro...
Processing heavy data is a tough job for small devices, such as smartphones, smartwatches and someti...
Cloud computing systems provide the facilities to make application services resilient against failur...
Abstract—In this paper, we present CLUE, a system event analytics tool for black-box performance dia...
Cloud datacenters comprise hundreds or thousands of disparate application services, each having stri...
Automated root cause analysis of performance problems in modern cloud computing infrastructures is o...
<p>Large production systems are susceptible to chronic performance problems where the system still w...
The increasingly popular cloud-computing paradigm provides on-demand access to computing and storage...
Distributed computing environments are increasingly deployed over geographically spanning data cente...
Modern IT infrastructures are constructed by large scale computing systems and administered by IT se...
More than ever, businesses heavily rely on IT service delivery to meet their current and frequently ...
The main goal of this research is to contribute to automated performance anomaly detection for large...
Diagnosing IT issues is a challenging problem for large-scale distributed cloud environments due to ...
Cloud-based solutions are increasingly being used to implement large-scale dynamic data driven appli...
In recent years, microservices have gained popularity due to their benefits such as increased mainta...
Abstract—In this paper, we present an automated on-line ser-vice for troubleshooting performance pro...
Processing heavy data is a tough job for small devices, such as smartphones, smartwatches and someti...
Cloud computing systems provide the facilities to make application services resilient against failur...