Networked computer systems continue to grow in scale and in the complexity of their components and interactions. In these environments, component failures are the norm rather than the exception. A failure can make one or more computers unavailable, which reduces resource utilization and system throughput. When a computer fails to function properly, health-related data are valuable for troubleshooting. However, it is challenging to identify anomalies effectively in the large volume of noisy, high-dimensional data. In this paper, we present auto-AID, an autonomic mechanism for anomaly identification in networked computer systems. It is composed of a set of data mining techniques that facilitate automatic analysis of syste...
Network security is critical these days as network technology advances quickly and internet technolo...
In response to the demand for higher computational power, the number of computing nodes in high perf...
As the volume of data recorded from systems increases, there is a need to effectively analyse this d...
One of the important design criteria for distributed systems and their applications is their reliabi...
Anomaly detection in supercomputers is a very difficult problem due to the large scale of the systems ...
With the increasing complexity of data center networks, the operations, management and diagnosis of ...
Nowadays, when multiple aspects of our life depend on complex cyber-physical systems, automated anom...
Distributed systems have become pervasive in current society. From laptops and mobile phones, to ser...
High Performance Computing (HPC) systems are complex machines with heterogeneous components that can...
Due to global competition and increasing product complexity, the complexity of production systems ha...
The impact of an anomaly is domain-dependent. In a dataset of network activities, an anomaly can imp...
Advisors: Leonardo de Souza Mendes, Mario Lemes Proença Junior. Thesis (doctorate) - Universidade Es...
Reliability is a cumbersome problem in High Performance Computing Systems and Data Centers evolution...
This thesis investigates the possibility of using anomaly detection on performance data of virtual s...
Communication networks are complex systems consisting of many components each producing a multitude ...