This data set contains the data collected on the DAVIDE HPC system (CINECA & E4 & University of Bologna, Bologna, Italy) in the period March-May 2018. The data set has been used to train a autoencoder-based model to automatically detect anomalies in a semi-supervised fashion, on a real HPC system. This work is described in: 1) "Anomaly Detection using Autoencoders in High Performance Computing Systems", Andrea Borghesi, Andrea Bartolini, Michele Lombardi, Michela Milano, Luca Benini, IAAI19 (proceedings in process) -- https://arxiv.org/abs/1902.08447 2) "Online Anomaly Detection in HPC Systems", Andrea Borghesi, Antonio Libri, Luca Benini, Andrea Bartolini, AICAS19 (proceedings in process) -- https://arxiv.org/abs/1811.05269 See the gi...
I have developed a project called AnomalyDetector, a software written in Python using Tensorflow 2.0...
In response to the demand for higher computational power, the number of computing nodes in high perf...
The certification of the CMS experiment data as usable for physics analysis is a crucial task to ens...
High Performance Computing (HPC) systems are complex machines with heterogeneous components that can...
Alla base di questo studio vi è l'analisi di tecniche non supervisionate applicate per il rilevament...
none5noopenBorghesi, Andrea; Bartolini, Andrea; Lombardi, Michele; Milano, Michela; Benini, LucaBorg...
Reliability is a cumbersome problem in High Performance Computing Systems and Data Centers evolution...
Nell’ambito dei supercomputer, l’attività di anomaly detection rappresenta un’ottima strategia per m...
The increasing complexity of modern high-performance computing (HPC) systems necessitates the introd...
Automated and data-driven methodologies are being introduced to assist system administrators in mana...
The continuously growing amount of monitored data in the Industry 4.0 context requires strong and re...
The Antarex dataset contains trace data collected from the homonymous experimental HPC system locate...
Anomaly detection is the identification of events or observations that deviate from the expected beh...
This demo paper presents a design and implementation of a system AnomalyKiTS for detecting anomalies...
Anomaly detection algorithms solve the problem of identifying unexpected values in data sets. Such a...
I have developed a project called AnomalyDetector, a software written in Python using Tensorflow 2.0...
In response to the demand for higher computational power, the number of computing nodes in high perf...
The certification of the CMS experiment data as usable for physics analysis is a crucial task to ens...
High Performance Computing (HPC) systems are complex machines with heterogeneous components that can...
Alla base di questo studio vi è l'analisi di tecniche non supervisionate applicate per il rilevament...
none5noopenBorghesi, Andrea; Bartolini, Andrea; Lombardi, Michele; Milano, Michela; Benini, LucaBorg...
Reliability is a cumbersome problem in High Performance Computing Systems and Data Centers evolution...
Nell’ambito dei supercomputer, l’attività di anomaly detection rappresenta un’ottima strategia per m...
The increasing complexity of modern high-performance computing (HPC) systems necessitates the introd...
Automated and data-driven methodologies are being introduced to assist system administrators in mana...
The continuously growing amount of monitored data in the Industry 4.0 context requires strong and re...
The Antarex dataset contains trace data collected from the homonymous experimental HPC system locate...
Anomaly detection is the identification of events or observations that deviate from the expected beh...
This demo paper presents a design and implementation of a system AnomalyKiTS for detecting anomalies...
Anomaly detection algorithms solve the problem of identifying unexpected values in data sets. Such a...
I have developed a project called AnomalyDetector, a software written in Python using Tensorflow 2.0...
In response to the demand for higher computational power, the number of computing nodes in high perf...
The certification of the CMS experiment data as usable for physics analysis is a crucial task to ens...