HPC-ODA is a collection of datasets acquired on production HPC systems. The datasets are representative of several real-world use cases in the field of Operational Data Analytics (ODA) aimed at improving reliability and energy efficiency. Each dataset consists of monitoring sensor data acquired from the components of different HPC systems, depending on the specific use case. The data were acquired with two monitoring frameworks, DCDB and LDMS, both of which have been shown to impose very low overhead. The aim of HPC-ODA is to provide several vertical slices (here named segments) of the monitoring data available in a large-scale HPC installation. The segments all have different granularities, in terms of data sources and time...
The dataset in the tarball was used as job- and power-trace input for the paper "What does Power Con...
Monitoring users on large computing platforms such as high performance computing (HPC) and cloud com...
Reliability and energy are two of the top major concerns in the development of today's supercomputer...
Greening of Data Centers could be achieved through energy savings in two major areas namely: compute...
The Antarex dataset contains trace data collected from the homonymous experimental HPC system locate...
As the scale of High-Performance Computing (HPC) clusters continues to grow, their increasing failur...
The continuous increase in the data produced by simulations, experiments and edge components in the ...
In this work, system monitoring and analysis are discussed in terms of their significance and bene...
Modern scientific discoveries are driven by an insatiable demand for computational resources. Hig...
Large science projects rely on complex workflows to analyze terabytes or petabytes of data. These jo...
Traditional cluster monitoring approaches consider nodes in singleton, using manufacturer-specified ...
Energy usage of computing equipment is an important consideration and energy inefficiency of compute...
This document describes how to obtain, install, use, and enjoy a better life with OVIS version 3.2. ...
Hardware monitoring through performance counters is available on almost all modern processors. Altho...