Error data collected at runtime play a key role in the dependability analysis and improvement of software systems. The use of monitoring frameworks in legacy mission-critical systems is hindered by the limited degree of intervention such systems allow and by low-intrusiveness requirements. We present the design and experimentation of an error monitoring service for a legacy large-scale critical system in the Air Traffic Control (ATC) domain. We describe the details of the API realized to collect both direct data (event logs, execution traces) and indirect data (utilization of system resources). We present experiments with the ATC industrial case study, showing the efficacy of combining different data sources for error detection and propagation analysis, with an acceptable ...
Direct Monitoring Dataset is a collection of data obtained during an experimental analysis of differ...
Data transfer in a distributed environment is prone to frequent failures resulting from back-end syste...
The analysis of monitoring data is extremely valuable for critical computer systems. It allows to ga...
Software systems employed in critical scenarios are increasingly large and complex. The usage of man...
Error propagation analysis is a consolidated practice to gain insights into error modes and effects ...
Middleware plays a strategic role in reducing development cost and time to market. However, it raises ...
Event logs are the first place to look for useful information about application failures. Event lo...
Tech Report DOT-TSC-FAA-71-16. Air traffic control; Data processing; Error analysis; Computers; United State...
This paper describes a safety information management system designed to capture maintenance factors ...
On-line timing error detection entails gathering and analyzing monitoring data to pinpoint deviation...
This thesis introduces a novel approach to online failure prediction for mission critical distribute...
Detecting and recovering from errors in data streams is paramount to developing successful autonomou...
This paper introduces a novel approach to failure prediction for mission critical distributed system...