The level of trust on log-based dependability characterization of complex distributed systems, is biased by the ability at identifying failure data from event logs collected across different locations, which is a challenging issue. Several factors compromise the ability at identifying failure data, such as the accuracy of the logging mechanism to detect occur- ring errors, the effectiveness of the infrastructure that is adopted to manage failure data, correlation phenomena among the entries in the log. The focus of the thesis is to evaluate the accuracy of current logging mechanisms at reporting failures, and to develop novel techniques to make event logs effective to infer failure data. Techniques involve production, collection, and correl...
Event logs are the primary source of data to characterize the dependability behavior of a computing ...
System logs are the first source of information available to system designers to analyze and trouble...
Event logs are the primary source of data to character-ize the dependability behavior of a computing...
The level of trust on log-based dependability characterization of complex distributed systems, is bi...
Failure analysis is valuable to dependability engineers because it supports designing effective miti...
Software faults are recognized to be among the main responsible for system failures in many applicat...
The dependability evaluation and management of complex systems is often based on the collection of f...
Event logs have been widely used over the last three decades to analyze the failure behavior of a va...
© 2014 IEEE. As the sizes of supercomputers and data centers grow towards exascale, failures become ...
Monitoring is a consolidated practice to characterize the dependability behavior of a software syste...
Event logs are the primary source of data to characterize the dependability behavior of a computing ...
The ability to automatically detect faults or fault patterns to enhance system reliability is import...
Event logs are the first place where to find useful information about application failures. Event lo...
Abstract—Field Failure Data Analysis (FFDA) is a widely adopted methodology to characterize the depe...
The analysis of monitoring data is extremely valuable for critical computer systems. It allows to ga...
Event logs are the primary source of data to characterize the dependability behavior of a computing ...
System logs are the first source of information available to system designers to analyze and trouble...
Event logs are the primary source of data to character-ize the dependability behavior of a computing...
The level of trust on log-based dependability characterization of complex distributed systems, is bi...
Failure analysis is valuable to dependability engineers because it supports designing effective miti...
Software faults are recognized to be among the main responsible for system failures in many applicat...
The dependability evaluation and management of complex systems is often based on the collection of f...
Event logs have been widely used over the last three decades to analyze the failure behavior of a va...
© 2014 IEEE. As the sizes of supercomputers and data centers grow towards exascale, failures become ...
Monitoring is a consolidated practice to characterize the dependability behavior of a software syste...
Event logs are the primary source of data to characterize the dependability behavior of a computing ...
The ability to automatically detect faults or fault patterns to enhance system reliability is import...
Event logs are the first place where to find useful information about application failures. Event lo...
Abstract—Field Failure Data Analysis (FFDA) is a widely adopted methodology to characterize the depe...
The analysis of monitoring data is extremely valuable for critical computer systems. It allows to ga...
Event logs are the primary source of data to characterize the dependability behavior of a computing ...
System logs are the first source of information available to system designers to analyze and trouble...
Event logs are the primary source of data to character-ize the dependability behavior of a computing...