Understanding the impact of soft errors on applications can be expensive. Often, it requires an extensive error injection campaign involving numerous runs of the full application in the presence of errors. In this paper, we present a novel approach to arriving at the ground truth-the true impact of an error on the final output-for iterative methods by observing a small number of iterations to learn deviations between normal and error-impacted execution. We develop a machine learning based predictor for three iterative methods to generate ground-truth results without running them to completion for every error injected. We demonstrate that this approach achieves greater accuracy than alternative prediction strategies, including three existing...
Resilient algorithms in high-performance computing are subject to rigorous non-functional constrain...
Devices are increasingly vulnerable to soft errors as their feature sizes shrink. Previously, soft e...
In the modern era of computing, processors are increasingly susceptible to soft errors. Current solu...
Soft errors caused by transient bit flips have the potential to significantly impactan applicalion's...
International audienceThis paper presents two error models to evaluate safety of a software error de...
With shrinking device size and increasing complexity, soft errors are becoming an issue in the relia...
Virtual platform frameworks have been extended to allow earlier soft error analysis of more realisti...
ELLIOTT III, JAMES JOHN. Resilient Iterative Linear Solvers Running Through Errors. (Under the direc...
International audienceMachine learning (ML) algorithms have been regaining momentum thanks to their ...
ContextDefect prediction can help at prioritizing testing tasks by, for instance, ranking a list of ...
Software defect prediction is one of the essential sets of operation towards mitigating issues of ri...
Traditionally, fault tolerance researchers have made very strict assumptions about program correctne...
Hardware errors are on the rise with reducing chip sizes, and power constraints have necessitated th...
Background Test resources are usually limited and therefore it is often not possible to completely t...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
Resilient algorithms in high-performance computing are subject to rigorous non-functional constrain...
Devices are increasingly vulnerable to soft errors as their feature sizes shrink. Previously, soft e...
In the modern era of computing, processors are increasingly susceptible to soft errors. Current solu...
Soft errors caused by transient bit flips have the potential to significantly impactan applicalion's...
International audienceThis paper presents two error models to evaluate safety of a software error de...
With shrinking device size and increasing complexity, soft errors are becoming an issue in the relia...
Virtual platform frameworks have been extended to allow earlier soft error analysis of more realisti...
ELLIOTT III, JAMES JOHN. Resilient Iterative Linear Solvers Running Through Errors. (Under the direc...
International audienceMachine learning (ML) algorithms have been regaining momentum thanks to their ...
ContextDefect prediction can help at prioritizing testing tasks by, for instance, ranking a list of ...
Software defect prediction is one of the essential sets of operation towards mitigating issues of ri...
Traditionally, fault tolerance researchers have made very strict assumptions about program correctne...
Hardware errors are on the rise with reducing chip sizes, and power constraints have necessitated th...
Background Test resources are usually limited and therefore it is often not possible to completely t...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
Resilient algorithms in high-performance computing are subject to rigorous non-functional constrain...
Devices are increasingly vulnerable to soft errors as their feature sizes shrink. Previously, soft e...
In the modern era of computing, processors are increasingly susceptible to soft errors. Current solu...