In the modern era of computing, processors are increasingly susceptible to soft errors. Current solutions in both hardware and software enable error detection and correction. Some of these errors, however, go unnoticed by detectors and manifest as silent data corruptions (SDCs) at the application level. Injecting errors into the system and evaluating the outcomes is one method to uncover SDC-causing errors and determine an application's overall resilience to soft errors. The number of possible locations that errors may appear in is large, therefore requiring many injection experiments. One resiliency analysis tool, Relyzer, addresses this issue by performing a comprehensive program analysis to create a small subset of the error injectio...
Emerging high-performance architectures are anticipated to contain unreliable components that may ex...
As high-performance computing (HPC) continues to progress, constraints on HPC system design forces t...
As technology scales, the hardware reliability challenge affects a broad computing market, rendering...
In the modern era of computing, processors are increasingly susceptible to soft errors. Current solu...
The rising count and shrinking feature size of transistors within modern computers is making them in...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
dissertationCurrent scaling trends in transistor technology, in pursuit of larger component counts a...
textDependability and fault tolerance are important aspects of modern computer systems. Particle str...
Resilient algorithms in high-performance computing are subject to rigorous non-functional constrain...
Technology scaling has led to growing concerns about reliability in microprocessors. Currently, faul...
Soft errors are a growing concern for processor reliability. Recent work has motivated architecture ...
Emerging high-performance architectures are anticipated to contain unreliable components that may ex...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
This paper presents an empirical investigation on the soft error sensitivity (SES) of microprocessor...
Soft errors caused by transient bit flips have the potential to significantly impactan applicalion's...
Emerging high-performance architectures are anticipated to contain unreliable components that may ex...
As high-performance computing (HPC) continues to progress, constraints on HPC system design forces t...
As technology scales, the hardware reliability challenge affects a broad computing market, rendering...
In the modern era of computing, processors are increasingly susceptible to soft errors. Current solu...
The rising count and shrinking feature size of transistors within modern computers is making them in...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
dissertationCurrent scaling trends in transistor technology, in pursuit of larger component counts a...
textDependability and fault tolerance are important aspects of modern computer systems. Particle str...
Resilient algorithms in high-performance computing are subject to rigorous non-functional constrain...
Technology scaling has led to growing concerns about reliability in microprocessors. Currently, faul...
Soft errors are a growing concern for processor reliability. Recent work has motivated architecture ...
Emerging high-performance architectures are anticipated to contain unreliable components that may ex...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
This paper presents an empirical investigation on the soft error sensitivity (SES) of microprocessor...
Soft errors caused by transient bit flips have the potential to significantly impactan applicalion's...
Emerging high-performance architectures are anticipated to contain unreliable components that may ex...
As high-performance computing (HPC) continues to progress, constraints on HPC system design forces t...
As technology scales, the hardware reliability challenge affects a broad computing market, rendering...