Nanoscale technology nodes bring reliability concerns back to the center stage of digital system design. A systematic classification is presented of approaches that increase system resilience in the presence of functional hardware (HW)-induced errors, focusing on higher system abstractions such as the (micro)architecture, the mapping, and the platform software (SW). The field is surveyed using nonoverlapping categories, which add insight into ongoing work by exposing similarities and differences. HW and SW solutions are discussed in a similar fashion so that interrelationships become apparent. The presented categories are illustrated by representative literature examples that highlight their properties. Moreover, it...
Hardware errors become more common as silicon technologies shrink and become more vulnerable, especi...
Resilient computing is defined as the ability of a system to stay dependable w...
With memories continuing to dominate the area, power, cost and performance of a design, there is a c...
This paper presents a new approach for monitoring and estimating device reliability of nanometer-sca...
Hardware techniques to improve the robustness of a computing system can be very expensive, difficult...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
Premi extraordinari doctorat 2013-2014. During the last decades, human beings have experienced a signi...
Reliability has always been a major concern in designing computing systems. However, the increasing ...
Soft errors are faults which are not caused by defective hardware, rather they are induced due to no...
According to Moore’s law, technology scaling is continuously providing smaller and faster devices. T...
This report provides an introduction to resilience methods. The emphasis is on checkpointing, the de...
A system that remains dependable when facing changes (new threats, updates) is call...
A system that remains dependable when facing changes (new threats, updates) is called resilient. The...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
As late-CMOS process scaling leads to increasingly variable circuits/logic and as most post-CMOS tec...