As DNNs become increasingly common in mission-critical applications, ensuring their reliable operation has become crucial. Conventional resilience techniques fail to account for the unique characteristics of DNN algorithms/accelerators, and hence, they are infeasible or ineffective. In this paper, we present a survey of techniques for studying and optimizing the reliability of DNN accelerators and architectures. The reliability issues we cover include soft/hard errors arising due to process variation, voltage scaling, timing errors, DRAM errors due to refresh rate scaling and thermal effects, etc. We organize the research projects on several categories to bring out their key attributes. This paper underscores the importance of designing for...
In this work we present a 3D dynamic simulation analysis for the reliability evaluation of a decanan...
2018-02-02This thesis is dedicated to improving the efficiency of resilient computing through both a...
The lecture provides an overview of considerations relevant for achieving highly reliable operation ...
Abstract — Computer reliability is a growing concern. High-end processor vendors, as well as academi...
7 pages, 6 figuresDeep Neural Networks (DNNs) enable a wide series of technological advancements, ra...
Emergence of Deep Neural Networks (DNN) has led to a proliferation of artificial intelligence appli...
Deep neural networks have achieved phenomenal successes in vision recognition tasks, which motivate ...
Deep Neural Networks (DNNs) enable a wide series of technological advancements, ranging from clinica...
Aggressive process scaling and increasing demands of performance/cost efficiency have exacerbated th...
Computing-in-memory with emerging non-volatile memory (nvCiM) is shown to be a promising candidate f...
Computing-in-Memory (CiM) architectures based on emerging non-volatile memory (NVM) devices have dem...
The recent success of deep neural networks (DNNs) in challenging perception tasks makes them a power...
To achieve faster design closure, there is a need to provide a design framework for the design of Re...
Machine Learning (ML) is making a strong resurgence in tune with the massive generation of unstructu...
The resurgence of machine learning in various applications and it's inherent compute-intensive natur...
In this work we present a 3D dynamic simulation analysis for the reliability evaluation of a decanan...
2018-02-02This thesis is dedicated to improving the efficiency of resilient computing through both a...
The lecture provides an overview of considerations relevant for achieving highly reliable operation ...
Abstract — Computer reliability is a growing concern. High-end processor vendors, as well as academi...
7 pages, 6 figuresDeep Neural Networks (DNNs) enable a wide series of technological advancements, ra...
Emergence of Deep Neural Networks (DNN) has led to a proliferation of artificial intelligence appli...
Deep neural networks have achieved phenomenal successes in vision recognition tasks, which motivate ...
Deep Neural Networks (DNNs) enable a wide series of technological advancements, ranging from clinica...
Aggressive process scaling and increasing demands of performance/cost efficiency have exacerbated th...
Computing-in-memory with emerging non-volatile memory (nvCiM) is shown to be a promising candidate f...
Computing-in-Memory (CiM) architectures based on emerging non-volatile memory (NVM) devices have dem...
The recent success of deep neural networks (DNNs) in challenging perception tasks makes them a power...
To achieve faster design closure, there is a need to provide a design framework for the design of Re...
Machine Learning (ML) is making a strong resurgence in tune with the massive generation of unstructu...
The resurgence of machine learning in various applications and it's inherent compute-intensive natur...
In this work we present a 3D dynamic simulation analysis for the reliability evaluation of a decanan...
2018-02-02This thesis is dedicated to improving the efficiency of resilient computing through both a...
The lecture provides an overview of considerations relevant for achieving highly reliable operation ...