Emerging deep learning workloads urgently need fast general matrix multiplication (GEMM). To meet such demand, one of the critical features of machine-learning-specific accelerators such as NVIDIA Tensor Cores, AMD Matrix Cores, and Google TPUs is the support of mixed-precision enabled GEMM. For DNN models, lower-precision FP data formats and computation offer acceptable correctness but significant performance, area, and memory footprint improvement. While promising, the mixed-precision computation on error resilience remains unexplored. To this end, we develop a fault injection framework that systematically injects fault into the mixed-precision computation results. We investigate how the faults affect the accuracy of machine learning appl...
The great quest for adopting AI-based computation for safety-/mission-critical applications motivate...
Virtual platform frameworks have been extended to allow earlier soft error analysis of more realisti...
The generic matrix multiply (GEMM) function is the core element of high-performance linear algebra l...
International audienceGraphics Processing Units (GPUs) offer the possibility to execute floating-poi...
As Machine Learning (ML) has seen increasing adoption in safety-critical domains (e.g., autonomous v...
With the massive adoption of machine learning (ML) applications in HPC domains, the reliability of M...
Dependable computing on unreliable substrates is the next challenge the computing community needs to...
Nowadays, due to technology enhancement, faults are increasingly compromising all kinds of computing...
Due to limited size, cost and power, embedded devices do not offer the same computational throughput...
Deep learning technology has enabled the development of increasingly complex safety-related autonomo...
Deep learning technology has enabled the development of increasingly complex safety-related autonomo...
Vision Transformers (ViTs) with outstanding performance becomes a popular backbone of deep learning ...
Proceeding of: 31th European Symposium on Reliability of Electron Devices, Failure Physics and Analy...
In recent years, there has been a surge in demand for intelligent applications. These emerging appli...
The convergence of artificial intelligence, high-performance computing (HPC), and data science bring...
The great quest for adopting AI-based computation for safety-/mission-critical applications motivate...
Virtual platform frameworks have been extended to allow earlier soft error analysis of more realisti...
The generic matrix multiply (GEMM) function is the core element of high-performance linear algebra l...
International audienceGraphics Processing Units (GPUs) offer the possibility to execute floating-poi...
As Machine Learning (ML) has seen increasing adoption in safety-critical domains (e.g., autonomous v...
With the massive adoption of machine learning (ML) applications in HPC domains, the reliability of M...
Dependable computing on unreliable substrates is the next challenge the computing community needs to...
Nowadays, due to technology enhancement, faults are increasingly compromising all kinds of computing...
Due to limited size, cost and power, embedded devices do not offer the same computational throughput...
Deep learning technology has enabled the development of increasingly complex safety-related autonomo...
Deep learning technology has enabled the development of increasingly complex safety-related autonomo...
Vision Transformers (ViTs) with outstanding performance becomes a popular backbone of deep learning ...
Proceeding of: 31th European Symposium on Reliability of Electron Devices, Failure Physics and Analy...
In recent years, there has been a surge in demand for intelligent applications. These emerging appli...
The convergence of artificial intelligence, high-performance computing (HPC), and data science bring...
The great quest for adopting AI-based computation for safety-/mission-critical applications motivate...
Virtual platform frameworks have been extended to allow earlier soft error analysis of more realisti...
The generic matrix multiply (GEMM) function is the core element of high-performance linear algebra l...