Graphics Processing Units (GPUs) offer the possibility to execute floating-point operations (FLOP) with mixed precisions such as INT8, FP16, Bfloat, FP32, and FP64. For Deep Neural Networks (DNNs), a reduced precision is likely to lower the execution time and power consumption, as it requires a smaller hardware area and fewer clock cycles to perform instructions than the standard FP32 and FP64 precisions. As less area is needed for reduced precision, the circuit error rate is also expected to be lower [1]. NVIDIA GPUs also have tensor cores that perform matrix multiplication in hardware. The tensor cores are capable of performing a 4×4 FP16 matrix multiplication in one clock cycle [2]. The tensor cores can deliver up to 9...
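The mixed-precision scheme the abstract describes — FP16 inputs multiplied together, with products accumulated at higher (FP32) precision, as tensor cores do for a 4×4 tile — can be emulated numerically. A minimal sketch, assuming NumPy only (the tile size and data are illustrative, not from the papers above):

```python
import numpy as np

# Illustrative 4x4 FP16 input tiles (tensor-core-style tile size).
rng = np.random.default_rng(0)
a = rng.standard_normal((4, 4)).astype(np.float16)
b = rng.standard_normal((4, 4)).astype(np.float16)

# Emulate mixed precision: each FP16 product is formed and
# accumulated in FP32 (a product of two 11-bit significands
# fits exactly in FP32's 24-bit significand).
c = np.zeros((4, 4), dtype=np.float32)
for i in range(4):
    for j in range(4):
        for k in range(4):
            c[i, j] += np.float32(a[i, k]) * np.float32(b[k, j])

# Full-FP32 reference for comparison.
ref = a.astype(np.float32) @ b.astype(np.float32)
print(np.max(np.abs(c - ref)))
```

The printed residual should be tiny (summation order aside), illustrating why FP32 accumulation recovers most of the accuracy lost to FP16 storage; the precision loss in this scheme comes from rounding the inputs to FP16, not from the multiply-accumulate itself.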
Mixed-precision (MP) arithmetic combining both single- and half-precision operands has been successf...
In recent years, deep neural networks (DNN) have become one of the most powerful tools in machine le...
Currently, deep learning, and especially Convolutional Neural Networks (CNNs), have become a fundament...
Due to limited size, cost and power, embedded devices do not offer the same computational throughput...
The resurgence of machine learning in various applications and its inherent compute-intensive natur...
Recently, there has been a push to perform deep learning (DL) computations on the edge rather than t...
Emerging deep learning workloads urgently need fast general matrix multiplication (GEMM). To meet su...
Currently, Deep Neural Networks (DNNs) are fundamental computational structures deployed in a wide ...
Several hardware companies are proposing native Brain Float 16-bit (BF16) support for neural network...
The reliability evaluation of Deep Neural Networks (DNNs) executed on Graphic ...
Deep Neural Networks (DNN) represent a performance-hungry application. Floatin...
Duplication with Comparison (DWC) is an effective software-level solution to improve the reliability...
Deep neural networks have achieved phenomenal successes in vision recognition tasks, which motivate ...
The most compute-intensive stage of deep neural network (DNN) training is matr...