The most compute-intensive stage of deep neural network (DNN) training is matrix multiplication, where the multiply-accumulate (MAC) operator is key. To reduce training costs, we consider using low-precision arithmetic for MAC operations. While low-precision training has been investigated in prior work, the focus has been on reducing the number of bits in weights or activations without compromising accuracy. In contrast, the focus in this paper is on implementation details beyond weight or activation width that affect area and accuracy. In particular, we investigate the impact of fixed- versus floating-point representations, multiplier rounding, and floating-point exceptional value support. Results suggest that (1) low-pre...
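As a rough illustration of the kind of implementation detail this abstract enumerates (a sketch for exposition, not the paper's actual design), a fixed-point MAC with a selectable multiplier rounding mode could look like the following; the function name, bit widths, and rounding options are assumptions:

    def fixed_point_mac(acc, a, b, frac_bits=8, rounding="nearest"):
        """One fixed-point multiply-accumulate step (illustrative sketch).

        acc, a, b are integers holding real values scaled by 2**frac_bits.
        The exact double-width product a*b carries 2*frac_bits fractional
        bits; it is narrowed back to frac_bits either by round-to-nearest
        or by truncation, the two multiplier rounding choices this line of
        work compares.
        """
        prod = a * b  # exact double-width product
        if rounding == "nearest":
            prod = (prod + (1 << (frac_bits - 1))) >> frac_bits
        else:  # "truncate": simply drop the low-order bits
            prod >>= frac_bits
        return acc + prod

    scale = 1 << 8  # 8 fractional bits
    acc = fixed_point_mac(0, int(1.5 * scale), int(2.25 * scale))
    print(acc / scale)  # 3.375 (= 1.5 * 2.25), up to rounding

Truncation saves the adder that round-to-nearest needs, which is exactly the sort of area-versus-accuracy trade-off the abstract points at.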
Due to their potential to reduce silicon area or boost throughput, low-precision computations were w...
DNNs have been finding a growing number of applications including image classification, speech recog...
Resource requirements for hardware acceleration of neural network inference i...
Due to limited size, cost and power, embedded devices do not offer the same computational throughput...
Hardware accelerators for Deep Neural Networks (DNNs) that use reduced precision parameters are more...
Mixed-precision (MP) arithmetic combining both single- and half-precision operands has been successf...
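As a minimal sketch of what such mixed-precision MAC arithmetic means in practice (an illustration under assumed conventions, not the scheme of any particular paper listed here), half-precision products can be widened and accumulated in single precision:

    import numpy as np

    def mp_dot(a16, b16):
        # Mixed-precision dot product: float16 operands, float32 accumulator.
        # Widening each product before accumulation limits rounding-error
        # growth over long sums, the usual motivation for MP MAC units.
        acc = np.float32(0.0)
        for x, y in zip(a16, b16):
            acc += np.float32(x) * np.float32(y)
        return acc

    a = np.array([0.1, 0.2, 0.3], dtype=np.float16)
    b = np.array([1.0, 2.0, 3.0], dtype=np.float16)
    print(mp_dot(a, b))  # approximately 1.4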
Training large-scale deep neural networks (DNNs) currently requires a significant amount of energy, ...
Low-precision formats have recently driven major breakthroughs in neural network (NN) training and i...
The unprecedented growth in DNN model complexity, size and the amount of training data has led to a...
The current trend in deep learning has come with an enormous computational need for billions of Mul...
Large-scale convolutional neural networks (CNNs) suffer from very long training times, spanning from...
Graphics Processing Units (GPUs) offer the possibility to execute floating-poi...
When training early-stage deep neural networks (DNNs), generating intermediate features via convolut...