Fused Multiply-Add (FMA) functional units are a fundamental hardware component for training Deep Neural Networks (DNNs). Their silicon area grows quadratically with the mantissa bit count of the number format, which has motivated the adoption of the BrainFloat16 format (BF16). BF16 features 1 sign bit, 8 exponent bits and 7 explicit mantissa bits. Several approaches to DNN training achieve significant performance benefits by using BF16. However, they must combine BF16 with the standard IEEE 754 32-bit floating-point format (FP32) to reach state-of-the-art training accuracy, which limits the impact of adopting BF16. This article proposes the first approach able to train complex DNNs entirely using the BF16 format. We pro...
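For readers unfamiliar with the layout described above, the following sketch (my own illustration, not code from the article; the function names are hypothetical) shows how BF16 relates to FP32: a BF16 value is simply the upper 16 bits of the corresponding FP32 encoding, which is why it keeps FP32's dynamic range while shrinking the mantissa to 7 explicit bits.

```python
# Minimal sketch of the BF16 layout: 1 sign bit, 8 exponent bits and
# 7 explicit mantissa bits, i.e. the upper half of an IEEE 754 FP32 encoding.
# Function names are illustrative, not taken from the article.
import numpy as np

def fp32_to_bf16_bits(x) -> np.ndarray:
    """Truncate FP32 values to BF16 bit patterns (round toward zero;
    real hardware typically rounds to nearest even)."""
    bits = np.asarray(x, dtype=np.float32).view(np.uint32)
    return (bits >> 16).astype(np.uint16)

def bf16_bits_to_fp32(h) -> np.ndarray:
    """Expand BF16 bit patterns back to FP32 by zero-filling the 16
    low-order mantissa bits that were discarded."""
    return (np.asarray(h, dtype=np.uint16).astype(np.uint32) << 16).view(np.float32)

if __name__ == "__main__":
    x = np.float32(3.14159265)
    b = fp32_to_bf16_bits(x)
    print(f"FP32 {float(x):.8f} -> BF16 0x{int(b):04x} -> FP32 {float(bf16_bits_to_fp32(b)):.8f}")
```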
Graphics Processing Units (GPUs) offer the possibility to execute floating-poi...
When training early-stage deep neural networks (DNNs), generating intermediate features via convolut...
Due to their potential to reduce silicon area or boost throughput, low-precision computations were w...
The unprecedented growth in DNN model complexity, size and the amount of training data have led to a...
Mixed-precision (MP) arithmetic combining both single- and half-precision operands has been successf...
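As a rough illustration of what such mixed-precision arithmetic means in practice (a sketch under my own assumptions, not the scheme proposed in that work), the snippet below stores operands in half precision but performs every multiply-accumulate in single precision, mirroring FMA units that accept half-precision inputs and accumulate in FP32.

```python
# Illustrative mixed-precision dot product: half-precision storage,
# single-precision multiply-accumulate. Not taken from the cited work.
import numpy as np

def mixed_precision_dot(a_fp16: np.ndarray, b_fp16: np.ndarray) -> np.float32:
    acc = np.float32(0.0)
    # Each product is formed and accumulated in FP32, as a mixed-precision
    # FMA unit would do with half-precision inputs.
    for x, y in zip(a_fp16.astype(np.float32), b_fp16.astype(np.float32)):
        acc = np.float32(acc + x * y)
    return acc

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    a = rng.standard_normal(1024).astype(np.float16)
    b = rng.standard_normal(1024).astype(np.float16)
    print(mixed_precision_dot(a, b))
```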
FP8 is a natural progression for accelerating deep learning training and inference beyond the 16-bit for...
Due to limited size, cost and power, embedded devices do not offer the same computational throughput...
Several hardware companies are proposing native Brain Float 16-bit (BF16) support for neural network...
Low-precision formats have recently driven major breakthroughs in neural network (NN) training and i...
The most compute-intensive stage of deep neural network (DNN) training is matr...
Deep Neural Networks (DNNs) have become ubiquitous in a wide range of application domains. Despite t...
Deep neural networks (DNNs) are one of the key fields of machine learning. They require considerable ...
An open challenge in making Internet-of-Things sensor nodes "smart" and self-adaptive is to enable ...