The unprecedented growth in DNN model complexity, size, and the amount of training data has led to a commensurate increase in demand for computing and a search for minimal encoding. Recent research advocates Hybrid Block Floating-Point (HBFP) as a technique that minimizes silicon provisioning in accelerators by converting the majority of arithmetic operations in training to 8-bit fixed-point. In this paper, we perform a full-scale exploration of the HBFP design space, including minimal mantissa encoding, varying block sizes, and mixed mantissa bit-widths across layers and epochs. We propose Accuracy Boosters, an epoch-driven mixed-mantissa HBFP technique that uses 6-bit mantissas only in the last epoch and converts $99.7\%$ of all arithmetic operations ...
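As a rough illustration of the arithmetic HBFP relies on, the NumPy sketch below quantizes a tensor into blocks that share a single exponent while each value keeps a narrow signed-integer mantissa. This is only a numerical emulation under assumed parameters (`block_size`, `mantissa_bits`, and the function name `bfp_quantize` are illustrative, not taken from the paper), and it omits the FP32 accumulation and other pipeline details of actual HBFP training.

```python
import numpy as np

def bfp_quantize(x, block_size=256, mantissa_bits=8):
    """Emulate block floating-point: each block of `block_size` values
    shares one exponent; each value keeps a signed `mantissa_bits`-bit
    integer mantissa. Illustrative sketch, not the paper's implementation."""
    flat = np.asarray(x, dtype=np.float32).reshape(-1)
    pad = (-flat.size) % block_size
    blocks = np.pad(flat, (0, pad)).reshape(-1, block_size)

    # Shared exponent per block, chosen so the largest magnitude in the
    # block fits inside the signed mantissa range after scaling.
    max_abs = np.abs(blocks).max(axis=1, keepdims=True)
    exp = np.floor(np.log2(np.maximum(max_abs, np.finfo(np.float32).tiny))) + 1

    # One scale per block; mantissas span [-(2^(m-1)), 2^(m-1) - 1].
    scale = 2.0 ** (exp - (mantissa_bits - 1))
    q = np.clip(np.round(blocks / scale),
                -(2 ** (mantissa_bits - 1)), 2 ** (mantissa_bits - 1) - 1)

    # Dequantize back to float to emulate the format inside an FP32 pipeline.
    return (q * scale).reshape(-1)[:x.size].reshape(x.shape)

# Example: emulate quantizing a weight tensor before a matrix multiplication.
w = np.random.randn(1024, 1024).astype(np.float32)
w_hbfp = bfp_quantize(w, block_size=256, mantissa_bits=8)
```

Under this kind of emulation, sweeping `block_size` and `mantissa_bits` (e.g., 8-bit versus 6-bit mantissas in different epochs) is one way to explore the accuracy trade-offs the paper studies, while the bulk of multiply-accumulate work stays in narrow fixed-point.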
When training early-stage deep neural networks (DNNs), generating intermediate features via convolut...
The acceleration of deep-learning kernels in hardware relies on matrix multiplications that are exec...
This thesis presents FPRaker, a processing element for composing training accelerators. Training manipulates...
Due to limited size, cost and power, embedded devices do not offer the same computational throughput...
Fused Multiply-Add (FMA) functional units constitute a fundamental hardware component to train Deep ...
The most compute-intensive stage of deep neural network (DNN) training is matr...
Mixed-precision (MP) arithmetic combining both single- and half-precision operands has been successf...
The amounts of data that need to be transmitted, processed, and stored by the modern deep neural net...
Analog mixed-signal (AMS) devices promise faster, more energy-efficient deep neural network (DNN) in...
Low-precision formats have recently driven major breakthroughs in neural network (NN) training and i...
Low-precision formats have recently driven major breakthroughs in neural network (NN) training and i...
FP8 is a natural progression for accelerating deep learning training and inference beyond the 16-bit for...
Several hardware companies are proposing native Brain Float 16-bit (BF16) support for neural network...
The rapid growth of artificial intelligence and deep learning in recent years has led to significant...
Traditional optimization methods rely on the use of single-precision floating point arithmetic, whic...