Resource requirements for hardware acceleration of neural network inference are notoriously high, both in terms of computation and storage. One way to mitigate this issue is to quantize parameters and activations. This is usually done by scaling and centering the distributions of weights and activations, on a kernel-per-kernel basis, so that a low-precision binary integer representation can be used. This work studies the low-precision logarithmic number system (LNS) as an efficient alternative. Firstly, LNS offers a wider dynamic range than fixed-point for the same number of bits. Thus, when quantizing MNIST and CIFAR reference networks without retraining, the smallest format size achieving top-1 accuracy comparable to floating-point ...
Low-precision neural network models are crucial for reducing the memory footprint and computational...
A low cost, high-speed architecture for the computation of the binary logarithm is proposed. It is b...
The most compute-intensive stage of deep neural network (DNN) training is matr...
Convolutional Neural Networks (CNN) have become a popular solution for computer vision problems. How...
Economical hardware often uses a FiXed-point Number System (FXNS), whose const...
The ever-growing cost of both training and inference for state-of-the-art neur...
For energy efficiency, many low-bit quantization methods for deep neural networks (DNNs) have been p...
Low-precision neural networks represent both weights and activations with few bits, drastically redu...
Energy-efficient computing and ultra-low-power operation are requirements for many application areas...
The increase in sophistication of neural network models in recent years has exponentially expanded m...
Today, computer vision (CV) problems are solved with unprecedented accuracy using convolutional neur...
Research has shown that deep neural networks contain significant redundancy, and thus that high clas...
We present the implementation of binary and ternary neural networks in the hls4ml library, designed ...