Neural network quantization has become an important research area due to its great impact on the deployment of large models on resource-constrained devices. In order to train networks that can be effectively discretized without loss of performance, we introduce a differentiable quantization procedure. Differentiability can be achieved by transforming continuous distributions over the weights and activations of the network into categorical distributions over the quantization grid. These are subsequently relaxed to continuous surrogates that allow for efficient gradient-based optimization. We further show that stochastic rounding can be seen as a special case of the proposed approach and that under this formulation the quantization grid itself ...
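To make the construction concrete, the following is a minimal NumPy sketch of the idea described above: a smoothing distribution around each continuous weight induces categorical logits over the grid points, and a Gumbel-softmax sample relaxes the categorical draw so the quantized value remains differentiable. The Gaussian smoothing choice, the temperature tau, and all names here are illustrative assumptions, not the paper's exact construction; as tau goes to zero the soft mixture collapses to a hard draw, which is where the stochastic-rounding special case lives.

```python
import numpy as np

def relaxed_quantize(w, grid, sigma=0.1, tau=0.5, rng=None):
    """Differentiable surrogate for quantizing `w` onto `grid` (a sketch).

    Assumptions (not from the source abstract): Gaussian smoothing noise of
    scale `sigma` around each weight, and a Gumbel-softmax relaxation with
    temperature `tau` of the resulting categorical distribution.
    """
    rng = np.random.default_rng() if rng is None else rng
    # Unnormalized log-probability of each grid point under a Gaussian
    # centered at the continuous weight value; shape (..., len(grid)).
    logits = -((w[..., None] - grid) ** 2) / (2.0 * sigma**2)
    # Gumbel noise turns the categorical sample into a reparameterized,
    # differentiable softmax draw (the "concrete" relaxation).
    u = rng.uniform(size=logits.shape)
    gumbel = -np.log(-np.log(u + 1e-12) + 1e-12)
    z = (logits + gumbel) / tau
    z -= z.max(axis=-1, keepdims=True)  # stabilize the softmax
    probs = np.exp(z)
    probs /= probs.sum(axis=-1, keepdims=True)
    # Soft quantized value: a convex combination of grid points that
    # approaches hard (stochastic) rounding as tau -> 0.
    return probs @ grid

grid = np.linspace(-1.0, 1.0, 2**4)  # a 4-bit uniform grid
w = np.array([0.13, -0.42, 0.97])
print(relaxed_quantize(w, grid))
```

In a training loop one would anneal tau toward zero so the surrogate gradually approaches the hard quantizer used at inference time.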
Deep Neural Networks led to major breakthroughs in artificial intelligence. This unreasonable effect...
Quantization of deep neural networks is a common way to optimize the networks for deployment on ener...
Quantization of Convolutional Neural Networks (CNNs) is a common approach to ease the computational ...
We study the dynamics of gradient descent in learning neural networks for classification problems. U...
Compressing large Neural Networks (NN) by quantizing the parameters, while maintaining the performan...
Parallel implementations of stochastic gradient descent (SGD) have received significant research att...
In stochastic gradient descent (SGD) and its variants, the optimized gradient estimators may be as e...
The advancement of deep models poses great challenges to real-world deployment because of the limite...
In this paper, we develop a new algorithm, Annealed Skewed SGD - AskewSGD - for training deep neural...
Spiking and Quantized Neural Networks (NNs) are becoming exceedingly important for hyper-efficient i...
The paper studies a stochastic extension of continuous recurrent neural networks and analyzes gradie...
Quantized neural networks (QNNs), which use low bitwidth numbers for representing parameters and per...
The quantization of deep neural networks (QDNNs) has been actively studied for deployment in edge de...