Recent years have seen intensive research interest in training deep neural networks (DNNs) more efficiently via quantization-based compression, which aids DNN training in two ways: (1) activations are quantized to shrink memory consumption, and (2) gradients are quantized to reduce communication cost. However, existing methods mostly use a uniform mechanism that quantizes values evenly. Such a scheme can incur a large quantization variance and slow convergence in practice. In this work, we introduce TinyScript, which applies a non-uniform quantization algorithm to both activations and gradients. TinyScript models the original values with a family of Weibull distributions and searches for "quantizatio...
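The contrast between uniform and non-uniform quantization can be illustrated with a minimal sketch. The code below is not TinyScript's algorithm: its "knob" search is only approximated by a generic Lloyd-Max iteration, and the Weibull shape parameter (1.5) and 4-bit level count are arbitrary assumptions made for illustration. It compares evenly spaced levels against levels fit to Weibull-distributed values and reports the quantization error of each.

# Minimal sketch (not the paper's exact method): uniform vs. non-uniform
# quantization of Weibull-distributed values. Requires NumPy and SciPy.
import numpy as np
from scipy.stats import weibull_min

def uniform_levels(x, num_levels):
    # Evenly spaced levels over the observed range (the "uniform mechanism").
    return np.linspace(x.min(), x.max(), num_levels)

def lloyd_max_levels(x, num_levels, iters=50):
    # Non-uniform levels via Lloyd-Max: alternately assign each sample to its
    # nearest level, then move each level to the mean of its assigned samples.
    # This stands in for a variance-minimizing level search; it is not the
    # TinyScript procedure.
    levels = np.quantile(x, np.linspace(0.0, 1.0, num_levels))
    for _ in range(iters):
        idx = np.abs(x[:, None] - levels[None, :]).argmin(axis=1)
        for k in range(num_levels):
            if np.any(idx == k):
                levels[k] = x[idx == k].mean()
    return np.sort(levels)

def quantize(x, levels):
    # Map every value to its nearest quantization level.
    idx = np.abs(x[:, None] - levels[None, :]).argmin(axis=1)
    return levels[idx]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic "activations" drawn from an assumed Weibull distribution.
    x = weibull_min(c=1.5, scale=1.0).rvs(size=50_000, random_state=rng)
    for name, levels in [("uniform", uniform_levels(x, 16)),
                         ("non-uniform", lloyd_max_levels(x, 16))]:
        mse = np.mean((x - quantize(x, levels)) ** 2)
        print(f"{name:11s} 4-bit quantization MSE: {mse:.5f}")

Because Weibull-distributed values concentrate mass near the mode, levels placed non-uniformly typically yield a noticeably lower quantization error than evenly spaced levels at the same bit width, which is the intuition behind preferring a distribution-aware scheme.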
Quantized neural networks typically require smaller memory footprints and lower computation complexi...
Deep neural networks (DNNs) continue to make significant advances, solving tasks from image classifi...
We investigate the compression of deep neural networks by quantizing their weights and activations i...
The increase in sophistication of neural network models in recent years has exponentially expanded m...
We study the dynamics of gradient descent in learning neural networks for classification problems. U...
Quantization has shown stunning efficiency on deep neural networks, especially for portable devices w...
This work considers a challenging Deep Neural Network (DNN) quantization task that seeks to train qua...
When training neural networks with simulated quantization, we observe that quantized weights can, ra...
While deep neural networks are a highly successful model class, their large memory footprint puts co...
Quantization of deep neural networks is essential for efficient implementations. Low-preci...
The advancement of deep models poses great challenges to real-world deployment because of the limite...
Network quantization is an effective solution to compress deep neural networks for practical usage. ...
With numerous breakthroughs over the past several years, deep learning (DL) techniques have transfor...
Neural network quantization has become an important research area due to its great impact on deploym...