Neural network quantization aims to accelerate and trim full-precision neural network models by using low-bit approximations. Methods adopting the quantization-aware training (QAT) paradigm have recently seen rapid growth, but are often conceptually complicated. This paper proposes a novel and highly effective QAT method, quantized feature distillation (QFD). QFD first trains a quantized (or binarized) representation as the teacher, then quantizes the network using knowledge distillation (KD). Quantitative results show that QFD is more flexible and effective (i.e., quantization friendly) than previous quantization methods. QFD surpasses existing methods by a noticeable margin on not only image classification but also object detection, albe...
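A minimal sketch of the two-stage idea described above, with hypothetical names (distill_step, alpha) that are not taken from the QFD paper: a teacher with quantized features is trained first, and the low-bit student is then trained to match those features alongside the task loss.

    import torch.nn.functional as F

    def distill_step(student_feat, teacher_feat, student_logits, labels, alpha=0.5):
        # teacher_feat is assumed to come from the already-trained quantized teacher.
        feat_loss = F.mse_loss(student_feat, teacher_feat.detach())
        task_loss = F.cross_entropy(student_logits, labels)
        return task_loss + alpha * feat_loss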
Deep Learning is moving to edge devices, ushering in a new age of distributed Artificial Intelligenc...
Knowledge Distillation (KD) is a well-known training paradigm in deep neural networks where knowledg...
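For reference, a minimal sketch of the standard knowledge distillation loss (softened teacher logits plus hard labels); the temperature T and weight alpha are illustrative hyperparameters, not values from the cited work.

    import torch.nn.functional as F

    def kd_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
        # Soft-target term: match the teacher's temperature-softened distribution.
        soft = F.kl_div(
            F.log_softmax(student_logits / T, dim=1),
            F.softmax(teacher_logits / T, dim=1),
            reduction="batchmean",
        ) * (T * T)
        # Hard-target term: ordinary cross-entropy on the labels.
        hard = F.cross_entropy(student_logits, labels)
        return alpha * soft + (1 - alpha) * hard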
Quantization of deep neural networks is essential for efficient implementations. Low-preci...
The quantization of deep neural networks (QDNNs) has been actively studied for deployment in edge de...
Paper number 134 entitled "Evaluating the Use of Interpretable Quantized Convolutional Neural Networ...
Quantization-Aware Training (QAT) has recently shown a lot of potential for l...
Network quantization significantly reduces model inference complexity and has been widely used in re...
We propose methods to train convolutional neural networks (CNNs) with both binarized weights and act...
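A minimal sketch of one common way to binarize weights or activations during training, using a sign function in the forward pass and a straight-through estimator in the backward pass; the exact scheme of the cited method may differ.

    import torch

    class BinarizeSTE(torch.autograd.Function):
        @staticmethod
        def forward(ctx, x):
            ctx.save_for_backward(x)
            return torch.sign(x)  # forward: hard +1/-1 values

        @staticmethod
        def backward(ctx, grad_output):
            (x,) = ctx.saved_tensors
            # Straight-through estimator: pass gradients, zeroed outside [-1, 1].
            return grad_output * (x.abs() <= 1).float()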
In this paper, we propose a fully differentiable quantization method for vision transformer (ViT) na...
Neural network quantization is a highly desired procedure to perform before running neural network...
Quantized neural networks (QNNs), which use low bitwidth numbers for representing parameters and per...
When training neural networks with simulated quantization, we observe that quantized weights can, ra...
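A minimal sketch of simulated ("fake") quantization as commonly used in QAT, included only to make the setting concrete: full-precision latent weights are updated while the forward pass sees their rounded low-bit counterparts, with gradients passed straight through. Parameter names are illustrative.

    import torch

    def simulated_quantize(w, num_bits=8):
        # Symmetric uniform quantization to a (2^b - 1)-level grid.
        qmax = 2 ** (num_bits - 1) - 1
        scale = w.detach().abs().max().clamp(min=1e-8) / qmax
        w_q = torch.round(w / scale).clamp(-qmax, qmax) * scale
        # Straight-through estimator: forward uses w_q, backward acts as identity.
        return w + (w_q - w).detach()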
At present, quantization methods for neural network models are mainly divided into post-trainin...
Quantization of deep neural networks is a common way to optimize the networks for deployment on ener...