Harmonious Coexistence of Structured Weight Pruning and Ternarization for Deep Neural Networks

Yang, Li
He, Zhezhi
Fan, Deliang

Open link

Publication date

April 2020

DOI

10.1609/aaai.v34i04.6138

Publisher

Association for the Advancement of Artificial Intelligence

Abstract

Deep convolutional neural network (DNN) has demonstrated phenomenal success and been widely used in many computer vision tasks. However, its enormous model size and high computing complexity prohibits its wide deployment into resource limited embedded system, such as FPGA and mGPU. As the two most widely adopted model compression techniques, weight pruning and quantization compress DNN model through introducing weight sparsity (i.e., forcing partial weights as zeros) and quantizing weights into limited bit-width values, respectively. Although there are works attempting to combine the weight pruning and quantization, we still observe disharmony between weight pruning and quantization, especially when more aggressive compression schemes (e.g....

Extracted data

We use cookies to provide a better user experience.

Data Protection

Harmonious Coexistence of Structured Weight Pruning and Ternarization for Deep Neural Networks

Abstract

Extracted data

Harmonious Coexistence of Structured Weight Pruning and Ternarization for Deep Neural Networks

Abstract

Extracted data

Related items

Related items