Adaptive Quantization for Deep Neural Network

Zhou, Yiren
Moosavi-Dezfooli, Seyed-Mohsen
Cheung, Ngai-Man
Frossard, Pascal

Publication date

April 2018

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Abstract

In recent years, Deep Neural Networks (DNNs) have been rapidly developed in various applications, together with increasingly complex architectures. The performance gain of these DNNs generally comes with high computational costs and large memory consumption, which may not be affordable for mobile platforms. Deep model quantization can be used to reduce the computation and memory costs of DNNs and to deploy complex DNNs on mobile equipment. In this work, we propose an optimization framework for deep model quantization. First, we propose a measurement to estimate the effect of parameter quantization errors in individual layers on the overall model prediction accuracy. Then, we propose an optimization process based on this measurement for f...
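As a rough illustration of the general idea of per-layer quantization error, the sketch below applies plain uniform quantization to two synthetic weight lists at different bit-widths and reports the mean squared error per layer. This is not the measurement or the optimization process proposed in the paper, which relates quantization error to prediction accuracy; the weight-space error used here is only a stand-in, and the layer names, weight distributions, and function names are all assumptions made for the example.

```python
import random


def quantize_uniform(weights, bits):
    """Round each weight to the nearest of 2**bits evenly spaced levels
    spanning [min(weights), max(weights)]. Illustrative only."""
    if bits < 1:
        raise ValueError("bits must be at least 1")
    lo, hi = min(weights), max(weights)
    if hi == lo:
        # All weights are identical, so quantization changes nothing.
        return list(weights)
    step = (hi - lo) / (2 ** bits - 1)
    return [lo + round((w - lo) / step) * step for w in weights]


def mean_squared_error(a, b):
    """Mean squared difference between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def layer_errors(layers, bit_widths):
    """Weight-space quantization error per layer for a given
    per-layer bit-width assignment (a stand-in metric, see lead-in)."""
    return {
        name: mean_squared_error(w, quantize_uniform(w, bit_widths[name]))
        for name, w in layers.items()
    }


# Hypothetical layers with synthetic Gaussian weights (fixed seed).
rng = random.Random(0)
layers = {
    "conv1": [rng.gauss(0.0, 1.0) for _ in range(1000)],
    "fc1": [rng.gauss(0.0, 0.1) for _ in range(1000)],
}

errors_8 = layer_errors(layers, {"conv1": 8, "fc1": 8})
errors_mixed = layer_errors(layers, {"conv1": 8, "fc1": 4})

for name in layers:
    print(name, errors_8[name], errors_mixed[name])
```

Lowering the bit-width of one layer raises that layer's error while leaving the others unchanged, which is the kind of per-layer trade-off an adaptive scheme would have to weigh against the memory saved.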


