To bridge the ever-increasing gap between deep neural networks' complexity and hardware capability, network quantization has attracted growing research attention. The latest trend, mixed-precision quantization, exploits hardware's multiple bit-width arithmetic operations to unleash the full potential of network quantization. However, this also results in a difficult integer programming formulation and forces most existing approaches into an extremely time-consuming search process, even with various relaxations. Instead of solving the original integer programming problem, we propose to optimize a proxy metric, the concept of network orthogonality, which is highly correlated with the loss of the integer programming but...
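All of the abstracts collected here build on the same basic operation: mapping full-precision weights to a small set of integer levels. As a minimal sketch (not any specific paper's method), symmetric uniform per-tensor quantization can be written as follows; the function name and the choice of bit-widths are illustrative assumptions:

```python
import numpy as np

def uniform_quantize(w, bits):
    """Symmetric uniform quantization of a weight tensor to `bits` bits.

    Illustrative sketch: per-tensor scale, round-to-nearest, then dequantize
    so the error against the original weights can be measured directly.
    """
    qmax = 2 ** (bits - 1) - 1                          # e.g. 127 for 8-bit
    scale = np.max(np.abs(w)) / qmax                    # per-tensor scale factor
    q = np.clip(np.round(w / scale), -qmax - 1, qmax)   # integer codes
    return q * scale                                    # dequantized approximation

# Lower bit-widths trade accuracy (higher MSE) for smaller, faster models,
# which is exactly the tension mixed-precision quantization tries to balance.
w = np.random.randn(64)
for b in (8, 4, 2):
    err = np.mean((w - uniform_quantize(w, b)) ** 2)
    print(f"{b}-bit MSE: {err:.6f}")
```

Mixed-precision methods then assign a different `bits` value per layer instead of one global setting.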
Model quantization is a widely used technique to compress and accelerate deep neural netwo...
At present, quantization methods for neural network models are mainly divided into post-trainin...
Neural networks are increasingly being used as components in safety-critical applications, for insta...
The exponentially large discrete search space in mixed-precision quantization (MPQ) makes it hard to...
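The "exponentially large discrete search space" mentioned above is easy to make concrete: with one bit-width choice per layer, the number of assignments grows as (choices)^(layers). A back-of-envelope sketch, with illustrative numbers:

```python
# One bit-width choice per layer; both values below are illustrative assumptions.
bit_choices = [2, 4, 8]   # candidate bit-widths per layer
num_layers = 50           # roughly a ResNet-50-scale network

# Every layer picks independently, so the space is |choices| ** layers.
num_configs = len(bit_choices) ** num_layers
print(f"{num_configs:.3e} possible assignments")  # ~7.18e23
```

This is why exhaustive search is infeasible and the papers above resort to relaxations, proxies, or learned policies.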
The severe on-chip memory limitations are currently preventing the deployment of the most accurate D...
Quantized neural networks are well known for reducing latency, power consumption, and model size wit...
Quantization emerges as one of the most promising approaches for deploying advanced deep models on r...
Model quantization helps to reduce model size and latency of deep neural networks. Mixed precision q...
Quantization of deep neural networks is a common way to optimize the networks for deployment on ener...
While neural networks have been remarkably successful in a wide array of applications, implementing ...
Recently, there has been a push to perform deep learning (DL) computations on the edge rather than t...
Mixed-precision quantization, where a deep neural network's layers are quantized to different precis...
The deployment of Quantized Neural Networks (QNN) on advanced microcontrollers requires optimized so...
Deep Neural Network (DNN) inference based on quantized narrow-precision integer data represents a pr...