To bridge the ever-increasing gap between deep neural networks' complexity and hardware capability, network quantization has attracted more and more research attention. The latest trend of mixed precision quantization takes advantage of hardware's multiple bit-width arithmetic operations to unleash the full potential of network quantization. However, existing approaches rely heavily on an extremely time-consuming search process and various relaxations when seeking the optimal bit configuration. To address this issue, we propose to optimize a proxy metric of network orthogonality that can be efficiently solved with linear programming, which proves to be highly correlated with quantized model accuracy and bit-width. Our approach significantly...
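The abstract above describes allocating bit-widths by solving a linear program over a per-layer proxy metric. Below is a minimal, hypothetical sketch of that general idea, not the paper's actual formulation: the importance scores `theta`, the per-layer parameter counts, the size budget, and the 2–8 bit range are all illustrative placeholders, and `scipy.optimize.linprog` with integrality constraints stands in for whatever solver the authors use.

```python
# Sketch: choose integer bit-widths b_i that maximize a per-layer importance
# proxy, subject to a model-size budget. All numbers are illustrative.
import numpy as np
from scipy.optimize import linprog

theta = np.array([0.9, 0.5, 0.7, 0.3])   # per-layer importance proxy (assumed values)
params = np.array([1e5, 5e5, 5e5, 1e6])  # parameter count per layer (assumed values)
budget_bytes = 0.6 * params.sum()        # ~60% of the 8-bit model size (assumed budget)

# Maximize sum_i theta_i * b_i  <=>  minimize -theta . b,
# s.t. sum_i params_i * b_i / 8 <= budget_bytes,  2 <= b_i <= 8,  b_i integer.
res = linprog(
    c=-theta,
    A_ub=[params / 8.0],                 # bytes consumed per bit of each layer
    b_ub=[budget_bytes],
    bounds=[(2, 8)] * len(theta),
    integrality=np.ones(len(theta)),     # request integer solutions (HiGHS backend)
    method="highs",
)
print("per-layer bit-widths:", res.x.round().astype(int))
```

As expected for this kind of objective, the solver spends its byte budget on the layers with the highest importance per parameter, which is what makes the allocation fast compared with a search over the exponential bit-configuration space.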
Quantization of neural networks has been one of the most popular techniques to compress models for e...
At present, quantization methods for neural network models are mainly divided into post-trainin...
Network quantization is an effective solution to compress deep neural networks for practical usage. ...
The exponentially large discrete search space in mixed-precision quantization (MPQ) makes it hard to...
Model quantization helps to reduce model size and latency of deep neural networks. Mixed precision q...
Quantization of deep neural networks is a common way to optimize the networks for deployment on ener...
Quantized neural networks are well known for reducing latency, power consumption, and model size wit...
Mixed-precision quantization, where a deep neural network's layers are quantized to different precis...
The severe on-chip memory limitations are currently preventing the deployment of the most accurate D...
Recently, there has been a push to perform deep learning (DL) computations on the edge rather than t...
We introduce a Power-of-Two low-bit post-training quantization (PTQ) method for deep neural network t...
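For context on the technique this abstract names, here is a generic illustration of power-of-two weight quantization, not the specific method proposed above: each weight is rounded to the nearest signed power of two, so multiplications can become bit-shifts on integer hardware. The exponent range and clipping below are assumptions for the sketch.

```python
# Generic power-of-two quantization: map each weight to sign(w) * 2^k with k
# in an assumed exponent range; values outside the range are clipped.
import numpy as np

def quantize_pow2(w, exp_min=-6, exp_max=0):
    """Round each weight to the nearest signed power of two in [2^exp_min, 2^exp_max]."""
    sign = np.sign(w)
    mag = np.clip(np.abs(w), 2.0**exp_min, 2.0**exp_max)
    exp = np.clip(np.round(np.log2(mag)), exp_min, exp_max)
    return sign * 2.0**exp

w = np.array([0.07, -0.4, 0.9, -0.002])
print(quantize_pow2(w))  # -> [ 0.0625 -0.5     1.     -0.015625]
```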
Neural networks are increasingly being used as components in safety-critical applications, for insta...
Quantization is a promising approach for reducing the inference time and memory footprint of neural ...
We consider the post-training quantization problem, which discretizes the weights of pre-trained dee...