Post-training quantization (PTQ) can reduce the memory footprint and latency of deep model inference while preserving model accuracy, using only a small unlabeled calibration set and without retraining on the full training set. To calibrate a quantized model, current PTQ methods usually select unlabeled data at random from the training set as calibration data. However, we prove that random data selection results in performance instability and degradation due to activation distribution mismatch. In this paper, we address the crucial task of optimal calibration data selection and propose a novel one-shot calibration data selection method, termed SelectQ, which selects specific data for calibration via ...
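For context, a minimal sketch of the baseline procedure the abstract describes (random selection of unlabeled calibration data, then collecting per-layer activation statistics), assuming a PyTorch model, a dataset that yields (input, label) pairs, and hypothetical helper names; this is not the SelectQ method itself:

```python
import torch

def random_calibration_set(dataset, num_samples=128, seed=0):
    # Baseline: draw unlabeled samples uniformly at random from the
    # training set to serve as calibration data (labels are discarded).
    g = torch.Generator().manual_seed(seed)
    idx = torch.randperm(len(dataset), generator=g)[:num_samples]
    return [dataset[i][0] for i in idx]

@torch.no_grad()
def calibrate_activation_range(model, layer, calib_inputs):
    # Record min/max of the chosen layer's activations over the calibration
    # set; these statistics determine the quantization scale for that layer.
    acts = []
    handle = layer.register_forward_hook(lambda m, i, o: acts.append(o.detach()))
    for x in calib_inputs:
        model(x.unsqueeze(0))
    handle.remove()
    a = torch.cat([t.flatten() for t in acts])
    return a.min().item(), a.max().item()
```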
Data clipping is crucial in reducing noise in quantization operations and improving the achievable a...
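A minimal sketch of symmetric uniform quantization with clipping, illustrating the trade-off this abstract refers to (saturating outliers versus finer rounding for the bulk of values); the function name, `clip_val`, and the bit-width are illustrative only:

```python
import torch

def clip_and_quantize(x, clip_val, num_bits=8):
    # Clip to [-clip_val, clip_val]: outliers are saturated (clipping noise),
    # but the remaining range is covered by finer quantization steps,
    # which reduces rounding noise for most values.
    x_c = torch.clamp(x, -clip_val, clip_val)
    scale = (2 * clip_val) / (2 ** num_bits - 1)   # uniform step size
    q = torch.round(x_c / scale)                   # integer grid
    return q * scale                               # dequantized tensor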
While neural networks have been remarkably successful in a wide array of applications, implementing ...
Machine learning offers an exciting opportunity to improve the calibration of nearly all reconstruct...
At present, quantization methods for neural network models are mainly divided into post-trainin...
Network quantization has emerged as a promising method for model compression and inference accelerat...
While post-training quantization is popular mostly because it avoids accessing the orig...
We introduce a Power-of-Two low-bit post-training quantization (PTQ) method for deep neural network t...
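A hedged sketch of the general power-of-two idea (not the specific method of this paper): the quantization scale is rounded to the nearest power of two so rescaling can be implemented as a bit shift on integer hardware; names and the bit-width are placeholders:

```python
import torch

def power_of_two_quantize(w, num_bits=4):
    # Restrict the scale factor to a power of two so that multiplying or
    # dividing by it reduces to a bit shift in fixed-point arithmetic.
    qmax = 2 ** (num_bits - 1) - 1
    scale = w.abs().max() / qmax
    scale_pot = 2.0 ** torch.round(torch.log2(scale))
    q = torch.clamp(torch.round(w / scale_pot), -qmax - 1, qmax)
    return q, scale_pot   # integer codes and power-of-two scale
```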
Deep Neural Networks (DNNs) and Convolutional Neural Networks (CNNs) are useful for many practical t...
Learning probabilistic classification and prediction models that generate accurate probabilities is ...
Data-free quantization is a task that compresses the neural network to low bit-width without access ...
The great success of deep learning heavily relies on increasingly larger training data, which comes ...
The quantization of deep neural networks (QDNNs) has been actively studied for deployment in edge de...
Integer 8-bit precision weights for the Resnet-50 v1.5 PyTorch deep learning model. Created with th...
We consider the post-training quantization problem, which discretizes the weights of pre-trained dee...
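As a minimal sketch of what such weight discretization can look like, assuming uniform affine quantization with per-output-channel scales (the helper name and bit-width are illustrative, not the paper's formulation):

```python
import torch

def quantize_weights_per_channel(w, num_bits=8):
    # Discretize a pre-trained weight tensor onto an integer grid,
    # with one scale/zero-point pair per output channel (dim 0).
    qmin, qmax = 0, 2 ** num_bits - 1
    w_flat = w.reshape(w.shape[0], -1)
    w_min = w_flat.min(dim=1, keepdim=True).values
    w_max = w_flat.max(dim=1, keepdim=True).values
    scale = (w_max - w_min).clamp(min=1e-8) / (qmax - qmin)
    zero_point = torch.round(-w_min / scale)
    q = torch.clamp(torch.round(w_flat / scale) + zero_point, qmin, qmax)
    w_hat = (q - zero_point) * scale   # dequantized weights
    return w_hat.reshape(w.shape), q.to(torch.uint8), scale, zero_point
```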
Quantization is a promising approach for reducing the inference time and memory footprint of neural ...