While neural networks have been remarkably successful in a wide array of applications, implementing them on resource-constrained hardware remains an area of intense research. Replacing the weights of a neural network with quantized (e.g., 4-bit or binary) counterparts yields massive savings in computation cost, memory, and power consumption. To that end, we generalize a post-training neural-network quantization method, GPFQ, that is based on a greedy path-following mechanism. Among other things, we propose modifications to promote sparsity of the weights, and rigorously analyze the associated error. Additionally, our error analysis expands the results of previous work on GPFQ to handle general quantization alphabets, showing that...
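To make the greedy path-following idea concrete, the sketch below quantizes the weights of a single neuron one at a time, choosing each quantized weight from a fixed alphabet so as to best cancel the error accumulated so far on the neuron's pre-activations. This is a minimal illustrative sketch, not the paper's precise algorithm: the function name, the uniform alphabet construction, and the exact rounding rule are assumptions, and the sparsity-promoting modification is only indicated in a comment.

```python
# Sketch of a greedy path-following quantization step for one neuron.
# Assumed/illustrative names: gpfq_quantize_neuron, X (input samples), w (weights).
import numpy as np

def gpfq_quantize_neuron(w, X, alphabet):
    """Quantize weights w (shape [d]) of a single neuron using samples X (shape [m, d]).

    Weights are processed in order; each quantized weight is the alphabet element
    that best compensates the running error in the pre-activation space X @ w.
    """
    m, d = X.shape
    q = np.zeros(d)
    u = np.zeros(m)  # running pre-activation error
    for t in range(d):
        x_t = X[:, t]
        norm2 = x_t @ x_t
        if norm2 == 0:
            # Degenerate column: fall back to rounding the weight itself.
            q[t] = alphabet[np.argmin(np.abs(alphabet - w[t]))]
            continue
        # Greedy step: pick q[t] minimizing ||u + (w[t] - q[t]) * x_t||, i.e. round
        # the scalar projection onto x_t to the nearest alphabet element.
        # A sparsity-promoting variant could shrink `target` toward zero
        # (soft thresholding) before rounding.
        target = (x_t @ (u + w[t] * x_t)) / norm2
        q[t] = alphabet[np.argmin(np.abs(alphabet - target))]
        u = u + (w[t] - q[t]) * x_t
    return q

# Usage example with a symmetric, evenly spaced alphabet (hypothetical setup).
rng = np.random.default_rng(0)
X = rng.standard_normal((256, 64))
w = rng.standard_normal(64) / np.sqrt(64)
delta = np.max(np.abs(w))
alphabet = delta * np.linspace(-1, 1, 15)  # 15 levels including 0
q = gpfq_quantize_neuron(w, X, alphabet)
rel_err = np.linalg.norm(X @ (w - q)) / np.linalg.norm(X @ w)
print(f"relative pre-activation error: {rel_err:.3f}")
```

The key design choice illustrated here is that the rounding decision for each weight depends on the data and on all previous rounding errors, rather than rounding each weight in isolation.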