We propose a novel two-stage, sub-8-bit quantization-aware training algorithm for all components of a 250K-parameter feedforward, streaming, state-free keyword spotting model. In the first stage, we adapt a recently proposed quantization technique using a non-linear transformation with tanh(.) on dense layer weights. In the second stage, we use linear quantization methods on the rest of the network, including the remaining parameters (bias, gain, batchnorm), inputs, and activations. We conduct large-scale experiments, training on 26,000 hours of de-identified production far-field and near-field audio data (evaluating on 4,000 hours of data). We organize our results in two embedded chipset settings: a) with commodity ARM NEON instruction set and 8-bit con...
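The tanh-based first stage described above can be sketched roughly as follows. This is a minimal NumPy illustration under my own assumptions, not the paper's exact formulation: the function name `tanh_fake_quantize` and the choice of a symmetric uniform grid after the tanh squashing are illustrative.

```python
import numpy as np

def tanh_fake_quantize(weights, num_bits=4):
    """Illustrative stage-1 fake quantization: squash dense-layer weights
    with tanh(.), normalize to [-1, 1], snap to a symmetric uniform grid,
    then de-normalize for the forward pass. A sketch, not the authors'
    exact method."""
    squashed = np.tanh(weights)                  # non-linear transform
    scale = np.max(np.abs(squashed)) or 1.0      # guard against all-zero weights
    normalized = squashed / scale                # values now lie in [-1, 1]
    levels = 2 ** (num_bits - 1) - 1             # e.g. 7 positive levels at 4 bits
    quantized = np.round(normalized * levels) / levels
    return quantized * scale                     # de-quantized values used forward

# Example: quantize a tiny weight vector to 4 bits.
w = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
wq = tanh_fake_quantize(w, num_bits=4)
```

In quantization-aware training, a function like this would be applied in the forward pass while gradients flow to the full-precision weights (a straight-through estimator); the tanh squashing compresses outlier weights so the uniform grid spends its levels where most of the mass is.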
One-bit quantization is a general tool to execute a complex model, such as deep neural networks, on a...
Thesis (Master's)--University of Washington, 2021. As more electronic devices have an on-device Keywor...
Post-training quantization (PTQ) is the go-to compression technique for large generative models, suc...
Large language models (LLMs) show excellent performance but are compute- and memory-intensive. Quant...
Large language models (LLMs) exhibit excellent performance across a variety of tasks, but they come w...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
The introduction of artificial neural networks (ANNs) to speech recognition applications has sparked...
Artificial Neural Networks (NNs) can effectively be used to solve many classification and regression...
The compression of deep learning models is of fundamental importance in deploying such models to edg...
We introduce an Artificial Neural Network (ANN) quantization methodology for platforms without wide ...
In the context of keyword spotting (KWS), the replacement of handcrafted speech features by learnabl...
Neural networks have demonstrably achieved state-of-the-art accuracy using low-bitlength integer qua...
Power consumption in small devices is dominated by off-chip memory accesses, necessitating small mod...
Deep neural networks (DNN) have achieved impressive success in multiple domains. Over the years, the...
Reducing the latency and model size has always been a significant research problem for live Automati...