Work-in-Progress: Quantized NNs as the Definitive solution for inference on low-power ARM MCUs?

Rusci, Manuele
Capotondi, Alessandro
Conti, Francesco
Benini, Luca

Publication date

January 2018

DOI

10.1109/CODESISSS.2018.8525915

Publisher

Institute of Electrical and Electronics Engineers Inc.

Language

English

Abstract

High energy efficiency and a low memory footprint are the key requirements for deploying deep-learning-based analytics on low-power microcontrollers. Here we present work-in-progress results with Q-bit Quantized Neural Networks (QNNs) deployed on a commercial Cortex-M7 class microcontroller by means of an extension to the ARM CMSIS-NN library. We show that i) for Q=4 and Q=2, low-memory-footprint QNNs can be deployed with an energy overhead of 30% and 36%, respectively, against the 8-bit CMSIS-NN, due to the lack of quantization support in the ISA; ii) for Q=1, native instructions can be used, yielding an energy and latency reduction of ∼3.8× with respect to CMSIS-NN. Our initial results suggest that a small set of QNN-related specialized ...
