The severe on-chip memory limitations of tiny MicroController Units (MCUs) currently prevent the deployment of the most accurate Deep Neural Network (DNN) models, even when an effective 8-bit quantization scheme is leveraged. To tackle this issue, in this paper we present an automated mixed-precision quantization flow based on the HAQ framework but tailored to the memory and computational characteristics of MCU devices. Specifically, a Reinforcement Learning agent searches for the best uniform quantization levels, among 2, 4, and 8 bits, for individual weight and activation tensors, under tight constraints on RAM and FLASH embedded memory sizes. We conduct an experimental analysis on MobileNetV1, MobileNetV2 and MNasNet models for Imagen...
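The abstract above describes searching over uniform 2-, 4-, and 8-bit quantization levels per tensor under memory constraints. The following is a minimal illustrative sketch (not the actual HAQ-based flow, and the function names are my own) of uniform symmetric quantization at those bit-widths and of the packed storage cost that such a search would trade off against accuracy:

```python
import numpy as np

def quantize_uniform(x, bits):
    """Uniformly quantize a tensor onto a signed integer grid of `bits` bits,
    then dequantize back to floats; returns (dequantized tensor, scale)."""
    qmax = 2 ** (bits - 1) - 1
    max_abs = float(np.max(np.abs(x)))
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = np.clip(np.round(x / scale), -qmax - 1, qmax)
    return q * scale, scale

def memory_bytes(shape, bits):
    """Packed storage cost of a tensor at the given bit-width."""
    return int(np.prod(shape)) * bits // 8

rng = np.random.default_rng(0)
w = rng.standard_normal((64, 32)).astype(np.float32)

# Lower bit-widths shrink the memory footprint but raise quantization error,
# which is the trade-off the RL agent navigates per tensor.
for bits in (2, 4, 8):
    wq, _ = quantize_uniform(w, bits)
    mse = float(np.mean((w - wq) ** 2))
    print(f"{bits}-bit: {memory_bytes(w.shape, bits)} bytes, MSE {mse:.4f}")
```

A per-tensor bit-width assignment is feasible only if the summed `memory_bytes` of all weight tensors fits in FLASH and the largest co-resident activation tensors fit in RAM.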
Low bit-width Quantized Neural Networks (QNNs) enable deployment of complex machine learning models ...
Quantization of neural networks has been one of the most popular techniques to compress models for e...
Deep Learning is moving to edge devices, ushering in a new age of distributed Artificial Intelligenc...
The large computing and memory cost of deep neural networks (DNNs) often precludes their use in reso...
Quantization of deep neural networks is a common way to optimize the networks for deployment on ener...
The deployment of Quantized Neural Networks (QNN) on advanced microcontrollers requires optimized so...
Quantization emerges as one of the most promising approaches for deploying advanced deep models on r...
Recently, there has been a push to perform deep learning (DL) computations on the edge rather than t...
Abstract Model quantization is a widely used technique to compress and accelerate deep neural netwo...
Deep Neural Network (DNN) inference based on quantized narrow-precision integer data represents a pr...
Low-precision integer arithmetic is a necessary ingredient for enabling Deep Learning inference on t...
To bridge the ever increasing gap between deep neural networks' complexity and hardware capability, ...
Quantized neural networks are well known for reducing latency, power consumption, and model size wit...