The memory required to host and process large tensor graphs is a limiting factor for embedded ConvNets. Even though many data-driven compression pipelines have proven effective, this work shows there is still room for optimization at the intersection with compute-oriented optimizations. We demonstrate that tensor pruning via weight sparsification can cooperate with a model-agnostic tiling strategy, steering ConvNets towards a new feasible region of the solution space. The collected results show, for the first time, fast versions of MobileNets deployed at full scale on an ARM Cortex-M7 core with 512 KB of RAM and 2 MB of FLASH memory.
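As a minimal sketch of the pruning step the abstract refers to, the following shows magnitude-based weight sparsification — zeroing the smallest-magnitude fraction of a weight tensor. This is one common sparsification criterion, assumed here for illustration; the paper's exact pruning method may differ.

```python
import numpy as np

def sparsify(weights: np.ndarray, sparsity: float = 0.5) -> np.ndarray:
    """Zero out the `sparsity` fraction of weights with smallest magnitude."""
    flat = np.abs(weights).ravel()
    k = int(sparsity * flat.size)  # number of weights to prune
    if k == 0:
        return weights.copy()
    # k-th smallest magnitude becomes the pruning threshold
    threshold = np.partition(flat, k - 1)[k - 1]
    mask = np.abs(weights) > threshold
    return weights * mask

# Toy 2x3 weight tensor; at 50% sparsity the three smallest-magnitude
# entries (0.01, 0.02, 0.05) are zeroed.
w = np.array([[0.9, -0.01, 0.3],
              [-0.05, 0.7, 0.02]])
pruned = sparsify(w, sparsity=0.5)
```

The resulting zeros are what a downstream sparse storage format or tiling scheme can exploit to shrink the RAM/FLASH footprint.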
International audienceMany domains of scientific simulation (chemistry, condensed matter physics, da...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Popular Machine Learning (ML) and High Performance Computing (HPC) workloads contribute to a signifi...
140 pagesTensor algebra lives at the heart of big data applications. Where classical machine learnin...
The inherent sparsity present in convolutional neural networks (CNNs) offers a valuable opportunity ...
Deep neural networks (DNNs) have become the primary methods to solve machine learning and artificial...
The emergence of deep learning has launched many works in deep learning accelerators. To fully reali...
Tensor algorithms are a rapidly growing field of research with applications in many scientific domai...
In this thesis, we develop high performance algorithms for certain computations involving dense tens...
Tensor decomposition (TD) is an important method for extracting latent information from high-dimensi...
University of Minnesota Ph.D. dissertation. April 2019. Major: Computer Science. Advisor: George Ka...