Recurrent Neural Networks (RNNs) are state-of-the-art models for many machine learning tasks, such as language modeling and machine translation. Executing the inference phase of an RNN directly on edge nodes, rather than in the cloud, would provide benefits in terms of energy consumption, latency, and network bandwidth, provided that models can be made efficient enough to run on energy-constrained embedded devices. To this end, we propose an algorithmic optimization for improving the energy efficiency of encoder-decoder RNNs. Our method operates on the Beam Width (BW), i.e., one of the parameters that most influences inference complexity, modulating it for each processed input based on a metric of the network's "confidence...
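The abstract above describes confidence-dependent Beam Width modulation only at a high level. The following minimal Python sketch illustrates the general idea of picking a per-step beam width from the decoder's output distribution; the function name, the use of the top softmax probability as the confidence metric, and the threshold values are illustrative assumptions, not details taken from the paper.

```python
def adaptive_beam_width(step_probs, max_bw=5, high_conf=0.9, mid_conf=0.6):
    """Choose a beam width for the current decoding step.

    A peaked output distribution (high confidence) allows a narrow beam,
    reducing the number of hypotheses expanded and hence the inference cost;
    a flat distribution falls back to the full beam. Thresholds are
    hypothetical placeholders.
    """
    top_p = max(step_probs)          # confidence proxy: highest class probability
    if top_p >= high_conf:
        return 1                     # very confident: greedy decoding suffices
    if top_p >= mid_conf:
        return max(1, max_bw // 2)   # moderately confident: reduced beam
    return max_bw                    # uncertain: keep the full beam width

# Example usage: a peaked distribution yields a narrow beam, a flat one does not.
print(adaptive_beam_width([0.93, 0.04, 0.02, 0.01]))  # -> 1
print(adaptive_beam_width([0.35, 0.30, 0.20, 0.15]))  # -> 5
```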
Increasing the capacity of recurrent neural networks (RNN) usually involves augmenting the size...
This article introduces Random Error Sampling-based Neuroevolution (RESN), a novel automatic method ...
This work targets the automated minimum-energy optimization of Quantized Neural Network...
Recurrent Neural Networks (RNNs) are a key technology for emerging applications such as automatic sp...
Deep learning models have reached state-of-the-art performance in many machine learning tasks. Benef...
Deep Learning algorithms have been remarkably successful in applications such as Automatic Speech Re...
Over the past decade, Deep Learning (DL) and Deep Neural Networks (DNNs) have gone through a rapid dev...
The recent shift in machine learning towards the edge offers a new opportunity to realize intelligen...
Neural network language models are often trained by optimizing likelihood, but we would prefer to op...
Recurrent neural networks (RNNs) have represented for years the state of the art in neural machine t...
Lightweight neural networks that employ depthwise convolution have a significant computational advan...
Recurrent neural networks such as Long Short-Term Memories (LSTMs) learn temporal dependencies by ke...
This paper proposes a novel latency-hiding hardware architecture based on column-wise matrix-vector ...
In recent years, the field of neuromorphic low-power systems has gained significant momentum, spurring br...
Recurrent neural networks (RNNs) have achieved state-of-the-art performance on various applications...