Many advanced neural network inference engines are limited by the available memory bandwidth. The conventional approach to addressing this issue is to employ high-bandwidth memory devices or to adopt data compression techniques (reduced precision, sparse weight matrices). Alternatively, an emerging approach to bridge the memory-computation gap and to exploit extreme data parallelism is Processing in Memory (PIM). The close proximity of the computation units to the memory cells reduces external data transfers and increases the overall energy efficiency of the memory system. In this work, we present a novel PIM-based Binary Weighted Network (BWN) inference accelerator design that is in line with the commodity Dynamic Random Ac...
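To make the binary-weight computation concrete, the following is a minimal sketch (in NumPy, not taken from the paper) of the kind of operation a BWN inference accelerator evaluates: with weights constrained to {-1, +1}, every multiply-accumulate in a layer reduces to an addition or a subtraction, which is the cheap pattern a PIM design can perform close to the DRAM cells. The function and variable names (binarize_weights, bwn_linear, alpha) are illustrative assumptions, not from the source.

import numpy as np

# Illustrative sketch only: a binary-weighted (BWN) fully connected layer.
# Weights are restricted to {-1, +1}, so the dot product is a sum of inputs
# with +1 weights minus a sum of inputs with -1 weights -- the add/subtract
# pattern that in-memory (PIM) designs aim to exploit.

def binarize_weights(w_real):
    """Quantize real-valued weights to {-1, +1} by their sign."""
    return np.where(w_real >= 0, 1, -1).astype(np.int8)

def bwn_linear(x, w_bin, alpha=1.0):
    """Binary-weight matrix-vector product: y = alpha * (x @ w_bin.T)."""
    pos = x @ (w_bin == 1).T.astype(x.dtype)   # accumulate inputs with +1 weights
    neg = x @ (w_bin == -1).T.astype(x.dtype)  # accumulate inputs with -1 weights
    return alpha * (pos - neg)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = binarize_weights(rng.standard_normal((4, 8)))    # 4 outputs, 8 inputs
    x = rng.standard_normal((1, 8)).astype(np.float32)   # one input vector
    print(bwn_linear(x, w))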
The need for running complex Machine Learning (ML) algorithms, such as Convolutional Neural Networks...
With the explosion of AI in recent years, there has been an exponential rise in the demand for compu...
For decades, the computing paradigm has been composed of separate memory and compute units. Processi...
We propose a novel computation-in-memory (CIM) architecture based on DRAM for binary neural network,...
Different in-memory computing paradigms enabled by emerging non-volatile memory technologies are pro...
The proliferation of embedded Neural Processing Units (NPUs) is enabling the adoption of Tiny Machin...
In this paper, we explore the potential of leveraging a spin-based in-memory computing platform as an acc...
In-Memory Acceleration (IMA) promises major efficiency improvements in deep neural network (DNN) inf...
While Deep Neural Networks (DNNs) have shown cutting-edge performance on various applications,...
Recent years have witnessed a rapid growth in the amount of generated data, owing to the emergence o...
As Binary Neural Networks (BNNs) started to show promising performance with limited memory and compu...
Deep Neural Networks (DNNs) have become a promising solution to inject AI into our daily lives, from se...
As AI applications become more prevalent and powerful, the performance of deep learning neural netwo...
New computing applications, e.g., deep neural network (DNN) training and inference, have been a driv...