Neural networks (NNs) are growing in importance and complexity. A neural network's performance (and energy efficiency) can be bound either by computation or memory resources. The processing-in-memory (PIM) paradigm, where computation is placed near or within memory arrays, is a viable solution to accelerate memory-bound NNs. However, PIM architectures vary in form, where different PIM approaches lead to different trade-offs. Our goal is to analyze, discuss, and contrast DRAM-based PIM architectures for NN performance and energy efficiency. To do so, we analyze three state-of-the-art PIM architectures: (1) UPMEM, which integrates processors and DRAM arrays into a single 2D chip; (2) Mensa, a 3D-stack-based PIM architecture tailored for edge devices; and (3) SIMDRAM, which uses the analog principles of DRAM operation to execute bit-serial operations.
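To make the memory-bound claim concrete, the following minimal sketch (ours, not from the paper; the function name, row-major layout, and single-precision types are illustrative assumptions) shows a plain general matrix-vector multiplication (GEMV) loop, a core NN inference primitive. Each weight is fetched from DRAM once and used for exactly one multiply-accumulate, giving roughly 2 FLOPs per 4 bytes of memory traffic (~0.5 FLOP/byte), which is far too low to keep a CPU or GPU compute-bound; placing this loop near or within the DRAM arrays, as PIM does, eliminates the off-chip transfer of the weight matrix.

#include <stddef.h>

/* Illustrative GEMV sketch: y = W * x, with W a rows x cols
 * row-major weight matrix. Every w[i*cols + j] is loaded once and
 * used once (1 mul + 1 add), so DRAM bandwidth, not arithmetic,
 * limits throughput on conventional processors. */
void gemv(size_t rows, size_t cols,
          const float *w,   /* rows x cols weights, row-major */
          const float *x,   /* input vector, length cols      */
          float *y)         /* output vector, length rows     */
{
    for (size_t i = 0; i < rows; i++) {
        float acc = 0.0f;
        for (size_t j = 0; j < cols; j++)
            acc += w[i * cols + j] * x[j];  /* 2 FLOPs per 4-byte load */
        y[i] = acc;
    }
}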