Abstract: The end of Dennard scaling has made all systems energy-constrained. For data-intensive applications with limited temporal locality, the major energy bottleneck is data movement between processor chips and main memory modules. For such workloads, the best way to optimize energy is to place processing near the data in main memory. Advances in 3D integration provide an opportunity to implement near-data processing (NDP) without the technology problems that similar efforts had in the past. This paper develops the hardware and software of an NDP architecture for in-memory analytics frameworks, including MapReduce, graph processing, and deep neural networks. We develop simple but scalable hardware support for coherence, communication, a...
Over the last decades, a tremendous change toward using information technology in almost every daily...
We are in the computing era of super-zetta data bytes (a.k.a. Big Data). Big Data is critical to dev...
Near-memory Computing (NMC) promises improved performance for the applications that can exploit the ...
The exponential growth of the dataset size demanded by modern big data applications requires innovat...
Recent technology advances in memory system design, along with 3D stacking, have made near-data proc...
Data-intensive workloads and applications, such as machine learning (ML), are fundamentally limited ...
The conventional approach of moving data to the CPU for computation has become a significant perform...
As the performance of DRAM devices falls more and more behind computing capabilities, the limitation...
The conventional approach of moving stored data to the CPU for computation has become a major perfor...
For the past two decades, the scaling of main memory has lagged behind the advancement of computation in a...
3D-stacked memory devices with processing logic can help alleviate the memory bandwidth bottleneck i...
While Processing-in-Memory has been investigated for decades, it has not been embraced comm...
The increasing demand for extracting value out of ever-growing data poses an ongoing challenge to sy...