Recent technology advances in memory system design, along with 3D stacking, have made near-data processing (NDP) more feasible to accelerate different workloads. In this work, we explore near-data processing for a fundamental operation - linked-list traversal (LLT). We propose a new NDP architecture that does not change the existing sequential programming model and does not require any modification to the processor microarchitecture. Instead, we exploit the packetized interface between the core and the memory modules to off-load LLT for NDP. We leverage a system with multiple memory modules (e.g., hybrid memory cube (HMC) modules) interconnected with a memory network and our initial evaluation shows that simply off-loading LLT computation t...
Over the last decades, a tremendous change toward using information technology in almost every daily...
The limitations of DRAM technology in terms of energy consumption and Bandwidth poses a serious prob...
The explosion of data availability and fast data analytic requirements led to the advent of data-int...
Abstract—The end of Dennard scaling has made all sys-tems energy-constrained. For data-intensive app...
While Processing-in-Memory has been investigated for decades, it has not been embraced commercially....
pre-printWhile Processing-in-Memory has been investigated for decades, it has not been embraced comm...
3D-stacked memory devices with processing logic can help alleviate the memory bandwidth bottleneck i...
A large fraction of MapReduce execution time is spent processing the Map phase, and a large fraction...
As the performance of DRAM devices falls more and more behind computing capabilities, the limitation...
Data movement between memory and CPU is a well-known energy bottleneck for analytics. Near-Memory Pr...
This paper proposes both software and hardware mechanisms based on the near-memory processing (NMP) ...
The cost of transferring data between the off-chip memory system and compute unit is the fundamental...
The exponential growth of the dataset size demanded by modern big data applications requires innovat...
The conventional approach of moving data to the CPU for computation has become a significant perform...
The conventional approach of moving data to the CPU for computation has become a significant perform...
Over the last decades, a tremendous change toward using information technology in almost every daily...
The limitations of DRAM technology in terms of energy consumption and Bandwidth poses a serious prob...
The explosion of data availability and fast data analytic requirements led to the advent of data-int...
Abstract—The end of Dennard scaling has made all sys-tems energy-constrained. For data-intensive app...
While Processing-in-Memory has been investigated for decades, it has not been embraced commercially....
pre-printWhile Processing-in-Memory has been investigated for decades, it has not been embraced comm...
3D-stacked memory devices with processing logic can help alleviate the memory bandwidth bottleneck i...
A large fraction of MapReduce execution time is spent processing the Map phase, and a large fraction...
As the performance of DRAM devices falls more and more behind computing capabilities, the limitation...
Data movement between memory and CPU is a well-known energy bottleneck for analytics. Near-Memory Pr...
This paper proposes both software and hardware mechanisms based on the near-memory processing (NMP) ...
The cost of transferring data between the off-chip memory system and compute unit is the fundamental...
The exponential growth of the dataset size demanded by modern big data applications requires innovat...
The conventional approach of moving data to the CPU for computation has become a significant perform...
The conventional approach of moving data to the CPU for computation has become a significant perform...
Over the last decades, a tremendous change toward using information technology in almost every daily...
The limitations of DRAM technology in terms of energy consumption and Bandwidth poses a serious prob...
The explosion of data availability and fast data analytic requirements led to the advent of data-int...