Abstract—Specialized hardware acceleration is an effective technique to mitigate the dark silicon problems. A challenge in designing on-chip hardware accelerators for data-intensive applications is how to efficiently transfer data between the memory hierarchy and the accelerators. Although the Processing-in-Memory (PIM) technique has the potential to reduce the overhead of data transfers, it is limited by the traditional process technology. Recent process technology advancements such as 3D-die stacking enable efficient PIM architectures by integrating accelerators to the logic layer of 3D DRAM, thus leading to the concept of the 3D-stacked Memory-Side Accelerator (MSA). In this paper, we initially present the overall architecture of the 3D-...
In light of the failure of Dennard scaling and recent slowdown of Moore's Law, both industry and aca...
Three-dimensional (3D) on-chip memory stacking has been proposed as a promising solution to the &apo...
In this paper, we show how 3D stacking technology can be used to implement a simple, low-power, high...
<p>This paper introduces a 3D-stacked logic-in-memory (LiM) system to accelerate the processing of s...
The main aim of this thesis is to examine the advantages of 3D stacking applied to microprocessors a...
Abstract — This paper introduces our research status focusing on 3D-implemented microprocessors. 3D-...
none8Convergence of communication, consumer applications and computing within mobile systems pushes ...
none43D integration based on TSV (through silicon via) technology enables stacking of multiple memor...
The cost of transferring data between the off-chip memory system and compute unit is the fundamental...
To address the 'memory wall' challenge, on-chip memory stacking has been proposed as a pro...
[[abstract]]To address the “memory wall” challenge, on-chip memory stacking has been proposed as a p...
Abstract—Energy-efficiency has emerged as a major barrier to performance scalability for modern proc...
In this paper we focus on common data reorganization op-erations such as shuffle, pack/unpack, swap,...
Recently, stereo matching processors have been adopted in real-time embedded systems such as intelli...
In this paper, we show how 3D stacking technology can be used to implement a simple, low-power, high...
In light of the failure of Dennard scaling and recent slowdown of Moore's Law, both industry and aca...
Three-dimensional (3D) on-chip memory stacking has been proposed as a promising solution to the &apo...
In this paper, we show how 3D stacking technology can be used to implement a simple, low-power, high...
<p>This paper introduces a 3D-stacked logic-in-memory (LiM) system to accelerate the processing of s...
The main aim of this thesis is to examine the advantages of 3D stacking applied to microprocessors a...
Abstract — This paper introduces our research status focusing on 3D-implemented microprocessors. 3D-...
none8Convergence of communication, consumer applications and computing within mobile systems pushes ...
none43D integration based on TSV (through silicon via) technology enables stacking of multiple memor...
The cost of transferring data between the off-chip memory system and compute unit is the fundamental...
To address the 'memory wall' challenge, on-chip memory stacking has been proposed as a pro...
[[abstract]]To address the “memory wall” challenge, on-chip memory stacking has been proposed as a p...
Abstract—Energy-efficiency has emerged as a major barrier to performance scalability for modern proc...
In this paper we focus on common data reorganization op-erations such as shuffle, pack/unpack, swap,...
Recently, stereo matching processors have been adopted in real-time embedded systems such as intelli...
In this paper, we show how 3D stacking technology can be used to implement a simple, low-power, high...
In light of the failure of Dennard scaling and recent slowdown of Moore's Law, both industry and aca...
Three-dimensional (3D) on-chip memory stacking has been proposed as a promising solution to the &apo...
In this paper, we show how 3D stacking technology can be used to implement a simple, low-power, high...