Journal ArticleCache hierarchies in future many-core processors are expected to grow in size and contribute a large fraction of overall processor power and performance. In this paper, we postulate a 3D chip design that stacks SRAM and DRAM upon processing cores and employs OS-based page coloring to minimize horizontal communication of cache data. We then propose a heterogeneous reconfigurable cache design that takes advantage of the high density of DRAM and the superior power/delay characteristics of SRAM to efficiently meet the working set demands of each individual core. Finally, we analyze the communication patterns for such a processor and show that a tree topology is an ideal fit that significantly reduces the power and latency requi...
Advancements in packaging technology enable high-bandwidth 3D-DRAM that mitigates the memory bandwid...
Abstract—As scaling DRAM cells becomes more challenging and energy-efficient DRAM chips are in high ...
Recently, 3D-stacked dynamic random access memory (DRAM) has become a promising solution for ultra-h...
Abstract — This paper introduces our research status focusing on 3D-implemented microprocessors. 3D-...
Journal ArticleIn future multi-cores, large amounts of delay and power will be spent accessing data...
The objective of this thesis is to optimize the uncore of 3D many-core architectures. More specifica...
Over the last three decades, innovations in the memory subsystem were primarily targeted at overcomi...
Resource pooling, wheremultiple architectural components are shared among cores, is a promising tech...
Abstract—Recent research advocates large die-stacked DRAM caches in manycore servers to break the me...
Abstract—Resource pooling, where multiple architectural components are shared among multiple cores, ...
Multiple-channel die-stacked DRAMs have been used for maximizing the performance and minimizing the ...
International audienceWith the emergence of manycore architectures, the need of on-chip memories suc...
Journal ArticleThe ever increasing sizes of on-chip caches and the growing domination of wire delay...
Abstract—In chip-multiprocessor (CMP) designs, limited mem-ory bandwidth is a potential bottleneck o...
Memory bandwidth has become a major performance bottleneck as more and more cores are integrated ont...
Advancements in packaging technology enable high-bandwidth 3D-DRAM that mitigates the memory bandwid...
Abstract—As scaling DRAM cells becomes more challenging and energy-efficient DRAM chips are in high ...
Recently, 3D-stacked dynamic random access memory (DRAM) has become a promising solution for ultra-h...
Abstract — This paper introduces our research status focusing on 3D-implemented microprocessors. 3D-...
Journal ArticleIn future multi-cores, large amounts of delay and power will be spent accessing data...
The objective of this thesis is to optimize the uncore of 3D many-core architectures. More specifica...
Over the last three decades, innovations in the memory subsystem were primarily targeted at overcomi...
Resource pooling, wheremultiple architectural components are shared among cores, is a promising tech...
Abstract—Recent research advocates large die-stacked DRAM caches in manycore servers to break the me...
Abstract—Resource pooling, where multiple architectural components are shared among multiple cores, ...
Multiple-channel die-stacked DRAMs have been used for maximizing the performance and minimizing the ...
International audienceWith the emergence of manycore architectures, the need of on-chip memories suc...
Journal ArticleThe ever increasing sizes of on-chip caches and the growing domination of wire delay...
Abstract—In chip-multiprocessor (CMP) designs, limited mem-ory bandwidth is a potential bottleneck o...
Memory bandwidth has become a major performance bottleneck as more and more cores are integrated ont...
Advancements in packaging technology enable high-bandwidth 3D-DRAM that mitigates the memory bandwid...
Abstract—As scaling DRAM cells becomes more challenging and energy-efficient DRAM chips are in high ...
Recently, 3D-stacked dynamic random access memory (DRAM) has become a promising solution for ultra-h...