This paper proposes both software and hardware mechanisms based on the near-memory processing (NMP) accelerator to improve the linked list traversal of the in-memory caching. From a software perspective, we propose a simple but an effective mechanism called ITEM JUMP to reduce the number of traversal on list iteration, and additionally, LSB-first parallel linked list traversal unit, which is an NMP-based hardware accelerator is proposed to improve parallel comparison performance of items. The evaluation result shows LSB-first parallel linked list traversal unit can achieve about 34 times better performance in item comparisons than the case where there is no hardware accelerator, and ITEM JUMP can reduce the number of items retrieved by up t...
International audienceToday computing centric von Neumann architectures face strong limitations in t...
The memory system remains a major performance bottleneck in modern and future architectures. In this...
The Near Memory Processor (NMP) is a multithreaded vector processor integrated with the memory contr...
Recent technology advances in memory system design, along with 3D stacking, have made near-data proc...
This dissertation presents a hardware accelerator that is able to accelerate large (including non-pa...
Recent years have witnessed a rapid growth in the amount of generated data, owing to the emergence o...
There has been a significant amount of research on hardware and software support for efficient concu...
A large fraction of MapReduce execution time is spent processing the Map phase, and a large fraction...
Many important scientific and engineering applications execute sub-optimally on current commodity pr...
AbstractAlthough parallel algorithms using linked lists, trees, and graphs have been studied extensi...
The increasing capacity of NAND flash memory leads to large RAM footprint on address mapping in the ...
100 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2007.In the architectural aspect, ...
Submitted for review to MICRO-40 conference the 9th of June 2007This paper introduces an advanced ha...
The performance of memory-bound commercial applications such as databases is limited by increasing m...
The decreasing cost of DRAM has made possible and grown the use of in-memory databases. However, mem...
International audienceToday computing centric von Neumann architectures face strong limitations in t...
The memory system remains a major performance bottleneck in modern and future architectures. In this...
The Near Memory Processor (NMP) is a multithreaded vector processor integrated with the memory contr...
Recent technology advances in memory system design, along with 3D stacking, have made near-data proc...
This dissertation presents a hardware accelerator that is able to accelerate large (including non-pa...
Recent years have witnessed a rapid growth in the amount of generated data, owing to the emergence o...
There has been a significant amount of research on hardware and software support for efficient concu...
A large fraction of MapReduce execution time is spent processing the Map phase, and a large fraction...
Many important scientific and engineering applications execute sub-optimally on current commodity pr...
AbstractAlthough parallel algorithms using linked lists, trees, and graphs have been studied extensi...
The increasing capacity of NAND flash memory leads to large RAM footprint on address mapping in the ...
100 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2007.In the architectural aspect, ...
Submitted for review to MICRO-40 conference the 9th of June 2007This paper introduces an advanced ha...
The performance of memory-bound commercial applications such as databases is limited by increasing m...
The decreasing cost of DRAM has made possible and grown the use of in-memory databases. However, mem...
International audienceToday computing centric von Neumann architectures face strong limitations in t...
The memory system remains a major performance bottleneck in modern and future architectures. In this...
The Near Memory Processor (NMP) is a multithreaded vector processor integrated with the memory contr...