Abstract A widely adopted design paradigm for many-core accelerators features processing elements grouped in clusters. Due to area, power and design simplicity, pro-cessors in the same clusters are often not equipped with data-caches but rather share a tightly coupled data memory (TCDM). Even if the use of a TCDM is more energy and area efficient than a cache, it requires a higher programming effort because memory transfers need to be explicitly man-aged (often with DMA-based off-chip memory to TCDM copies). In this context software caches can be used to automatically transfer data between the local TCDM and the external memory, simplifying the task of the program-mer. Despite their ease of use, software caches may incur in non-negligible o...
Set-associative caches are traditionally managed using hardwarebased lookup and replacement schemes ...
L1 instruction caches in many-core systems represent a siz-able fraction of the total power consumpt...
With rapidly evolving technology, multicore and manycore processors have emerged as promising archit...
A widely adopted design paradigm for many-core accelerators features processing elements grouped in ...
A widely adopted design paradigm for many-core accelerators features processing elements grouped in ...
Modern many-core programmable accelerators are often composed by several computing units grouped in ...
Increasing demand for power-efficient, high-performance computing has spurred a growing number and d...
The growing computing demands of emerging application domains such as Recognition/Mining/Synthesis (...
The performance of a computing system heavily depends on the memory hierarchy. Fast but expensive ca...
Multicore chips will have large amounts of fast on-chip cache memory, along with relatively slow DRA...
abstract: Caches have long been used to reduce memory access latency. However, the increased complex...
This thesis proposes a software-oriented distributed shared cache management approach for chip multi...
... embedded devices to have the benefits of a memory hierarchy without the hardware costs. A softwa...
Software-coherent, distributed shared memory has received conciderable amount of attention as an att...
In this paper we propose an instruction to accelerate software caches. While DMAs are very efficient...
Set-associative caches are traditionally managed using hardwarebased lookup and replacement schemes ...
L1 instruction caches in many-core systems represent a siz-able fraction of the total power consumpt...
With rapidly evolving technology, multicore and manycore processors have emerged as promising archit...
A widely adopted design paradigm for many-core accelerators features processing elements grouped in ...
A widely adopted design paradigm for many-core accelerators features processing elements grouped in ...
Modern many-core programmable accelerators are often composed by several computing units grouped in ...
Increasing demand for power-efficient, high-performance computing has spurred a growing number and d...
The growing computing demands of emerging application domains such as Recognition/Mining/Synthesis (...
The performance of a computing system heavily depends on the memory hierarchy. Fast but expensive ca...
Multicore chips will have large amounts of fast on-chip cache memory, along with relatively slow DRA...
abstract: Caches have long been used to reduce memory access latency. However, the increased complex...
This thesis proposes a software-oriented distributed shared cache management approach for chip multi...
... embedded devices to have the benefits of a memory hierarchy without the hardware costs. A softwa...
Software-coherent, distributed shared memory has received conciderable amount of attention as an att...
In this paper we propose an instruction to accelerate software caches. While DMAs are very efficient...
Set-associative caches are traditionally managed using hardwarebased lookup and replacement schemes ...
L1 instruction caches in many-core systems represent a siz-able fraction of the total power consumpt...
With rapidly evolving technology, multicore and manycore processors have emerged as promising archit...