Modern many-core programmable accelerators are often composed of several computing units grouped in clusters, each with a shared per-cluster scratchpad data memory. The main programming challenge imposed by these architectures is hiding the latency of transfers from external memory to the on-chip scratchpad, overlapping memory transfers with computation as much as possible. This problem is usually tackled with complex DMA-based programming patterns (e.g., double buffering), which require heavy refactoring of applications. Software caches are an alternative to hand-optimized DMA programming. However, even if a software cache reduces the programming effort, it still relies on synchronous memory transfers. In fact, in case of a cach...
While many parallel applications exhibit good spatial locality, other important codes in areas like ...
Applications that exhibit regular memory access patterns usually benefit transparently from hardware...
The speed of processors increases much faster than the memory access time. This makes memory accesse...
Modern processors apply sophisticated techniques, such as deep cache hierarchies and hardware prefet...
A widely adopted design paradigm for many-core accelerators features processing elements grouped in ...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
Data-intensive applications often exhibit memory referencing patterns with little data reuse, result...
Many modern data processing and HPC workloads are heavily memory-latency bound. A tempting propositi...
In this paper we propose an instruction to accelerate software caches. While DMAs are very efficient...
Prefetching, i.e., exploiting the overlap of processor computations with data accesses, is one of s...