Abstract: Exploring data transfer and storage issues is crucial to efficiently mapping data-intensive applications (e.g., multimedia) onto programmable processors. Code transformations are used to minimise main memory bus load, and hence to reduce power consumption and improve system performance. However, this typically incurs a considerable arithmetic overhead in the addressing and local control. For instance, memory-optimising in-place and data-layout transformations add costly modulo and integer division operations to the initial addressing code. In this paper, we show how the cycle overhead can be almost completely removed. This is done according to a systematic methodology which is a combination of an algebraic transformation exploration approach for the (non)linear...
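To make the addressing overhead concrete, the following is a minimal sketch of one well-known way to avoid per-iteration modulo and integer division: replacing them with incrementally updated counters (a form of strength reduction). It is an illustration only and is not taken from the paper's methodology; the function names and the row/column layout with row length `m` are assumptions made for this example.

```python
def addresses_naive(n, m):
    """Map each linear index i to (row, col) using one division and one modulo per access."""
    return [(i // m, i % m) for i in range(n)]


def addresses_incremental(n, m):
    """Produce the same (row, col) sequence using only increments and a comparison.

    Hypothetical illustration: the counters are carried across iterations,
    so no division or modulo is executed inside the loop.
    """
    out = []
    row, col = 0, 0
    for _ in range(n):
        out.append((row, col))
        col += 1
        if col == m:      # wrap-around takes the place of i % m
            col = 0
            row += 1      # carry takes the place of i // m
    return out
```

Both functions return the same address sequence for any positive `m`; the second trades the costly operations for a compare and a conditional reset, which is the kind of cycle overhead the abstract refers to.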
This paper presents compilation techniques used to compress holes, which are caused by the nonunit a...
Many embedded architectures provide indirect addressing modes with autoincrement/decrement arithmet...
Memory intensive applications require considerable arithmetic for the computation and selection of t...
palkovic, miranda, denolf, vosp, catthoor: A cost-efficient realisation of an advanced multimedia system...
Abstract: Multimedia signal processing software typically has to process large amounts of data. Th...
An important class of problems used widely in both the embedded systems and scientific domains perfo...
Abstract: Many application-specific architectures provide indirect addressing modes with auto-increme...
Abstract: Modern high performance processors require memory systems that can provide access to data a...
We present a linear algebraic formulation for a class of index transformations such as Gray code enc...
Memory disambiguation mechanisms, coupled with load/store queues in out-of-ord...
This paper presents a technique for finding good distributions of arrays and suitable loop restructu...
Increasing the locality of a memory access profile is an interesting optimization problem, whose sol...
This paper presents compilation techniques to compress holes, which are caused by the non-unit align...
Computation-in-Memory (CIM) is an emerging computing paradigm to address memory bottleneck challenge...