Code generation for embedded processors opens up the possibility for several performance optimization techniques that have been ignored by traditional compilers that typically do not exploit architectural features of embedded processors such as paramaterized caches. In this report, we present techniques that take into account the parameters of the data caches, for organizing variables declared in embedded code into memory, with the objective of improving data cache performance. We present techniques for clustering variables to minimize compulsory cache misses, and for solving the memory assignment problem to minimize conflict cahce misses. Our experiments with benchmark code kernels from DSP and other domains on the CW4001 embedded processo...
Due to the rapid development in the technology, embedded systems have an effective part in controlli...
International audienceUsual cache optimisation techniques for high performance computing are difficu...
Abstract — In many computer systems, a large portion of the execution time and energy consumption is...
ing with credit is permitted. To copy otherwise, to republish, to post on servers, to redistribute t...
This paper explores an application-specific customization technique for the data cache, one of the f...
Multiprocessor embedded systems often have processor-local caches and a shared memory. If the system...
This thesis presents methodologies for improving system performance and energy consumptionby optimiz...
The instruction cache is a popular target for optimizations of microprocessor-based systems because ...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
Power has become one of the primary design constraints in modern embedded microprocessors. Many embe...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
This paper introduces a dynamic layout optimization strategy to minimize the number of cycles spent ...
When applying optimizations, a number of decisions are made using fixed strategies, such as always a...
Today, embedded processors are expected to be able to run complex, algorithm-heavy applications that...
We present a technique for determining the best data cache size required for a given memory-intensiv...
Due to the rapid development in the technology, embedded systems have an effective part in controlli...
International audienceUsual cache optimisation techniques for high performance computing are difficu...
Abstract — In many computer systems, a large portion of the execution time and energy consumption is...
ing with credit is permitted. To copy otherwise, to republish, to post on servers, to redistribute t...
This paper explores an application-specific customization technique for the data cache, one of the f...
Multiprocessor embedded systems often have processor-local caches and a shared memory. If the system...
This thesis presents methodologies for improving system performance and energy consumptionby optimiz...
The instruction cache is a popular target for optimizations of microprocessor-based systems because ...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
Power has become one of the primary design constraints in modern embedded microprocessors. Many embe...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
This paper introduces a dynamic layout optimization strategy to minimize the number of cycles spent ...
When applying optimizations, a number of decisions are made using fixed strategies, such as always a...
Today, embedded processors are expected to be able to run complex, algorithm-heavy applications that...
We present a technique for determining the best data cache size required for a given memory-intensiv...
Due to the rapid development in the technology, embedded systems have an effective part in controlli...
International audienceUsual cache optimisation techniques for high performance computing are difficu...
Abstract — In many computer systems, a large portion of the execution time and energy consumption is...