The development of embedded applications typically faces memory and/or execution time con-straints. In order to improve performance advanced compilers may use code transformations. There are cases where code transformations are not automatically applied and it is up to the devel-oper to manually apply them. One of the most relevant code transformations is Loop Unrolling. It is a widely studied transformation and it is able to improve the performance of many loops. It is easily implemented and its applicability is always legal. The main goal of this dissertation is to propose an approach to help developers decide about the application of Loop Unrolling and about the unroll factor to use. We propose an approach that uses a set of heuristics, ...
This paper presents UNRET (unrolling and retiming), a new approach for resourceconstrained software ...
Abstract. Optimizing compilers have a long history of applying loop transformations to C and Fortran...
Multithreading is attractive in that it can tolerate latency and synchronization, which are the two ...
It is well-known that, to optimize a program for speed-up, efforts should be focused on the regions ...
Loops in programs are the source of many optimizations for improv-ing program performance, particula...
We introduce Approximate Unrolling, a loop optimization that reduces execution time and energy consu...
Compilers base many critical decisions on abstracted architectural models. While recent research has...
ii The high performance of today’s microprocessors is achieved mainly by fast, multipleissuing hardw...
In order to deliver the promise of MooreÂs Law to the enduser, compilers must make decisions that ar...
International audienceSoftware pipelining is a powerful technique to expose fine-grain parallelism, ...
Loop unrolling is a widely adopted loop transformation, commonly used for enabling subsequent optimi...
In order to improve the accuracy of loop unrolling factor in the compiler, we propose a loop unrolli...
This thesis demonstrates that careful selection of compiler transformations can improve the output a...
International audienceThis article studies an important open problem in backend compilation regardin...
Loop optimizations such as loop unrolling, unfolding and invariant, code motion have long been used ...
This paper presents UNRET (unrolling and retiming), a new approach for resourceconstrained software ...
Abstract. Optimizing compilers have a long history of applying loop transformations to C and Fortran...
Multithreading is attractive in that it can tolerate latency and synchronization, which are the two ...
It is well-known that, to optimize a program for speed-up, efforts should be focused on the regions ...
Loops in programs are the source of many optimizations for improv-ing program performance, particula...
We introduce Approximate Unrolling, a loop optimization that reduces execution time and energy consu...
Compilers base many critical decisions on abstracted architectural models. While recent research has...
ii The high performance of today’s microprocessors is achieved mainly by fast, multipleissuing hardw...
In order to deliver the promise of MooreÂs Law to the enduser, compilers must make decisions that ar...
International audienceSoftware pipelining is a powerful technique to expose fine-grain parallelism, ...
Loop unrolling is a widely adopted loop transformation, commonly used for enabling subsequent optimi...
In order to improve the accuracy of loop unrolling factor in the compiler, we propose a loop unrolli...
This thesis demonstrates that careful selection of compiler transformations can improve the output a...
International audienceThis article studies an important open problem in backend compilation regardin...
Loop optimizations such as loop unrolling, unfolding and invariant, code motion have long been used ...
This paper presents UNRET (unrolling and retiming), a new approach for resourceconstrained software ...
Abstract. Optimizing compilers have a long history of applying loop transformations to C and Fortran...
Multithreading is attractive in that it can tolerate latency and synchronization, which are the two ...