We present a novel loop transformation technique, particularly well suited to optimizing compilers for embedded systems, where an increase in compilation time is acceptable in exchange for a significant performance gain. The technique optimizes loops containing nested conditional blocks. Specifically, it exploits the fact that the Boolean value of the conditional expression, which determines the true/false paths, can be analyzed statically using a novel interval analysis technique that can evaluate conditional expressions in general polynomial form. Results from the interval analysis, combined with loop dependency information, are used to partition the iteration space of the nested loop. In such cases, the loop nest ...
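The idea of partitioning an iteration space by the statically known truth value of a polynomial condition can be illustrated with a minimal sketch. It is not the paper's algorithm: it substitutes plain interval arithmetic with bisection for the novel interval analysis, handles a single loop rather than a nest, and assumes the loop has no loop-carried dependencies. The function names, the polynomial i^2 - 10i + 21, and the two loop bodies are hypothetical choices made for illustration only.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # closed integer interval [lo, hi]


def imul(a: Interval, b: Interval) -> Interval:
    """Interval product: the extremes lie among the four corner products."""
    products = [a[0] * b[0], a[0] * b[1], a[1] * b[0], a[1] * b[1]]
    return (min(products), max(products))


def poly_range(coeffs: List[int], x: Interval) -> Interval:
    """Sound (possibly loose) bounds of a polynomial over x, via Horner's rule.
    coeffs are listed from the highest degree down to the constant term."""
    acc = (coeffs[0], coeffs[0])
    for c in coeffs[1:]:
        prod = imul(acc, x)
        acc = (prod[0] + c, prod[1] + c)
    return acc


def partition(coeffs: List[int], lo: int, hi: int) -> List[Tuple[int, int, bool]]:
    """Split [lo, hi] into maximal sub-ranges on which p(i) < 0 is constant."""
    if lo > hi:
        return []  # empty iteration space
    pmin, pmax = poly_range(coeffs, (lo, hi))
    if pmax < 0:
        return [(lo, hi, True)]   # condition holds on the whole range
    if pmin >= 0:
        return [(lo, hi, False)]  # condition fails on the whole range
    # Undecided: bisect. A single point is always decided, so this terminates.
    mid = (lo + hi) // 2
    parts = partition(coeffs, lo, mid) + partition(coeffs, mid + 1, hi)
    merged: List[Tuple[int, int, bool]] = []
    for p in parts:
        if merged and merged[-1][2] == p[2]:
            merged[-1] = (merged[-1][0], p[1], p[2])  # fuse equal neighbours
        else:
            merged.append(p)
    return merged


def original(n: int) -> List[int]:
    """Loop with the conditional tested on every iteration."""
    out = []
    for i in range(n):
        if i * i - 10 * i + 21 < 0:
            out.append(2 * i)   # hypothetical "true" path
        else:
            out.append(i + 1)   # hypothetical "false" path
    return out


def transformed(n: int) -> List[int]:
    """Same computation, one branch-free loop per partition."""
    out = []
    for lo, hi, truth in partition([1, -10, 21], 0, n - 1):
        if truth:
            out.extend(2 * i for i in range(lo, hi + 1))
        else:
            out.extend(i + 1 for i in range(lo, hi + 1))
    return out


print(partition([1, -10, 21], 0, 15))
# [(0, 3, False), (4, 6, True), (7, 15, False)]
```

In a compiler the partition would be computed once at compile time and the sub-loops emitted as separate code, so the conditional disappears from the loop bodies; here it is computed at run time only to keep the sketch self-contained.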
This paper presents a fully automatic approach to loop parallelization that integrates the use of s...
Nested loops present the most critical sections in several embedded real-tim...
The memory bandwidth largely determines the performance of embedded systems. However, very often com...
This paper presents a novel source code transformation for control flow optimization called loop nes...
This paper presents the influence of the loop nest splitting source code optimization on the worst-c...
Control divergence poses many problems in parallelizing loops. While predicated execution is commonl...
Numerous code optimization techniques, including loop nest optimizations, have been developed over t...
Parallelizing compilers promise to exploit the parallelism available in a given program, particularl...
Understanding the performance impact of compiler optimizations on superscalar processors is complica...
Irregular control-flow structures like deeply nested conditional branches are common in real-world s...
Embedded systems are usually constrained in terms of timing, power, and memory. Many embedded applic...
Over the past 20 years, increases in processor speed have dramatically outstripped performance incre...
Previous research has shown existence of a huge potential of the coarse-grain parallelism in program...