Performance bounds represent the best achievable performance that can be delivered by target microarchitectures on specified workloads. Accurate performance bounds establish an efficient way to evaluate the performance potential of either code optimizations or architectural innovations. We advocate using performance bounds to guide code compilation. In this dissertation, we introduce a novel bound-guided approach to systematically regulate code-size related instruction level parallelism (ILP) optimizations, including tail duplication, loop unrolling, and if-conversion. Our approach is based on the notion of code size efficiency, which is defined as the ratio of ILP improvement over static code size increase. With such a notion, we (1) devel...
Current microprocessors improve performance by exploiting instruction-level parallelism (ILP). ILP h...
PhD ThesisCurrent microprocessors improve performance by exploiting instruction-level parallelism (I...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
We advocate using performance bounds to guide code optimizations. Accurate performance bounds establ...
In global scheduling for ILP processors, regionenlarging optimizations, especially tail duplication,...
To achieve high-performance on processors featuring ILP, most compilers apply locally a set of heuri...
Software pipelining is an effective technique to reduce cycle count by exploiting instruction level ...
Abstract-In an embedded system, the cost of storing a program on-chip can be as high as the cost of ...
The number of transistors as well as the frequency of processors have followed Moore's law for the p...
Production compilers have achieved a high level of maturity in terms of generating efficient code. C...
Memory bandwidth has become the performance bottleneck for memory intensive programs on modern proce...
This paper presents a model for simultaneous instruction selection, compaction, and register allocat...
The advent of data proliferation and electronic devices gets low execution time and energy consumpti...
This paper describes transformation techniques for out-of-core pro-grams (i.e., those that deal with...
Numerous code optimization techniques, including loop nest optimizations, have been developed over t...
Current microprocessors improve performance by exploiting instruction-level parallelism (ILP). ILP h...
PhD ThesisCurrent microprocessors improve performance by exploiting instruction-level parallelism (I...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
We advocate using performance bounds to guide code optimizations. Accurate performance bounds establ...
In global scheduling for ILP processors, regionenlarging optimizations, especially tail duplication,...
To achieve high-performance on processors featuring ILP, most compilers apply locally a set of heuri...
Software pipelining is an effective technique to reduce cycle count by exploiting instruction level ...
Abstract-In an embedded system, the cost of storing a program on-chip can be as high as the cost of ...
The number of transistors as well as the frequency of processors have followed Moore's law for the p...
Production compilers have achieved a high level of maturity in terms of generating efficient code. C...
Memory bandwidth has become the performance bottleneck for memory intensive programs on modern proce...
This paper presents a model for simultaneous instruction selection, compaction, and register allocat...
The advent of data proliferation and electronic devices gets low execution time and energy consumpti...
This paper describes transformation techniques for out-of-core pro-grams (i.e., those that deal with...
Numerous code optimization techniques, including loop nest optimizations, have been developed over t...
Current microprocessors improve performance by exploiting instruction-level parallelism (ILP). ILP h...
PhD ThesisCurrent microprocessors improve performance by exploiting instruction-level parallelism (I...
As the demand increases for high performance and power efficiency in modern computer runtime systems...