Traditional compilers are limited in their ability to optimize applications for different architectures because statically modeling the effect of specific optimizations on different hardware implementations is difficult. Recent research has been addressing this issue through the use of empirical tuning, which uses trial executions to determine the optimization parameters that are most effective on a particular hardware platform. In this paper, we investigate empirical tuning of loop fusion, an important transformation for optimizing a significant class of real-world applications. In spite of its usefulness, fusion has attracted little attention from previous empirical tuning research, partially because it is much harder to configure than tr...
Production compilers have achieved a high level of maturity in terms of generating efficient code. C...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
The advent of data proliferation and electronic devices gets low execution time and energy consumpti...
Loop fusion is recognized as an effective transformation for improving memory hierarchy pe...
We are facing an increasing performance gap between processor and memory speed on today'...
The memory bandwidth largely determines the performance of embedded systems. However, very often com...
Modern compilers implement a number of optimization switches and they must be configured carefully i...
Compile-time optimizations generally improve program performance. Nevertheless, degradations caused ...
Current compilers fail to deliver satisfactory levels of performance on modern processors, due to ra...
In recent years, several approaches have been proposed to use profile information in compiler optimi...
The complexity of modern architectures requires compilers to apply an increasingly large coll...
Numerous code optimization techniques, including loop nest optimizations, have been developed over t...
Today’s compilers have a plethora of optimizations-transformations to choose from, and the correct c...