Abstract. Traditional compilers are limited in their ability to optimize applications for different architectures because statically modeling the effect of specific optimizations on different hardware implementations is difficult. Recent research has been addressing this issue through the use of empirical tuning, which uses trial executions to determine the optimization parameters that are most effective on a particular hardware platform. In this paper, we investigate empirical tuning of loop fusion, an important transformation for optimizing a significant class of real-world applications. In spite of its usefulness, fusion has attracted little attention from previous empirical tuning research, partially because it is much harder to configu...
The advent of data proliferation and electronic devices gets low execution time and energy consumpti...
Loop transformations have been used by compilers to optimize the performance of codes. However, it i...
Production compilers have achieved a high level of maturity in terms of generating efficient code. C...
Traditional compilers are limited in their ability to optimize applications for different architectu...
Abstract: Loop fusion is recognized as an effective transformation for improving memory hierarchy pe...
UnrestrictedWe are facing an increasing performance gap between processor and memory speed on today'...
The memory bandwidth largely determines the performance of embedded systems. However, very often com...
Modern compilers implement a number of optimization switches and they must be configured carefully i...
Compile-time optimizations generally improve program performance. Nevertheless, degradations caused ...
Current compilers fail to deliver satisfactory levels of performance on modern processors, due to ra...
In recent years, several approaches have been proposed to use profile information in compiler optimi...
Numerous code optimization techniques, including loop nest optimizations, have been developed over t...
AbstractThe complexity of modern architectures require compilers to apply an increasingly large coll...
Today’s compilers have a plethora of optimizations-transformations to choose from, and the correct c...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
The advent of data proliferation and electronic devices gets low execution time and energy consumpti...
Loop transformations have been used by compilers to optimize the performance of codes. However, it i...
Production compilers have achieved a high level of maturity in terms of generating efficient code. C...
Traditional compilers are limited in their ability to optimize applications for different architectu...
Abstract: Loop fusion is recognized as an effective transformation for improving memory hierarchy pe...
UnrestrictedWe are facing an increasing performance gap between processor and memory speed on today'...
The memory bandwidth largely determines the performance of embedded systems. However, very often com...
Modern compilers implement a number of optimization switches and they must be configured carefully i...
Compile-time optimizations generally improve program performance. Nevertheless, degradations caused ...
Current compilers fail to deliver satisfactory levels of performance on modern processors, due to ra...
In recent years, several approaches have been proposed to use profile information in compiler optimi...
Numerous code optimization techniques, including loop nest optimizations, have been developed over t...
AbstractThe complexity of modern architectures require compilers to apply an increasingly large coll...
Today’s compilers have a plethora of optimizations-transformations to choose from, and the correct c...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
The advent of data proliferation and electronic devices gets low execution time and energy consumpti...
Loop transformations have been used by compilers to optimize the performance of codes. However, it i...
Production compilers have achieved a high level of maturity in terms of generating efficient code. C...