Abstract — A key step in program optimization is the estimation of optimal values for parameters such as tile sizes and loop unrolling factors. Traditional compilers use simple analytical models to compute these values. In contrast, library generators like ATLAS use global search over the space of parameter values by generating programs with many different combinations of parameter values, and running them on the actual hardware to determine which values give the best performance. It is widely believed that traditional model-driven optimization cannot compete with search-based empirical optimization because tractable analytical models cannot capture all the complexities of modern high-performance architectures, but few quantitative comparis...
This paper proposes the use of empirical modeling techniques for building microarchitecture sensitiv...
International audienceToday's multi-core era places significant demands on an optimizing compiler, w...
Abstract. The goal of this work is a systematic approach to compiler optimization for simultaneously...
A major difference between compilers, and library generators such as ATLAS or FFTW is that compilers...
A key step in program optimization is the determination of optimal values for code optimization par...
For scientific array-based programs, optimization for a particular target platform is a hard problem...
Abstract. In many cases, simple analytical models used by traditional compilers are no longer able t...
Library generators like ATLAS generate high-performance BLAS by performing a global, empirical searc...
UnrestrictedWe are facing an increasing performance gap between processor and memory speed on today'...
Abstract. In recent years, a number of strategies have emerged for em-pirically tuning applications ...
Abstract. The increasing complexities of modern architectures require compilers to extensively apply...
AbstractEmpirical performance optimization of computer codes using autotuners has received significa...
Achieving peak performance from the computational ker-nels that dominate application performance oft...
Today's multi-core era places significant demands on an optimizing compiler, which must parallelize ...
With the projected increase in hardware error rates in the future, software needs to be resilient to...
This paper proposes the use of empirical modeling techniques for building microarchitecture sensitiv...
International audienceToday's multi-core era places significant demands on an optimizing compiler, w...
Abstract. The goal of this work is a systematic approach to compiler optimization for simultaneously...
A major difference between compilers, and library generators such as ATLAS or FFTW is that compilers...
A key step in program optimization is the determination of optimal values for code optimization par...
For scientific array-based programs, optimization for a particular target platform is a hard problem...
Abstract. In many cases, simple analytical models used by traditional compilers are no longer able t...
Library generators like ATLAS generate high-performance BLAS by performing a global, empirical searc...
UnrestrictedWe are facing an increasing performance gap between processor and memory speed on today'...
Abstract. In recent years, a number of strategies have emerged for em-pirically tuning applications ...
Abstract. The increasing complexities of modern architectures require compilers to extensively apply...
AbstractEmpirical performance optimization of computer codes using autotuners has received significa...
Achieving peak performance from the computational ker-nels that dominate application performance oft...
Today's multi-core era places significant demands on an optimizing compiler, which must parallelize ...
With the projected increase in hardware error rates in the future, software needs to be resilient to...
This paper proposes the use of empirical modeling techniques for building microarchitecture sensitiv...
International audienceToday's multi-core era places significant demands on an optimizing compiler, w...
Abstract. The goal of this work is a systematic approach to compiler optimization for simultaneously...