Abstract. Achieving peak performance in important numerical kernels such as dense matrix multiply or sparse-matrix vector multiplication usu-ally requires extensive, machine-dependent tuning by hand. In response, a number automatic tuning systems have been developed which typi-cally operate by (1) generating multiple implementations of a kernel, and (2) empirically selecting an optimal implementation. One such sys-tem is FFTW (Fastest Fourier Transform in the West) for the discrete Fourier transform. In this paper, we review FFTW's inner workings with an emphasis on its code generator, and report on our empirical evalua-tion of the system on two dierent hardware and compiler platforms. We then describe a number of our own extensions to...
The native implementation of the N-point digital Fourier Transform involves calculating the scalar p...
The Sparse Fast Fourier Transform is a recent algorithm developed by Hassanieh et al. at MIT for Dis...
Several SOA (state of the art) self-tuning software libraries exist, such as the Fastest Fourier Tra...
FFTW is an implementation of the discrete Fourier transform (DFT) that adapts to the hardware in ord...
We present an auto-tuning framework for FFTs on graphics pro-cessors (GPUs). Due to complex design o...
ii Computing the discrete Fourier transform is one of the most important in ap-plied computer scienc...
This paper presents a high-level compiler that generates hardware implementations of the discrete Fo...
This paper describes FFTW, a portable C package for computing the one- and multidimen-sional complex...
FFTW library for computing the discrete Fourier transform (DFT) has gained a wide acceptance in both...
This paper considers the optimization of resource utilization for three FFT algorithms, as it pertai...
Mathematical software for the Fast Fourier Transform We present a library for computing the Fast Fo...
Li, XiaomingGenerating high performance fast Fourier transform(FFT) libraries for different compute...
Abstract. This paper introduces a formal framework for automatically generating performance optimize...
Ideally, computational libraries and frameworks should oer developers two key benefits. First, they ...
Abstract — Fast changing, increasingly complex, and diverse computing platforms pose central problem...
The native implementation of the N-point digital Fourier Transform involves calculating the scalar p...
The Sparse Fast Fourier Transform is a recent algorithm developed by Hassanieh et al. at MIT for Dis...
Several SOA (state of the art) self-tuning software libraries exist, such as the Fastest Fourier Tra...
FFTW is an implementation of the discrete Fourier transform (DFT) that adapts to the hardware in ord...
We present an auto-tuning framework for FFTs on graphics pro-cessors (GPUs). Due to complex design o...
ii Computing the discrete Fourier transform is one of the most important in ap-plied computer scienc...
This paper presents a high-level compiler that generates hardware implementations of the discrete Fo...
This paper describes FFTW, a portable C package for computing the one- and multidimen-sional complex...
FFTW library for computing the discrete Fourier transform (DFT) has gained a wide acceptance in both...
This paper considers the optimization of resource utilization for three FFT algorithms, as it pertai...
Mathematical software for the Fast Fourier Transform We present a library for computing the Fast Fo...
Li, XiaomingGenerating high performance fast Fourier transform(FFT) libraries for different compute...
Abstract. This paper introduces a formal framework for automatically generating performance optimize...
Ideally, computational libraries and frameworks should oer developers two key benefits. First, they ...
Abstract — Fast changing, increasingly complex, and diverse computing platforms pose central problem...
The native implementation of the N-point digital Fourier Transform involves calculating the scalar p...
The Sparse Fast Fourier Transform is a recent algorithm developed by Hassanieh et al. at MIT for Dis...
Several SOA (state of the art) self-tuning software libraries exist, such as the Fastest Fourier Tra...