Algorithmic choice is essential in any problem domain to realizing optimal computational performance. Multigrid is a prime example: not only is it possible to make choices at the highest grid resolution, but a program can switch techniques as the problem is recursively attacked on coarser grid levels to take advantage of algorithms with different scaling behaviors. Additionally, users with different convergence criteria must experiment with parameters to yield a tuned algorithm that meets their accuracy requirements. Even after a tuned algorithm has been found, users often have to start all over when migrating from one machine to another. We present an algorithm and autotuning methodology that address these issues in a near-optimal and effi...
In high-performance computing, excellent node-level performance is required for the efficient use of...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Mathematics, 2011.Cataloged from PD...
The recent transformation from an environment where gains in computational performance came from inc...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
It is often impossible to obtain a one-size-fits-all solution for high performance algorithms when c...
It is often impossible to obtain a one-size-fits-all solution for high performance algorithms when c...
GPUs, with their high bandwidths and computational capabilities are an increasingly popular target f...
Multi-grid methods are numerical algorithms used in parallel and distributed processing. The main id...
It is often impossible to obtain a one-size-fits-all solution for high performance algorithms when c...
Modern high performance libraries, such as ATLAS and FFTW, and programming languages, such as PetaBr...
A daunting challenge faced by program performance autotuning is input sensitivity, where the best au...
160 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2007.The experimental results moti...
Abstract—Autotuning systems intelligently navigate a search space of possible implementations of a c...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
Empirical autotuning is increasingly being used in many domains to achieve optimized performance in ...
In high-performance computing, excellent node-level performance is required for the efficient use of...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Mathematics, 2011.Cataloged from PD...
The recent transformation from an environment where gains in computational performance came from inc...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
It is often impossible to obtain a one-size-fits-all solution for high performance algorithms when c...
It is often impossible to obtain a one-size-fits-all solution for high performance algorithms when c...
GPUs, with their high bandwidths and computational capabilities are an increasingly popular target f...
Multi-grid methods are numerical algorithms used in parallel and distributed processing. The main id...
It is often impossible to obtain a one-size-fits-all solution for high performance algorithms when c...
Modern high performance libraries, such as ATLAS and FFTW, and programming languages, such as PetaBr...
A daunting challenge faced by program performance autotuning is input sensitivity, where the best au...
160 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2007.The experimental results moti...
Abstract—Autotuning systems intelligently navigate a search space of possible implementations of a c...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
Empirical autotuning is increasingly being used in many domains to achieve optimized performance in ...
In high-performance computing, excellent node-level performance is required for the efficient use of...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Mathematics, 2011.Cataloged from PD...
The recent transformation from an environment where gains in computational performance came from inc...