Autotuning technology has emerged recently as a systematic process for evaluating alternative implementations of a computation, in order to select the best-performing solution for a particular architecture. Specialization optimizes code customized to a particular class of input data sets. In this paper, we demonstrate how compiler-based autotuning that incorporates specialization for expected data set sizes of key computations can be used to speed up Nek5000, a spectral-element code. Nek5000 makes heavy use of what are effectively Basic Linear Algebra Subroutine (BLAS) calls, but for very small matrices. Through autotuning and specialization, we can achieve significant performance gains over hand-tuned libraries (e.g., Goto, ATLAS, and ACM...
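The idea of combining autotuning with specialization for a known, small matrix size can be illustrated with a minimal sketch. It is not the compiler-based framework described in the abstract and does not use Nek5000 code; it is written in Python purely for readability, and all names (`matmul_ijk`, `matmul_ikj`, `make_specialized`, `autotune`) are hypothetical. The sketch generates one variant whose inner product is fully unrolled for a fixed size `n`, then times it empirically against two generic loop orderings and reports the fastest.

```python
import timeit


def matmul_ijk(a, b, n):
    """Generic triple loop in i-j-k order."""
    c = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            s = 0.0
            for k in range(n):
                s += a[i][k] * b[k][j]
            c[i][j] = s
    return c


def matmul_ikj(a, b, n):
    """Generic triple loop in i-k-j order (row-wise access to b)."""
    c = [[0.0] * n for _ in range(n)]
    for i in range(n):
        ci = c[i]
        for k in range(n):
            aik = a[i][k]
            bk = b[k]
            for j in range(n):
                ci[j] += aik * bk[j]
    return c


def make_specialized(n):
    """Generate a variant whose inner product is fully unrolled for a known n."""
    terms = " + ".join(f"a[i][{k}] * b[{k}][j]" for k in range(n))
    src = (
        "def matmul_unrolled(a, b, n):\n"
        f"    return [[{terms} for j in range({n})] for i in range({n})]\n"
    )
    namespace = {}
    exec(src, namespace)  # compile the size-specific source text
    return namespace["matmul_unrolled"]


def autotune(n, repeats=200):
    """Time every variant on n-by-n inputs and return (best_name, timings)."""
    a = [[float(i + j) for j in range(n)] for i in range(n)]
    b = [[float(i - j) for j in range(n)] for i in range(n)]
    variants = {
        "ijk": matmul_ijk,
        "ikj": matmul_ikj,
        "unrolled": make_specialized(n),
    }
    reference = matmul_ijk(a, b, n)
    timings = {}
    for name, fn in variants.items():
        # Reject any variant that does not reproduce the reference result.
        assert fn(a, b, n) == reference
        timings[name] = min(
            timeit.repeat(lambda: fn(a, b, n), number=repeats, repeat=3)
        )
    best = min(timings, key=timings.get)
    return best, timings
```

Calling `autotune(4)` returns the name of the fastest variant together with the measured times. Which variant wins depends on the machine and the interpreter, which is the reason for choosing empirically rather than by a fixed rule.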
The abstract relation between hardware parameters and program performance makes setting program para...
As computing hardware evolves, increasing core counts mean that memory bandwidth is becoming the dec...
An autotuner takes a parameterized code as input and tries to optimize the code by finding the best ...
It is rare for a programmer to solve a numerical problem with a single library call; most problems r...
Abstract—Autotuning systems intelligently navigate a search space of possible implementations of a c...
This report summarizes the progress made as part of a one year lab-directed research and development...
Today, scientific computing plays an important role in scientific research. People build supercomput...
Vectorization support in hardware continues to expand, and we still continue on supersca...
Autotuning is a method which enables a program to automatically choose the most suitable parameters ...
A plethora of program analysis and optimization techniques rely on linear programming at their heart...
Abstract. Empirical performance optimization of computer codes using autotuners has received significa...
Thesis (M.A.)--Özyeğin University, Graduate School of Sciences and Engineering, Department of Comput...