International audienceMany computationally-intensive algorithms benefit from the wide parallelism offered by Graphical Processing Units (GPUs). However , the search for a close-to-optimal implementation remains extremely tedious due to the specialization and complexity of GPU architectures.We present a novel approach to automatically discover the best performing code from a given set of possible implementations. It involves a branch and bound algorithm with two distinctive features: (1) an analytic performance model of a lower bound on the execution time, and (2) the ability to estimate such bounds on a partially-specified implementation.The unique features of this performance model allow to aggressively prune the optimization space without...
International audienceThis paper investigates the applicability of iterative search techniques in pr...
Over the last several decades we have witnessed tremendous change in the landscape of computer archi...
The ability to efficiently optimize or re-optimize an algorithm for high performance on a particular...
Compilers looking for an efficient implementation of a function must find which optimizations are th...
International audienceBranch-and-Bound (B&B) algorithms are time intensive tree-based exploration me...
Abstract. In recent years, a number of strategies have emerged for em-pirically tuning applications ...
As computer architectures become more complex, the task of writing efficient program to best utilize...
International audienceBranch-and-bound (B&B) algorithms are attractive methods for solving to optima...
Recent years have witnessed phenomenal growth in the application, and capabilities of Graphical Proc...
We propose a generalized method for adapting and optimizing algorithms for efficient execution on mo...
For scientific array-based programs, optimization for a particular target platform is a hard problem...
Abstract- Future computing systems, from handhelds to su-percomputers, will undoubtedly be more para...
International audienceIn this paper, we develop an approach to GPU kernel optimization by focusing o...
International audienceSolving exactly Combinatorial Optimization Problems (COPs) using a Branch-and-...
Abstract — A key step in program optimization is the estimation of optimal values for parameters suc...
International audienceThis paper investigates the applicability of iterative search techniques in pr...
Over the last several decades we have witnessed tremendous change in the landscape of computer archi...
The ability to efficiently optimize or re-optimize an algorithm for high performance on a particular...
Compilers looking for an efficient implementation of a function must find which optimizations are th...
International audienceBranch-and-Bound (B&B) algorithms are time intensive tree-based exploration me...
Abstract. In recent years, a number of strategies have emerged for em-pirically tuning applications ...
As computer architectures become more complex, the task of writing efficient program to best utilize...
International audienceBranch-and-bound (B&B) algorithms are attractive methods for solving to optima...
Recent years have witnessed phenomenal growth in the application, and capabilities of Graphical Proc...
We propose a generalized method for adapting and optimizing algorithms for efficient execution on mo...
For scientific array-based programs, optimization for a particular target platform is a hard problem...
Abstract- Future computing systems, from handhelds to su-percomputers, will undoubtedly be more para...
International audienceIn this paper, we develop an approach to GPU kernel optimization by focusing o...
International audienceSolving exactly Combinatorial Optimization Problems (COPs) using a Branch-and-...
Abstract — A key step in program optimization is the estimation of optimal values for parameters suc...
International audienceThis paper investigates the applicability of iterative search techniques in pr...
Over the last several decades we have witnessed tremendous change in the landscape of computer archi...
The ability to efficiently optimize or re-optimize an algorithm for high performance on a particular...