We propose a generalized method for adapting and optimizing algorithms for efficient execution on modern graphics processing units (GPU). The method consists of several steps. First, build a control flow graph (CFG) of the algorithm. Next, transform the CFG into a tree of loops and merge non-parallelizable loops into parallelizable ones. Finally, map the resulting loops tree to the tree of GPU computational units, unrolling the algorithm's loops as necessary for the match. The method provides a convenient and robust mental framework and strategy for GPU code optimization. We demonstrate the method by adapting a backtracking search algorithm to the GPU platform and building an optimized implementation of the ResNeXt-50 neural network. Distri...
The Graphics Processing Unit (GPU) parallel architecture is now being used not just for graphics but...
Neural networks stand out from artificial intelligence because they can complete challenging tasks, ...
Graphics Processing Units (GPUs) have revolutionized the computing landscape in the past decade and ...
Abstract—Recent years have seen a trend in using graphic pro-cessing units (GPU) as accelerators for...
In this thesis we investigate the relation between the structure of input graphs and the performance...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
Abstract—Graphs are common data structures for many applications, and efficient graph processing is ...
The relentless demands for improvements in the compute throughput, and energy efficiency have driven...
This paper presents a novel optimizing compiler for general purpose computation on graphics processi...
In this paper, we describe our work to investigate how much cyclic graph based Genetic Programming (...
This paper presents a novel optimizing compiler for general purpose computation on graphics processi...
Graphs are de facto data structures for many applications, and efficient graph processing is a must ...
Modern Graphics Processing Units (GPUs) provide high computation power at low costs and have been de...
Abstract—Many general-purpose applications exploit Graphics Processing Units (GPUs) by executing a s...
The focus of this work is the automatic performance tuning of stencil computations on Graphics Proce...
The Graphics Processing Unit (GPU) parallel architecture is now being used not just for graphics but...
Neural networks stand out from artificial intelligence because they can complete challenging tasks, ...
Graphics Processing Units (GPUs) have revolutionized the computing landscape in the past decade and ...
Abstract—Recent years have seen a trend in using graphic pro-cessing units (GPU) as accelerators for...
In this thesis we investigate the relation between the structure of input graphs and the performance...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
Abstract—Graphs are common data structures for many applications, and efficient graph processing is ...
The relentless demands for improvements in the compute throughput, and energy efficiency have driven...
This paper presents a novel optimizing compiler for general purpose computation on graphics processi...
In this paper, we describe our work to investigate how much cyclic graph based Genetic Programming (...
This paper presents a novel optimizing compiler for general purpose computation on graphics processi...
Graphs are de facto data structures for many applications, and efficient graph processing is a must ...
Modern Graphics Processing Units (GPUs) provide high computation power at low costs and have been de...
Abstract—Many general-purpose applications exploit Graphics Processing Units (GPUs) by executing a s...
The focus of this work is the automatic performance tuning of stencil computations on Graphics Proce...
The Graphics Processing Unit (GPU) parallel architecture is now being used not just for graphics but...
Neural networks stand out from artificial intelligence because they can complete challenging tasks, ...
Graphics Processing Units (GPUs) have revolutionized the computing landscape in the past decade and ...