Branch divergence is a very commonly occurring performance problem in GPGPU in which the execution of diverging branches is serialized to execute only one control flow path at a time. Existing hardware mechanism to reconverge threads using a stack causes duplicate execution of code for unstructured control flow graphs. Also the stack mechanism cannot effectively utilize the available parallelism among diverging branches. Further, the amount of nested divergence allowed is also limited by depth of the branch divergence stack. In this paper we propose a simple and elegant transformation to handle all of the above mentioned problems. The transformation converts an unstructured CFG to a structured CFG without duplicating user code. It incurs on...
National audienceParallel architectures following the SIMT model such as GPUs benefit from applicati...
International audienceThe increasing popularity of Graphics Processing Units (GPUs), has brought ren...
Recent advances in graphics processing units (GPUs) have resulted in massively parallel hard-ware th...
Branch divergence is a very commonly occurring performance problem in GPGPU in which the execution o...
Graphic processing units (GPUs) are composed of a group of single-instruction multiple data (SIMD) s...
Current graphics processing units (GPUs) utilize the single in-struction multiple thread (SIMT) exec...
Irregular control-flow structures like deeply nested conditional branches are common in real-world s...
General Purpose Graphical Processing Units (GPGPUs) rose to prominence with the release of the Fermi...
Branch divergence has a significant impact on the perfor-mance of GPU programs. We propose two novel...
We present an abstract interpretation technique to automatically build a Control Flow Graph (CFG) re...
We propose a generalized method for adapting and optimizing algorithms for efficient execution on mo...
Parallel architectures following the SIMT model such as GPUs benefit from application regularity by ...
There has been a tremendous growth in the use of Graphics Processing Units (GPU) for the acceleratio...
Abstract—Data-parallel architectures must provide efficient support for complex control-flow constru...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
National audienceParallel architectures following the SIMT model such as GPUs benefit from applicati...
International audienceThe increasing popularity of Graphics Processing Units (GPUs), has brought ren...
Recent advances in graphics processing units (GPUs) have resulted in massively parallel hard-ware th...
Branch divergence is a very commonly occurring performance problem in GPGPU in which the execution o...
Graphic processing units (GPUs) are composed of a group of single-instruction multiple data (SIMD) s...
Current graphics processing units (GPUs) utilize the single in-struction multiple thread (SIMT) exec...
Irregular control-flow structures like deeply nested conditional branches are common in real-world s...
General Purpose Graphical Processing Units (GPGPUs) rose to prominence with the release of the Fermi...
Branch divergence has a significant impact on the perfor-mance of GPU programs. We propose two novel...
We present an abstract interpretation technique to automatically build a Control Flow Graph (CFG) re...
We propose a generalized method for adapting and optimizing algorithms for efficient execution on mo...
Parallel architectures following the SIMT model such as GPUs benefit from application regularity by ...
There has been a tremendous growth in the use of Graphics Processing Units (GPU) for the acceleratio...
Abstract—Data-parallel architectures must provide efficient support for complex control-flow constru...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
National audienceParallel architectures following the SIMT model such as GPUs benefit from applicati...
International audienceThe increasing popularity of Graphics Processing Units (GPUs), has brought ren...
Recent advances in graphics processing units (GPUs) have resulted in massively parallel hard-ware th...