Compiler-assisted workload consolidation to efficiently exploit dynamic parallelism for recursive applications

Wu, Hancheng

Publisher

University of Missouri--Columbia

Abstract

GPUs have been widely used to parallelize and accelerate applications for its high throughput. Traditionally, a GPU function can only be launched from the CPU side. This results in the fact that GPUs are preferable for those application which express a flat data parallelism, a simple data parallelism that is known at compiling time and can be easily distributed to different GPU blocks and threads. However, for those applications that contain nested data parallelism, which is not known a priori and can only be discovered at running time, it is difficult to write a GPU function that achieve high performance on parallelization and acceleration. One can easily end up with either a too coarse-grained or too fine-grained GPU function. Since Keple...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Compiler-assisted workload consolidation to efficiently exploit dynamic parallelism for recursive applications

Abstract

Extracted data

Compiler-assisted workload consolidation to efficiently exploit dynamic parallelism for recursive applications

Abstract

Extracted data

Related items

Related items