Acceleration and optimization of dynamic parallelism for irregular applications on GPUs

Wang, Jin

Publication date

January 2017

Publisher

Georgia Institute of Technology

Abstract

The objective of this thesis is the development, implementation and optimization of a GPU execution model extension that efficiently supports time-varying, nested, fine-grained dynamic parallelism occurring in the irregular data intensive applications. These dynamically formed pockets of structured parallelism can utilize the recently introduced device-side nested kernel launch capabilities on GPUs. However, the low utilization of GPU resources and the high cost of the device kernel launch make it still difficult to harness dynamic parallelism on GPUs. This thesis then presents an extension to the common Bulk Synchronous Parallel (BSP) GPU execution model -- Dynamic Thread Block Launch (DTBL), which provides the capability of spawning li...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Acceleration and optimization of dynamic parallelism for irregular applications on GPUs

Abstract

Extracted data

Acceleration and optimization of dynamic parallelism for irregular applications on GPUs

Abstract

Extracted data

Related items

Related items