This doctoral research aims at understanding the nature of the overhead for data irregular GPU workloads, proposing a solution, and examining the consequences of the result. We propose a novel, retry-free GPU workload scheduler for irregular workloads. When used in a Breadth First Search (BFS) algorithm, the proposed simple, monolithic concurrent queue scales to within 10% of ideal scalability on AMD’s Fiji GPU with 14,336 active threads. The dissertation presents an important finding that the retry overhead associated with Compare and Swap (CAS) operations is the principle reason why concurrent queues do not scale well as the number of clients increases in a massively multi-threaded environment
Accelerator-based systems are making rapid inroads into becoming platforms of choice for both high e...
High compute-density with massive thread-level parallelism of Graphics Processing Units (GPUs) is be...
Heterogeneous computing systems using one or more graphics processing units (GPUs) as accelerators p...
This doctoral research aims at understanding the nature of the overhead for data irregular GPU workl...
The Graphics Processing Unit (GPU) has become a more important component in high-performance computi...
We explore software mechanisms for managing irregular tasks on graphics processing units (GPUs). We ...
abstract: With the massive multithreading execution feature, graphics processing units (GPUs) have b...
<p>Heterogeneous processors with accelerators provide an opportunity to improve performance within a...
Massively parallel processing devices, like Graphics Processing Units (GPUs), have the ability to ac...
Abstract—Although GPGPUs are traditionally used to accel-erate workloads with regular control and me...
Graphic Processing Units (GPUs) are currently widely used in High Performance Computing (HPC) applic...
The efficiency of concurrent data structures is crucial to the performance of multi-threaded program...
In the last decade, there has been a wide scale adoption of Graphics Processing Units (GPUs) as a co...
In recent years, GPGPUs have experienced tremendous growth as general-purpose and high-throughput co...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
Accelerator-based systems are making rapid inroads into becoming platforms of choice for both high e...
High compute-density with massive thread-level parallelism of Graphics Processing Units (GPUs) is be...
Heterogeneous computing systems using one or more graphics processing units (GPUs) as accelerators p...
This doctoral research aims at understanding the nature of the overhead for data irregular GPU workl...
The Graphics Processing Unit (GPU) has become a more important component in high-performance computi...
We explore software mechanisms for managing irregular tasks on graphics processing units (GPUs). We ...
abstract: With the massive multithreading execution feature, graphics processing units (GPUs) have b...
<p>Heterogeneous processors with accelerators provide an opportunity to improve performance within a...
Massively parallel processing devices, like Graphics Processing Units (GPUs), have the ability to ac...
Abstract—Although GPGPUs are traditionally used to accel-erate workloads with regular control and me...
Graphic Processing Units (GPUs) are currently widely used in High Performance Computing (HPC) applic...
The efficiency of concurrent data structures is crucial to the performance of multi-threaded program...
In the last decade, there has been a wide scale adoption of Graphics Processing Units (GPUs) as a co...
In recent years, GPGPUs have experienced tremendous growth as general-purpose and high-throughput co...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
Accelerator-based systems are making rapid inroads into becoming platforms of choice for both high e...
High compute-density with massive thread-level parallelism of Graphics Processing Units (GPUs) is be...
Heterogeneous computing systems using one or more graphics processing units (GPUs) as accelerators p...