Modern supercomputers now use accelerators to achieve their performance with the most widely used accelerator being the Graphics Processing Unit (GPU). However, achieving the performance potential of systems that combine a GPU and CPU is an arduous task which could be made easier with the assistance of the compiler or runtime. In particular, exploiting two features of GPU architectures -- distributed memory and concurrent kernel execution -- is critical to achieve good performance, but in current GPU programming systems, programmers must exploit them manually. This can lead to poor performance. In this thesis, we propose automatic techniques that: i) perform data transfers between the CPU and GPU, ii) allocate resources for concurrent kerne...
The rapid development in computing technology has paved the way for directive-based programming mode...
Recent NVIDIA Graphics Processing Units (GPUs) can execute multiple kernels concurrently. On these G...
There is growing interest in accelerating irregular data-parallel algorithms on GPUs. These algorith...
Exploiting the performance potential of GPUs requires managing the data transfers to and from them e...
\ud \ud Exploiting the performance potential of GPUs requires managing the data transfers to and fro...
abstract: With the massive multithreading execution feature, graphics processing units (GPUs) have b...
Graphics Processing Units (GPU) have been widely adopted to accelerate the execution of HPC workload...
Each new generation of GPUs vastly increases the resources avail-able to GPGPU programs. GPU program...
Heterogeneous supercomputers that incorporate computational accelerators such as GPUs are increasing...
Each new generation of GPUs vastly increases the resources available to GPGPU programs. GPU programm...
Enhancing the match between software executions and hardware features is key to computing efficiency...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
<p>The continued growth of the computational capability of throughput processors has made throughput...
Graphics Processing Unit (GPU)-based architectures have become the default accelerator choice for a ...
2018-02-23Graphics Processing Units (GPUs) are designed primarily to execute multimedia, and game re...
The rapid development in computing technology has paved the way for directive-based programming mode...
Recent NVIDIA Graphics Processing Units (GPUs) can execute multiple kernels concurrently. On these G...
There is growing interest in accelerating irregular data-parallel algorithms on GPUs. These algorith...
Exploiting the performance potential of GPUs requires managing the data transfers to and from them e...
\ud \ud Exploiting the performance potential of GPUs requires managing the data transfers to and fro...
abstract: With the massive multithreading execution feature, graphics processing units (GPUs) have b...
Graphics Processing Units (GPU) have been widely adopted to accelerate the execution of HPC workload...
Each new generation of GPUs vastly increases the resources avail-able to GPGPU programs. GPU program...
Heterogeneous supercomputers that incorporate computational accelerators such as GPUs are increasing...
Each new generation of GPUs vastly increases the resources available to GPGPU programs. GPU programm...
Enhancing the match between software executions and hardware features is key to computing efficiency...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
<p>The continued growth of the computational capability of throughput processors has made throughput...
Graphics Processing Unit (GPU)-based architectures have become the default accelerator choice for a ...
2018-02-23Graphics Processing Units (GPUs) are designed primarily to execute multimedia, and game re...
The rapid development in computing technology has paved the way for directive-based programming mode...
Recent NVIDIA Graphics Processing Units (GPUs) can execute multiple kernels concurrently. On these G...
There is growing interest in accelerating irregular data-parallel algorithms on GPUs. These algorith...