We explore software mechanisms for managing irregular tasks on graphics processing units (GPUs). We demonstrate that dynamic scheduling and efficient memory management are critical problems in achieving high efficiency on irregular workloads. We experiment with several task-management techniques, ranging from the use of a single monolithic task queue to distributed queuing with task stealing and donation. On irregular workloads, we show that both centralized and distributed queues have more than 100 times as much idle times as our task-stealing and -donation queues. Our preferred choice is task-donation because of comparable performance to task-stealing while using less memory overhead. To help in this analysis, we use an artificial task-ma...
Modern consumer-grade 3D graphic cards boast a computation/memory resource that can easily rival or ...
International audienceThe race for Exascale computing has naturally led the current technologies to ...
In recent years, GPGPUs have experienced tremendous growth as general-purpose and high-throughput co...
We explore software mechanisms for managing irregular tasks on graphics processing units (GPUs). We ...
International audienceThe use of accelerators such as GPUs has become mainstream to achieve high per...
Heterogeneous computing systems using one or more graphics processing units (GPUs) as accelerators p...
This doctoral research aims at understanding the nature of the overhead for data irregular GPU workl...
International audienceGraphics Processing units (GPU) have become a valuable support for High Perfor...
The computational power provided by many-core graph-ics processing units (GPUs) has been exploited i...
A now-classical way of meeting the increasing demand for computing speed by HPC applications is the ...
We present a task-parallel programming model for the GPU. Our task model is robust enough to handle ...
In order to satisfy timing constraints, modern real-time applications require massively parallel acc...
Task parallelism is omnipresent these days; whether in data mining or machine learning, for matrix f...
In this study, we provide an extensive survey on wide spectrum of scheduling methods for multitaskin...
We propose a GPU fine-grained load-balancing abstraction that decouples load balancing from work pro...
Modern consumer-grade 3D graphic cards boast a computation/memory resource that can easily rival or ...
International audienceThe race for Exascale computing has naturally led the current technologies to ...
In recent years, GPGPUs have experienced tremendous growth as general-purpose and high-throughput co...
We explore software mechanisms for managing irregular tasks on graphics processing units (GPUs). We ...
International audienceThe use of accelerators such as GPUs has become mainstream to achieve high per...
Heterogeneous computing systems using one or more graphics processing units (GPUs) as accelerators p...
This doctoral research aims at understanding the nature of the overhead for data irregular GPU workl...
International audienceGraphics Processing units (GPU) have become a valuable support for High Perfor...
The computational power provided by many-core graph-ics processing units (GPUs) has been exploited i...
A now-classical way of meeting the increasing demand for computing speed by HPC applications is the ...
We present a task-parallel programming model for the GPU. Our task model is robust enough to handle ...
In order to satisfy timing constraints, modern real-time applications require massively parallel acc...
Task parallelism is omnipresent these days; whether in data mining or machine learning, for matrix f...
In this study, we provide an extensive survey on wide spectrum of scheduling methods for multitaskin...
We propose a GPU fine-grained load-balancing abstraction that decouples load balancing from work pro...
Modern consumer-grade 3D graphic cards boast a computation/memory resource that can easily rival or ...
International audienceThe race for Exascale computing has naturally led the current technologies to ...
In recent years, GPGPUs have experienced tremendous growth as general-purpose and high-throughput co...