Nowadays, GPU clusters are available in almost every data processing center. Their GPUs are typically shared by applications that may have different processing needs and/or different priority levels. Because current GPUs do not support hardware-based preemption, the required Quality of Service (QoS) cannot be ensured when application kernels are offloaded to the devices. In this work, we present an efficient, low-overhead software preemption mechanism that evicts and relaunches GPU kernels to support different preemptive scheduling policies. We also propose a new fairness-based scheduler, the Fair and Responsive Scheduler (FRS), that takes into account the current value of the kernels' slowdown t...
The use of accelerators such as GPUs has become mainstream to achieve high per...
The widespread deployment of NVIDIA GPUs in latency-sensitive systems today requires predictable GPU...
Heterogeneous systems consisting of multiple CPUs and GPUs are increasingly attractive as platforms ...
Blocking synchronisation idioms, e.g. mutexes and barriers, play an important role in concurrent pro...
Blocking synchronisation idioms, e.g. mutexes and barriers, play an important role in concurrent pro...
GPUs are being increasingly adopted as compute accelerators in many domains, spanning environments f...
Many emerging cyber-physical systems, such as autonomous vehicles, have both extreme computation and...
Modern GPUs allow concurrent kernel execution and preemption to improve hardwa...
Modern automotive-grade embedded computing platforms feature high-performance Graphics Processing Un...
Recent NVIDIA Graphics Processing Units (GPUs) can execute multiple kernels concurrently. On these G...
In this study, we provide an extensive survey on a wide spectrum of scheduling methods for multitaskin...
Today’s operating systems treat GPUs and other computational accelerators as if they were simple dev...
There is growing interest in accelerating irregular data-parallel algorithms on GPUs. These algorith...
Heterogeneous platforms play an increasingly important role in modern computer systems. They combin...
GPGPUs (General-Purpose Graphics Processing Units) provide massive computational power. However, appl...