In this paper, we present two conceptual frameworks for GPU applications to adjust their task execution times based on total workload. These frameworks enable smart GPU resource management when many applications share GPU resources while the workloads of those applications vary. Application developers can explicitly adjust the number of GPU cores depending on their needs. An implicit adjust-ment will be supported by a run-time framwork, which dy-namically allocates the number of cores to tasks based on the total workload. The runtime support of the proposed system can be realized using functions which measure the execution times of the tasks on GPU and change the number of GPU cores. We motivate the necessity of this framework in the contex...
Programming modern embedded vision systems brings various challenges, due to the steep learning curv...
Abstract-Graphic Processing Units (GPUs) achieve latency tolerance by exploiting massive amounts of ...
GPUs are being increasingly adopted as compute accelerators in many domains, spanning environments f...
General-purpose Graphics Processing Units (GPUs) have been considered as a promising technology to a...
This paper describes GPUSync, which is a framework for managing graphics processing units (GPUs) in ...
The graphics processing unit (GPU) is becoming a very powerful platform to accelerate graphics and d...
The effective use of GPUs for accelerating applications depends on a number of factors including eff...
We propose a GPU fine-grained load-balancing abstraction that decouples load balancing from work pro...
© 2021 ACM.Recently, graphic processing unit (GPU) multitasking has become important in many platfor...
The demand for low-power and high-performance computing has been driving the semiconductor industry ...
Graphic Processing Units (GPUs) are currently widely used in High Performance Computing (HPC) applic...
Heterogeneous parallel architectures like those comprised of CPUs and GPUs are a tantalizing compute...
Abstract—GPU architecture has traditionally been used in graphics application because of its enormou...
Recent advances in GPUs (graphics processing units) lead to mas-sively parallel hardware that is eas...
High compute-density with massive thread-level parallelism of Graphics Processing Units (GPUs) is be...
Programming modern embedded vision systems brings various challenges, due to the steep learning curv...
Abstract-Graphic Processing Units (GPUs) achieve latency tolerance by exploiting massive amounts of ...
GPUs are being increasingly adopted as compute accelerators in many domains, spanning environments f...
General-purpose Graphics Processing Units (GPUs) have been considered as a promising technology to a...
This paper describes GPUSync, which is a framework for managing graphics processing units (GPUs) in ...
The graphics processing unit (GPU) is becoming a very powerful platform to accelerate graphics and d...
The effective use of GPUs for accelerating applications depends on a number of factors including eff...
We propose a GPU fine-grained load-balancing abstraction that decouples load balancing from work pro...
© 2021 ACM.Recently, graphic processing unit (GPU) multitasking has become important in many platfor...
The demand for low-power and high-performance computing has been driving the semiconductor industry ...
Graphic Processing Units (GPUs) are currently widely used in High Performance Computing (HPC) applic...
Heterogeneous parallel architectures like those comprised of CPUs and GPUs are a tantalizing compute...
Abstract—GPU architecture has traditionally been used in graphics application because of its enormou...
Recent advances in GPUs (graphics processing units) lead to mas-sively parallel hardware that is eas...
High compute-density with massive thread-level parallelism of Graphics Processing Units (GPUs) is be...
Programming modern embedded vision systems brings various challenges, due to the steep learning curv...
Abstract-Graphic Processing Units (GPUs) achieve latency tolerance by exploiting massive amounts of ...
GPUs are being increasingly adopted as compute accelerators in many domains, spanning environments f...