International audienceInitially introduced as special-purpose accelerators for graphics applications, GPUs have now emerged as general purpose computing platforms for a wide range of applications. To address the requirements of these applications, modern GPUs include sizable hardware-managed caches. However, several factors, such as unique architecture of GPU, rise of CPU-GPU heterogeneous computing etc., demand effective management of caches to achieve high performance and energy efficiency. Recently, several techniques have been proposed for this purpose. In this paper, we survey several architectural and system-level techniques proposed for managing and leveraging GPU caches. We also discuss the importance and challenges of cache managem...
The massive parallel architecture enables graphics process-ing units (GPUs) to boost performance for...
Heterogeneous multicore processors that take full advantage of CPUs and GPUs within the same chip ra...
The massive parallel architecture enables graphics processing units (GPUs) to boost performance for ...
Hardware caches are widely employed in GPGPUs to achieve higher performance and energy efficiency. I...
Abstract—With the SIMT execution model, GPUs can hide memory latency through massive multithreading ...
Graphics processing units (GPUs) have become ubiquitous for general purpose applications due to thei...
The usage of Graphics Processing Units (GPUs) as an application accelerator has become increasingly ...
Abstract—On-chip caches are commonly used in computer systems to hide long off-chip memory access la...
In the last decade, GPUs have emerged to be widely adopted for general-purpose applications. To capt...
<p>The continued growth of the computational capability of throughput processors has made throughput...
Abstract—This paper presents the design, implementation and evaluation of BAG, a system that manages...
Pervasive use of GPUs across multiple disciplines is a result of continuous adaptation of the GPU a...
Current GPU computing models support a mixture of coherent and incoherent classes of memory operatio...
Heterogeneous systems are ubiquitous in the field of High- Performance Computing (HPC). Graphics pro...
This paper presents novel cache optimizations for massively parallel, throughput-oriented architectu...
The massive parallel architecture enables graphics process-ing units (GPUs) to boost performance for...
Heterogeneous multicore processors that take full advantage of CPUs and GPUs within the same chip ra...
The massive parallel architecture enables graphics processing units (GPUs) to boost performance for ...
Hardware caches are widely employed in GPGPUs to achieve higher performance and energy efficiency. I...
Abstract—With the SIMT execution model, GPUs can hide memory latency through massive multithreading ...
Graphics processing units (GPUs) have become ubiquitous for general purpose applications due to thei...
The usage of Graphics Processing Units (GPUs) as an application accelerator has become increasingly ...
Abstract—On-chip caches are commonly used in computer systems to hide long off-chip memory access la...
In the last decade, GPUs have emerged to be widely adopted for general-purpose applications. To capt...
<p>The continued growth of the computational capability of throughput processors has made throughput...
Abstract—This paper presents the design, implementation and evaluation of BAG, a system that manages...
Pervasive use of GPUs across multiple disciplines is a result of continuous adaptation of the GPU a...
Current GPU computing models support a mixture of coherent and incoherent classes of memory operatio...
Heterogeneous systems are ubiquitous in the field of High- Performance Computing (HPC). Graphics pro...
This paper presents novel cache optimizations for massively parallel, throughput-oriented architectu...
The massive parallel architecture enables graphics process-ing units (GPUs) to boost performance for...
Heterogeneous multicore processors that take full advantage of CPUs and GPUs within the same chip ra...
The massive parallel architecture enables graphics processing units (GPUs) to boost performance for ...