Intra-cluster coalescing to reduce GPU NoC pressure

Wang, Lu
Zhao, Xia
Kaeli, David
Wang, Zhiying
Eeckhout, Lieven

Publication date: January 2018
DOI: 10.1109/ipdps.2018.00108
Publisher: IEEE
Language: English

Abstract

GPUs continue to increase the number of streaming multiprocessors (SMs) to provide increasingly higher compute capabilities. To construct a scalable crossbar network-on-chip (NoC) that connects the SMs to the memory controllers, a cluster structure is introduced in modern GPUs in which several SMs are grouped together to share a network port. Because of network port sharing, clustered GPUs face severe NoC congestion, which creates a critical performance bottleneck. In this paper, we target redundant network traffic to mitigate GPU NoC congestion. In particular, we observe that in many GPU-compute applications, different SMs in a cluster access shared data. Issuing redundant requests to access the same memory location wastes valuable NoC ba...
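The redundancy the abstract describes can be illustrated with a toy model. The sketch below is not the paper's mechanism. It only assumes that requests from the SMs of one cluster pass through a shared network port, and that requests to the same cache line can be merged there into a single NoC packet. The function name `coalesce_cluster_requests` and the 128-byte line size are hypothetical choices made for illustration.

```python
from collections import defaultdict

LINE_BYTES = 128  # assumed cache-line size, not taken from the paper


def coalesce_cluster_requests(requests, line_bytes=LINE_BYTES):
    """Merge same-line requests from the SMs of one cluster.

    requests: iterable of (sm_id, byte_address) pairs.
    Returns a dict mapping each cache-line base address to the sorted
    list of SM ids waiting for it. Each key stands for one packet that
    would be sent over the NoC in this toy model.
    """
    waiters = defaultdict(set)
    for sm_id, addr in requests:
        line = (addr // line_bytes) * line_bytes  # align down to line base
        waiters[line].add(sm_id)
    return {line: sorted(sms) for line, sms in waiters.items()}


# Four requests from four SMs; three of them touch the same cache line.
requests = [(0, 0x1000), (1, 0x1040), (2, 0x1000), (3, 0x2000)]
merged = coalesce_cluster_requests(requests)
print(len(requests), "requests ->", len(merged), "NoC packets")
```

In this example, SMs 0, 1, and 2 all touch the line at 0x1000, so only two packets would leave the cluster instead of four. How the reply is delivered back to each waiting SM is outside the scope of this sketch.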
