Modeling emerging memory-divergent GPU applications

Wang, Lu
Jahre, Magnus
Adileh, Almutaz
Wang, Zhiying
Eeckhout, Lieven

Open PDF

Open link

Publication date

January 2019

DOI

10.1109/lca.2019.2923618

Language

English

Abstract

Analytical performance models yield valuable architectural insight without incurring the excessive runtime overheads of simulation. In this work, we study contemporary GPU applications and find that the key performance-related behavior of such applications is distinct from traditional GPU applications. The key issue is that these GPU applications are memory-intensive and have poor spatial locality, which implies that the loads of different threads commonly access different cache blocks. Such memory-divergent applications quickly exhaust the number of misses the L1 cache can process concurrently, and thereby cripple the GPU's ability to use Memory-Level Parallelism (MLP) and Thread-Level Parallelism (TLP) to hide memory latencies. Our Memory...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Modeling emerging memory-divergent GPU applications

Abstract

Extracted data

Modeling emerging memory-divergent GPU applications

Abstract

Extracted data

Related items

Related items