Instruction Roofline: An insightful visual performance model for GPUs

Ding, N
Awan, M
Williams, S

Publication date

September 2022

Publisher

eScholarship, University of California

Abstract

The Roofline performance model provides an intuitive approach to identify performance bottlenecks and guide performance optimization. However, the classic FLOP-centric approach is inappropriate for the emerging applications that perform more integer operations than floating point operations. In this article, we reintroduce our Instruction Roofline Model on NVIDIA GPUs and expand our evaluation of it. The Instruction Roofline incorporates instructions and memory transactions across all memory hierarchies together, and provides more performance insights than the FLOP-oriented Roofline Model, that is, instruction throughput, stride memory access patterns, bank conflicts, and thread predication. We use our Instruction Roofline methodology to an...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Instruction Roofline: An insightful visual performance model for GPUs

Abstract

Extracted data

Instruction Roofline: An insightful visual performance model for GPUs

Abstract

Extracted data

Related items

Related items