GPU architectures have become popular for executing general-purpose programs, and they are among the most efficient architectures for machine learning, one of the most demanding application domains today. GPUs rely on a large number of concurrently running threads to hide the latency between dependent instructions. This work presents SOCGPU (Simple Out-of-order Core for GPU), a simple out-of-order execution mechanism that requires neither register renaming nor scoreboards. It uses a small Instruction Buffer and a tiny Dependence Matrix to track dependencies among instructions and avoid data hazards. Evaluations for an Nvidia GTX1080TI-like GPU show that SOCGPU provides a speed-up of up to 3.7...
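The abstract's core idea, tracking inter-instruction dependencies with a small matrix instead of register renaming or a scoreboard, can be illustrated with a minimal sketch. This is a hypothetical illustration, not the paper's actual design: entry `dep[i][j] == 1` means the instruction in buffer slot `i` must wait for slot `j`; a slot may issue when its row is all zeros, and completing an instruction clears its column.

```python
# Hypothetical sketch of a dependence-matrix issue scheme (names and
# structure are assumptions for illustration, not SOCGPU's real design).

class DependenceMatrix:
    def __init__(self, size):
        self.size = size                         # instruction-buffer slots
        self.dep = [[0] * size for _ in range(size)]
        self.writer = {}                         # register -> slot of last writer

    def insert(self, slot, srcs, dst):
        """Record a new instruction: it depends on the last writer of each source."""
        for reg in srcs:
            w = self.writer.get(reg)
            if w is not None:
                self.dep[slot][w] = 1            # slot waits on producer w
        self.writer[dst] = slot

    def ready(self, slot):
        """A slot may issue when it waits on no one (its row is all zeros)."""
        return not any(self.dep[slot])

    def complete(self, slot):
        """On completion, clear the slot's column so dependents become ready."""
        for i in range(self.size):
            self.dep[i][slot] = 0
        for reg, w in list(self.writer.items()):
            if w == slot:                        # value is no longer pending
                del self.writer[reg]

dm = DependenceMatrix(4)
dm.insert(0, srcs=["r1"], dst="r2")   # i0: r2 = f(r1)
dm.insert(1, srcs=["r2"], dst="r3")   # i1: r3 = f(r2), depends on i0
print(dm.ready(0), dm.ready(1))       # i0 is ready, i1 is not
dm.complete(0)
print(dm.ready(1))                    # i1 becomes ready
```

The appeal of such a matrix is that wake-up is a column clear and issue-readiness is a row test, both cheap bitwise operations in hardware, which is consistent with the abstract's claim of avoiding renaming and scoreboard logic.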
This doctoral research aims at understanding the nature of the overhead for data irregular GPU workl...
To avoid immoderate power consumption, the chip industry has shifted away from high-performance singl...
In recent years the power wall has prevented the continued scaling of single core performance. This ...
The Graphics Processing Unit (GPU) has become a more important component in high-performance computi...
There has been a tremendous growth in the use of Graphics Processing Units (GPU) for the acceleratio...
With the massive multithreading execution feature, graphics processing units (GPUs) have b...
Heterogeneous processors with accelerators provide an opportunity to improve performance within a...
General Purpose Graphical Processing Units (GPGPUs) rose to prominence with the release of the Fermi...
GPUs rely heavily on massive multi-threading to achieve high throughput. The massive multi-threadin...
Single-Instruction Multiple-Thread (SIMT) micro-architectures implemented in G...
Graphics Processing Units (GPUs) were originally designed mainly to accelerate graphics applications. N...
GPUs have become popular due to their high computational power. Data scientists rely on GPUs to proc...
The massive parallelism provided by general-purpose GPUs (GPGPUs) possessing numerous compute thread...
Graphic Processing Units (GPUs) are currently widely used in High Performance Computing (HPC) applic...
GPU performance depends not only on thread/warp level parallelism (TLP) but also on instruction-leve...