We present Decoupled Vector Runahead (DVR), an in-core prefetching technique, executing separately from the main application thread, that exploits massive amounts of memory-level parallelism to improve the performance of applications featuring indirect memory accesses. DVR dynamically infers loop bounds at run-time, recognizing striding loads and vectorizing subsequent instructions that are part of an indirect chain. It proactively issues memory accesses for the resulting loads far into the future, even when the out-of-order core has not yet stalled, bringing their data into the L1 cache and thus providing timely prefetches for the main thread. DVR can adjust the degree of vectorization at run-time, vectorize the same chain of indirect memo...
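To make the targeted access pattern concrete, the following is a minimal software sketch, not the hardware mechanism itself: it shows the kind of striding-load plus dependent indirect-load chain that DVR vectorizes and prefetches ahead of the main thread. The function indirect_sum, the array names idx and data, and the PREFETCH_DISTANCE constant are hypothetical illustrations, and the explicit __builtin_prefetch call only approximates in software the runahead prefetching that DVR performs in hardware.

#include <stddef.h>

/* Hypothetical runahead depth, in loop iterations. */
#define PREFETCH_DISTANCE 64

/* Indirect-access kernel of the form sum += data[idx[i]].
 * idx[i] is a striding load; data[idx[i]] is the dependent load whose
 * address is only known once idx[i] has been fetched, which defeats a
 * conventional stride prefetcher. */
double indirect_sum(const size_t *idx, const double *data, size_t n)
{
    double sum = 0.0;
    for (size_t i = 0; i < n; i++) {
        /* Software stand-in for DVR: run ahead along the striding load,
         * pick up a future index, and issue the dependent load early so
         * its data sits in the L1 cache when the main loop reaches it. */
        if (i + PREFETCH_DISTANCE < n)
            __builtin_prefetch(&data[idx[i + PREFETCH_DISTANCE]], 0, 1);

        sum += data[idx[i]];
    }
    return sum;
}

In hardware, DVR would instead detect the striding load on idx, infer the loop bound, vectorize the dependent data[idx[i]] loads across many future iterations at once, and issue them without any source-level hints such as the prefetch call above.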
The end of Dennard scaling is expected to shrink the range of DVFS in future nodes, limiting the ene...
Memory-intensive threads can hoard shared resources without making progress on a multithreading p...
This paper presents an experimental study on cache memory designs for vector computers. We use an ex...
The memory wall places a significant limit on performance for many modern workloads. These applicati...
The purpose of this paper is to show that using decoupling techniques in a vector processor, the per...
Vector processors often use a cache to exploit temporal locality and reduce memory bandwidth demands...
As we approach the end of conventional technology scaling, computer architects are forced to incorpo...
We present Outrider, an architecture for throughput-oriented processors that exploits intra-thread m...
Many modern data processing and HPC workloads are heavily memory-latency bound. A tempting propositi...
The speed gap between processors and the memory system is becoming the performance bottle...
The purpose of this paper is to show that multi-threading techniques can be applied to a vector proc...
This paper describes future execution (FE), a simple hardware-only technique to accelerate individu...
Despite rapid increases in CPU performance, the primary obstacles to achieving higher performance in...