Typical parallelization approaches such as OpenMP and CUDA provide constructs for parallelizing individual loops and blocking them for data locality. Because they treat each loop separately, these approaches cannot exploit the data locality made possible by inter-loop data reuse. The loop chain abstraction provides a framework for reasoning about and applying inter-loop optimizations. In this work, we incorporate the loop chain abstraction into RAJA, a performance portability layer for high-performance computing applications. Using the loop-chain-extended RAJA, or RAJALC, developers can have the RAJA library apply loop transformations such as loop fusion and overlapped tiling while maintaining the original structure of their programs. By in...
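To make the transformation concrete, here is a minimal sketch of loop fusion in plain C++; it is illustrative only and does not use the RAJALC interface, which the abstract does not detail. Two loops traversing the same arrays are merged so the value produced for b[i] is still in cache (or a register) when the second statement consumes it. The array names and the producer/consumer computation are assumptions.

```cpp
#include <cstddef>
#include <vector>

// Before fusion: by the time the second loop reads b[i], the early
// elements it needs have likely already been evicted from cache.
void unfused(const std::vector<double>& a, std::vector<double>& b,
             std::vector<double>& c) {
    for (std::size_t i = 0; i < a.size(); ++i)
        b[i] = 2.0 * a[i];
    for (std::size_t i = 0; i < a.size(); ++i)
        c[i] = b[i] + a[i];
}

// After fusion: each b[i] is produced and consumed in the same iteration,
// so the reuse is served from cache (or a register) instead of memory.
void fused(const std::vector<double>& a, std::vector<double>& b,
           std::vector<double>& c) {
    for (std::size_t i = 0; i < a.size(); ++i) {
        b[i] = 2.0 * a[i];
        c[i] = b[i] + a[i];
    }
}
```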
In order to reduce remote memory accesses on CC-NUMA multiprocessors, we present an interprocedural ...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
In recent years, methods for analyzing and parallelizing sequential code using data analysis and loo...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/18...
Over the past 20 years, increases in processor speed have dramatically outstripped performance incre...
In this tutorial, we address the problem of restructuring a (possibly sequential) program to improve...
Loop fusion is a program transformation that merges multiple loops into one. It is effectiv...
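A frequently cited companion benefit of fusion, once producer and consumer share one loop body, is that the intermediate array can be contracted to a scalar; whether this particular paper covers contraction is not visible in the truncated abstract, so the sketch below is a generic C++ illustration, reusing the assumed names from the fusion example above.

```cpp
#include <cstddef>
#include <vector>

// With producer and consumer in one loop body, each b[i] dies in the
// iteration that creates it, so the whole intermediate array contracts
// to a single scalar temporary.
void fused_contracted(const std::vector<double>& a, std::vector<double>& c) {
    for (std::size_t i = 0; i < a.size(); ++i) {
        const double b = 2.0 * a[i];  // former array element b[i]
        c[i] = b + a[i];
    }
}
```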
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
Exposing opportunities for parallelization while explicitly managing data locality is the primary ch...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
Parallel processing has been used to increase performance of computing systems for the past several ...
We present ALPyNA, an automatic loop parallelization framework for Python, which analyzes data depen...
Parallelizing compilers promise to exploit the parallelism available in a given program, particularl...
This dissertation proposes and evaluates compiler techniques...
Many scientific applications are organized in a data parallel way: as sequences of parallel...
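For chains of parallel loops like those just described, overlapped tiling is one way to keep tiles independent across stages: each tile redundantly recomputes a small halo of the intermediate result so no inter-tile synchronization is needed between sweeps. The following C++ sketch of a two-stage, three-point stencil chain is a generic illustration under assumed names and tile size, not code from the cited paper; boundary elements of out are left unwritten for brevity.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Two chained three-point sweeps: tmp averages in's neighbors, then out
// averages tmp's neighbors. Each tile computes tmp for its interior plus a
// one-point halo on each side, so both stages run back to back per tile and
// tiles never wait on one another; the halo points are recomputed
// redundantly by adjacent tiles. Names and the tile size are illustrative.
void overlapped_tiled(const std::vector<double>& in, std::vector<double>& out,
                      std::size_t tile = 256) {
    const std::size_t n = in.size();
    for (std::size_t lo = 1; lo + 1 < n; lo += tile) {      // tiles are independent
        const std::size_t hi = std::min(lo + tile, n - 1);  // interior [lo, hi)
        const std::size_t tlo = lo - 1, thi = hi + 1;       // with halo: [tlo, thi)
        std::vector<double> tmp(thi - tlo);
        // Stage 1 over the tile plus halo; endpoints just copy the input.
        for (std::size_t i = tlo; i < thi; ++i)
            tmp[i - tlo] = (i == 0 || i + 1 == n)
                               ? in[i]
                               : (in[i - 1] + in[i] + in[i + 1]) / 3.0;
        // Stage 2 over the interior only, reading the halo from this tile's tmp.
        for (std::size_t i = lo; i < hi; ++i)
            out[i] = (tmp[i - 1 - tlo] + tmp[i - tlo] + tmp[i + 1 - tlo]) / 3.0;
    }
}
```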