Understanding memory effects in the automated generation of optimized matrix algebra kernels

Jessup, Elizabeth R.
Karlin, Ian
Silkensen, Erik
Belter, Geoffrey
Siek, Jeremy

Publication date

May 2010

DOI

10.1016/j.procs.2010.04.209

Publisher

Published by Elsevier B.V.

ISSN

1877-0509

Abstract

Efficient implementation of matrix algebra is important to the performance of many large and complex physical models. Among important tuning techniques is loop fusion, which can reduce the amount of data moved between memory and the processor. We have developed the Build to Order (BTO) compiler to automate loop fusion for matrix algebra kernels. In this paper, we present BTO’s analytic memory model, which substantially reduces the number of loop fusion options considered by the compiler. We introduce an example that motivates the inclusion of registers in the model. We demonstrate how the model’s modular design facilitates the addition of register allocation to the model’s set of memory components, improving its accuracy.
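To make the loop fusion claim concrete, the following is a minimal sketch rather than anything taken from the paper or from BTO itself. It assumes a hypothetical kernel that computes q = A p and s = A^T r from the same matrix A, and it uses a simple counter of reads from A as a rough stand-in for memory traffic. The function names `unfused` and `fused` and the counter are illustrative assumptions; the counter is not BTO's analytic memory model.

```python
def unfused(A, p, r):
    """Compute q = A p and s = A^T r with two separate loop nests."""
    m, n = len(A), len(A[0])
    loads = 0  # number of times an element of A is read (illustrative proxy)
    q = [0.0] * m
    for i in range(m):
        for j in range(n):
            q[i] += A[i][j] * p[j]
            loads += 1
    s = [0.0] * n
    for i in range(m):
        for j in range(n):
            s[j] += A[i][j] * r[i]
            loads += 1
    return q, s, loads


def fused(A, p, r):
    """Compute the same q and s in one loop nest, reading each A[i][j] once."""
    m, n = len(A), len(A[0])
    loads = 0
    q = [0.0] * m
    s = [0.0] * n
    for i in range(m):
        for j in range(n):
            a = A[i][j]  # single read, reused by both updates
            loads += 1
            q[i] += a * p[j]
            s[j] += a * r[i]
    return q, s, loads
```

Both versions produce the same results, but the fused loop nest reads each element of A once instead of twice. A compiler weighing many such fusion choices needs some way to estimate which ones actually reduce data movement, which is the role the abstract assigns to the memory model.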
