Compiler Cache Optimizations for Banded Matrix Problems

Wei Li

Publication date

January 1995

DOI

10.1145/224538.224541

Abstract

Almost every modern processor is designed with a memory hierarchy organized into several levels, each of which is smaller, faster, and more expensive than the level below. High performance requires effective use of cached data, i.e., cache locality. Smart compiler transformations can relieve the programmer of hand-optimizing for specific machine architectures. Most existing compiler optimizations were developed for dense matrix programs. Irregular problems, on the other hand, have to rely on runtime optimizations, since their data access patterns are unknown at compile time. However, many scientific computing problems result in solving linear systems where the matrix of coefficients is banded, a structure known at the ...
