Over the last half-decade, a number of research efforts have centered on developing software for generating automatically tuned matrix multiplication kernels. These include the PHiPAC project and the ATLAS project. The software products of both projects employ a brute-force search of a parameter space for blockings that accommodate multiple levels of the memory hierarchy. We take a different approach: using a simple model of hierarchical memories, we employ mathematics to determine a locally optimal strategy for blocking matrices. The theoretical results show that, depending on the shape of the matrices involved, different strategies are locally optimal. Rather than determining a blocking strategy at library generation time, the theoretical ...
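The blocking strategy this abstract refers to can be illustrated with a minimal sketch: partition the matrices into blocks small enough that one block from each operand fits in a given cache level, and accumulate block products. The block size `bs` below is illustrative, not a tuned value, and the function name is our own.

```python
import numpy as np

def blocked_matmul(A, B, bs=64):
    """Compute A @ B one bs-by-bs block at a time.

    A suitable bs keeps the three active blocks (of A, B, and C)
    resident in a single cache level, so each element is reused
    O(bs) times per load instead of being re-fetched from memory.
    """
    m, k = A.shape
    k2, n = B.shape
    assert k == k2, "inner dimensions must match"
    C = np.zeros((m, n), dtype=A.dtype)
    for i in range(0, m, bs):
        for j in range(0, n, bs):
            for p in range(0, k, bs):
                # NumPy slicing handles ragged edge blocks automatically.
                C[i:i+bs, j:j+bs] += A[i:i+bs, p:p+bs] @ B[p:p+bs, j:j+bs]
    return C
```

In a real library the loop order and block sizes would be chosen per architecture, which is exactly the parameter space PHiPAC and ATLAS search by brute force.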
Matrix-matrix multiplication is perhaps the most important operation used as a basic building block...
This paper discusses optimizing computational linear algebra algorithms on a ring cluster of IBM R...
Abstract. Traditional parallel programming methodologies for improving performance assume cache-bas...
The optimal implementation of matrix multiplication on modern computer architectures is of great imp...
This report builds on the work done in the deliverable [Nava94]. There it was shown tha...
Some level-2 and level-3 Distributed Basic Linear Algebra Subroutines (DBLAS) that have been impleme...
Strassen's algorithm for matrix multiplication gains its lower arithmetic complexity at the expe...
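The trade-off this abstract mentions can be seen in a short sketch of Strassen's recursion: seven recursive multiplications replace the classical eight, at the cost of extra additions and temporary block storage. The `cutoff` crossover point is an illustrative assumption; real implementations tune it per machine.

```python
import numpy as np

def strassen(A, B, cutoff=64):
    """Strassen's algorithm for square matrices.

    Each level does 7 sub-multiplications instead of 8, giving
    O(n^log2(7)) arithmetic, but allocates temporaries for the
    18 block additions/subtractions below.
    """
    n = A.shape[0]
    if n <= cutoff or n % 2:
        return A @ B  # fall back to the classical product
    h = n // 2
    A11, A12, A21, A22 = A[:h, :h], A[:h, h:], A[h:, :h], A[h:, h:]
    B11, B12, B21, B22 = B[:h, :h], B[:h, h:], B[h:, :h], B[h:, h:]
    M1 = strassen(A11 + A22, B11 + B22, cutoff)
    M2 = strassen(A21 + A22, B11, cutoff)
    M3 = strassen(A11, B12 - B22, cutoff)
    M4 = strassen(A22, B21 - B11, cutoff)
    M5 = strassen(A11 + A12, B22, cutoff)
    M6 = strassen(A21 - A11, B11 + B12, cutoff)
    M7 = strassen(A12 - A22, B21 + B22, cutoff)
    C = np.empty((n, n), dtype=A.dtype)
    C[:h, :h] = M1 + M4 - M5 + M7
    C[:h, h:] = M3 + M5
    C[h:, :h] = M2 + M4
    C[h:, h:] = M1 - M2 + M3 + M6
    return C
```

The extra additions and temporaries are the "expense" the abstract refers to: below the crossover size they outweigh the saved multiplication.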
Many fast algorithms in arithmetic complexity have hierarchical or recursive structures that make ef...
In the last decade floating-point matrix multiplication on FPGAs has been studied extensively and ef...
Abstract. This paper presents a study of performance optimization of dense matrix multiplication on ...
Abstract. This paper presents uniprocessor performance optimizations, automatic tuning techniques, a...
In this paper we demonstrate the practical portability of a simple version of matrix multiplication ...
Hierarchical matrix (H-matrix) techniques can be used to efficiently treat dense matrices. With an H...
This Master's thesis examines whether a matrix multiplication program that combines the two efficiency stra...
Modern microprocessors can achieve high performance on linear algebra kernels but this currently req...