Abstract. We present new performance models and more compact data structures for cache blocking when applied to sparse matrix-vector multiply (SpM×V). We extend our prior models by relaxing the assumption that the vectors fit in cache and find that the new models are accurate enough to predict optimum block sizes. In addition, we determine criteria that predict when cache blocking improves performance. We conclude with architectural suggestions that would make memory systems execute SpM×V faster.
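To make the optimization concrete, the following is a minimal sketch of cache-blocked SpM×V. It is an illustrative assumption, not the paper's own data structure: the matrix is split into column blocks of a fixed width so that the corresponding slice of the source vector x stays cache-resident while a block is processed, and each block is stored as a plain CSR submatrix. The names `csr_block`, `blocked_csr`, and `spmv_cache_blocked` are hypothetical.

```c
/* Sketch of cache-blocked SpMxV over column blocks of a CSR-like matrix.
 * Assumption: each block covers all rows but only a contiguous range of
 * columns, chosen so the matching slice of x fits in cache. */

typedef struct {
    int nrows;        /* rows in this (full-height) column block      */
    int *rowptr;      /* CSR row pointers, length nrows + 1           */
    int *colind;      /* column indices, local to this block          */
    double *val;      /* nonzero values                               */
    int col_start;    /* first global column covered by this block    */
} csr_block;

typedef struct {
    int nrows, ncols;
    int nblocks;      /* number of column blocks                      */
    csr_block *blocks;
} blocked_csr;

/* y += A * x, processed one column block at a time so that the slice
 * x[col_start .. col_start + block_width) is reused from cache. */
void spmv_cache_blocked(const blocked_csr *A, const double *x, double *y)
{
    for (int b = 0; b < A->nblocks; b++) {
        const csr_block *blk = &A->blocks[b];
        const double *xb = x + blk->col_start;   /* cached slice of x */
        for (int i = 0; i < blk->nrows; i++) {
            double sum = 0.0;
            for (int k = blk->rowptr[i]; k < blk->rowptr[i + 1]; k++)
                sum += blk->val[k] * xb[blk->colind[k]];
            y[i] += sum;
        }
    }
}
```

The block width is the tuning parameter that a performance model of the kind described above would select; the one-CSR-per-block layout shown here is the simplest variant, and the "more compact data structures" of the abstract refer to reducing the index overhead that such blocking introduces.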