I/O-Optimal Cache-Oblivious Sparse Matrix-Sparse Matrix Multiplication

Gleinig, Niels
Besta, Maciej
Hoefler, Torsten

Open link

Publication date

January 2022

DOI

10.1109/IPDPS53621.2022.00013

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Abstract

Data movements between different levels of the memory hierarchy (I/O-transitions, or simply I/Os) are a critical performance bottleneck in modern computing. Therefore it is a problem of high practical relevance to find algorithms that use a minimal number of I/Os. We present a cache-oblivious sparse matrix-sparse matrix multiplication algorithm that uses a worst-case number of I/Os that matches a previously established lower bound for this problem (O (N-2/B.M) read-I/Os and O (N-2/B) write-I/Os, where N is the size of the problem instance, M is the size of the fast memory and B is the size of the cache lines). When the output does not need to be stored, also the number of write-I/Os can be reduced to O (N-2/B.M). This improves the worst-cas...

Extracted data

We use cookies to provide a better user experience.

Data Protection

I/O-Optimal Cache-Oblivious Sparse Matrix-Sparse Matrix Multiplication

Abstract

Extracted data

I/O-Optimal Cache-Oblivious Sparse Matrix-Sparse Matrix Multiplication

Abstract

Extracted data

Related items

Related items