Data movements between different levels of the memory hierarchy (I/O-transitions, or simply I/Os) are a critical performance bottleneck in modern computing. Finding algorithms that use a minimal number of I/Os is therefore a problem of high practical relevance. We present a cache-oblivious sparse matrix-sparse matrix multiplication algorithm whose worst-case number of I/Os matches a previously established lower bound for this problem (O(N^2/(B·M)) read-I/Os and O(N^2/B) write-I/Os, where N is the size of the problem instance, M is the size of the fast memory, and B is the size of the cache lines). When the output does not need to be stored, the number of write-I/Os can also be reduced to O(N^2/(B·M)). This improves the worst-cas...
This paper presents asymptotically optimal algorithms for rectangular matrix transpose, FF...
In this work, we study the cache-oblivious computation model, which is inspired by the behaviour of ...
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of...
In this article, we introduce a cache-oblivious method for sparse matrix–vector multiplication. Our ...
The sparse matrix–vector (SpMV) multiplication is an important kernel in many applications. When the...
In this thesis we introduce a cost measure to compare the cache-friendliness of different permutati...
© Erik D. Demaine, Andrea Lincoln, Quanquan C. Liu, Jayson Lynch, and Virginia Vassilevska Williams....
This report deals with the efficient calculation of matrix-matrix multiplication, without using explici...
Let X[0..n-1] and Y[0..m-1] be two sorted arrays, and define the m × n matrix A by A[j][i]...
One of the keys to tap the full performance potential of current hardware is the optimal uti...
We analyze the problem of sparse-matrix dense-vector multiplication (SpMV) in the I/O-model. The ta...
In this paper we explore a simple and general approach for developing parallel algorithms that lead ...
The thesis introduces a cache-oblivious method for the sparse matrix-vector (SpMV) multiplication, w...