Given a partitioning of a sparse matrix for parallel matrix–vector multiplication, which determines the total communication volume, we try to find a suitable vector partitioning that balances the communication load among the processors. We present a new lower bound for the maximum communication cost per processor, an optimal algorithm that attains this bound for the special case where each matrix column is owned by at most two processors, and a new heuristic algorithm for the general case that often attains the lower bound. This heuristic algorithm tries to avoid raising the current lower bound when assigning vector components to processors. Experimental results show that the new algorithm often improves upon the heuristic algorithm that is...
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
To minimize the communication in parallel sparse matrix-vector multiplication while maintaining load...
We investigate outer-product--parallel, inner-product--parallel, and row-by-row-product--parallel fo...
Given a partitioning of a sparse matrix for parallel matrix–vector multiplication, which determines ...
Abstract. This paper addresses the problem of one-dimensional partitioning of structurally unsymmetr...
We investigate sparse matrix bipartitioning – a problem where we minimize the communication volume i...
We investigate sparse matrix bipartitioning – a problem where we minimize the communication volume i...
We investigate sparse matrix bipartitioning – a problem where we minimize the communication volume i...
The sparse matrix partitioning problem arises when minimizing communication in parallel sparse matri...
This paper addresses the problem of one-dimensional partitioning of structurally unsymmetric square ...
We show a two-phase approach for minimizing various communication-cost metrics in fine-grain partiti...
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of...
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of...
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of...
International audienceWe investigate one dimensional partitioning of sparse matrices under a given o...
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
To minimize the communication in parallel sparse matrix-vector multiplication while maintaining load...
We investigate outer-product--parallel, inner-product--parallel, and row-by-row-product--parallel fo...
Given a partitioning of a sparse matrix for parallel matrix–vector multiplication, which determines ...
Abstract. This paper addresses the problem of one-dimensional partitioning of structurally unsymmetr...
We investigate sparse matrix bipartitioning – a problem where we minimize the communication volume i...
We investigate sparse matrix bipartitioning – a problem where we minimize the communication volume i...
We investigate sparse matrix bipartitioning – a problem where we minimize the communication volume i...
The sparse matrix partitioning problem arises when minimizing communication in parallel sparse matri...
This paper addresses the problem of one-dimensional partitioning of structurally unsymmetric square ...
We show a two-phase approach for minimizing various communication-cost metrics in fine-grain partiti...
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of...
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of...
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of...
International audienceWe investigate one dimensional partitioning of sparse matrices under a given o...
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
To minimize the communication in parallel sparse matrix-vector multiplication while maintaining load...
We investigate outer-product--parallel, inner-product--parallel, and row-by-row-product--parallel fo...