Sparse matrix problems are difficult to parallelize efficiently on distributed memory machines since non-zero elements are unevenly scattered and are accessed via multiple levels of indirection. Irregular distributions that achieve good load balance and locality are hard to compute, have high memory overheads and also lead to further indirection in locating distributed data. This paper evaluates alternative semi-regular distribution strategies which trade off the quality of loadbalance and locality for lower decomposition overheads and efficient lookup. The proposed techniques are compared to an irregular sparse matrix partitioner and the relative merits of each distribution method are outlined. 1 Introduction Sparse matrices are used in ...
Sparse matrix computations play an important role in iterative methods to solve systems of equations...
Parallel computing promises several orders of magnitude increase in our ability to solve realistic c...
We describe the work performed in the context of a Franco-Berkeley funded project between NERSC-LBN...
Sparse matrix problems are difficult to parallelize efficiently on distributed memory machines since...
A notable characteristic of the scientific computing and machine learning prob-lem domains is the la...
A significant part of scientific codes consist of sparse matrix computations. In this work we propos...
We present a distributed-memory library for computations with dense structured matrices. A matrix is...
Abstract In this paper, we study the sparse matrix-vector product (SMVP) distribution on a large sca...
The general block distribution of a matrix is a rectilinear partition of the matrix into orthogonal ...
[[abstract]]In our previous work, we have studied three data distribution schemes, Send Followed Com...
Sparse times dense matrix multiplication (SpMM) finds its applications in well-established fields su...
The treatment of sparse numerical problems on large scale systems is often reduced to that of their ...
Several methods have been proposed in the literature for the distribution of data on distributed mem...
[[abstract]]©1997 SIAM-We present a compile-time method to select compression and distribution schem...
Sparse matrix computations are a critical component of computer simulation in many scientific and en...
Sparse matrix computations play an important role in iterative methods to solve systems of equations...
Parallel computing promises several orders of magnitude increase in our ability to solve realistic c...
We describe the work performed in the context of a Franco-Berkeley funded project between NERSC-LBN...
Sparse matrix problems are difficult to parallelize efficiently on distributed memory machines since...
A notable characteristic of the scientific computing and machine learning prob-lem domains is the la...
A significant part of scientific codes consist of sparse matrix computations. In this work we propos...
We present a distributed-memory library for computations with dense structured matrices. A matrix is...
Abstract In this paper, we study the sparse matrix-vector product (SMVP) distribution on a large sca...
The general block distribution of a matrix is a rectilinear partition of the matrix into orthogonal ...
[[abstract]]In our previous work, we have studied three data distribution schemes, Send Followed Com...
Sparse times dense matrix multiplication (SpMM) finds its applications in well-established fields su...
The treatment of sparse numerical problems on large scale systems is often reduced to that of their ...
Several methods have been proposed in the literature for the distribution of data on distributed mem...
[[abstract]]©1997 SIAM-We present a compile-time method to select compression and distribution schem...
Sparse matrix computations are a critical component of computer simulation in many scientific and en...
Sparse matrix computations play an important role in iterative methods to solve systems of equations...
Parallel computing promises several orders of magnitude increase in our ability to solve realistic c...
We describe the work performed in the context of a Franco-Berkeley funded project between NERSC-LBN...