The general block distribution of a matrix is a rectilinear partition of the matrix into orthogonal blocks such that the maximum sum of the elements within a single block is mini-mized. This corresponds to partitioning the matrix onto parallel processors so as to minimize processor load while maintaining regular communication patterns. Applications of the prob-lem include various parallel sparse matrix computations, compilers for high-performance languages, particle in cell computations, video and image compression, and simulations as-sociated with a communication network. We analyze the performance guarantee of a natural and practical heuristic based on iterative renement, which has previously been shown to give good empirical results. Whe...
Many applications of scientific computing rely on computations on sparse matrices. The design of ef...
Abstract. Sparse matrix-vector multiplication forms the heart of iterative linear solvers used widel...
Sparse-matrix solution is a dominant part of execution time in simulating VLSI circuits by a detaile...
AbstractThe general block distribution of a matrix is a rectilinear partition of the matrix into ort...
A significant part of scientific codes consist of sparse matrix computations. In this work we propos...
In this paper, we consider the problem of partitioning a square into a set of zones of prescribed ar...
Sparse matrix problems are difficult to parallelize efficiently on distributed memory machines since...
International audienceIn the context of the block Cimmino algorithm, we study preprocessing strategi...
The treatment of sparse numerical problems on large scale systems is often reduced to that of their ...
The Cimmino method is a row projection method in which the original linear system is divided into su...
This research aims at creating and providing a framework to describe algorithmic redistribution meth...
In this paper, we present a new load balancing technique, called panel scattering, which is generall...
Implementing linear algebra kernels on distributed memory parallel computers raises the problem of d...
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
This report has been developed over the work done in the deliverable [Nava94] There it was shown tha...
Many applications of scientific computing rely on computations on sparse matrices. The design of ef...
Abstract. Sparse matrix-vector multiplication forms the heart of iterative linear solvers used widel...
Sparse-matrix solution is a dominant part of execution time in simulating VLSI circuits by a detaile...
AbstractThe general block distribution of a matrix is a rectilinear partition of the matrix into ort...
A significant part of scientific codes consist of sparse matrix computations. In this work we propos...
In this paper, we consider the problem of partitioning a square into a set of zones of prescribed ar...
Sparse matrix problems are difficult to parallelize efficiently on distributed memory machines since...
International audienceIn the context of the block Cimmino algorithm, we study preprocessing strategi...
The treatment of sparse numerical problems on large scale systems is often reduced to that of their ...
The Cimmino method is a row projection method in which the original linear system is divided into su...
This research aims at creating and providing a framework to describe algorithmic redistribution meth...
In this paper, we present a new load balancing technique, called panel scattering, which is generall...
Implementing linear algebra kernels on distributed memory parallel computers raises the problem of d...
The problem of partitioning dense matrices into sets of sub-matrices has received increased attentio...
This report has been developed over the work done in the deliverable [Nava94] There it was shown tha...
Many applications of scientific computing rely on computations on sparse matrices. The design of ef...
Abstract. Sparse matrix-vector multiplication forms the heart of iterative linear solvers used widel...
Sparse-matrix solution is a dominant part of execution time in simulating VLSI circuits by a detaile...