We develop algorithms for Cholesky factorization and the solution of triangular systems of linear equations on a hypercube multiprocessor. Specifically, we describe algorithms that apply when the matrix is distributed around the hypercube by submatrices, or patches. We show that these algorithms use asymptomatically less internode communication than more common row- and column- oriented algorithms. Empirical results accompany the analysis and show that patch-oriented algorithms are competitive with, but not demonstrably superior to, the other algorithms for hypercubes of low dimension. Implementations in C appear in an appendix
The Polymorphic Torus architecture is a reconfigurable, massively parallel finegrained system, whic...
Nonlinear matrix equations arise frequently in applied probability, especially in the numerical sol...
[[abstract]]In this paper we use hypercube computers for solving linear systems. First, the pivoting...
This paper is concerned with parallel algorithms for matrix factorization on distributed-memory, mes...
We develop an algorithm for computing the symbolic and numeric Cholesky factorization of a large sp...
Systems of linear equations of the form $Ax = b,$ where $A$ is a large sparse symmetric positive de...
We consider solving triangular systems of linear equations on a hypercube multiprocessor. Specifica...
The solution of dense systems of linear equations is at the heart of numerical computations. Such sy...
We describe the solution of linear systems of equations, Ax = b, on distributed-memory concurrent co...
AbstractThe solution of large sparse positive definite systems of equations typically involves four ...
In this paper we present an efficient dense matrix multi-plication algorithm for distributed memory ...
This paper describes a set of concurrent algorithms for matrix algebra, based on a library of collec...
As sequential computers seem to be approaching their limits in CPU speed there is increasing intere...
We discuss optimal communication and decomposition algorithms for a class of regular problems on con...
This paper represents the first attempt towards a decomposition-independent implementation of parall...
The Polymorphic Torus architecture is a reconfigurable, massively parallel finegrained system, whic...
Nonlinear matrix equations arise frequently in applied probability, especially in the numerical sol...
[[abstract]]In this paper we use hypercube computers for solving linear systems. First, the pivoting...
This paper is concerned with parallel algorithms for matrix factorization on distributed-memory, mes...
We develop an algorithm for computing the symbolic and numeric Cholesky factorization of a large sp...
Systems of linear equations of the form $Ax = b,$ where $A$ is a large sparse symmetric positive de...
We consider solving triangular systems of linear equations on a hypercube multiprocessor. Specifica...
The solution of dense systems of linear equations is at the heart of numerical computations. Such sy...
We describe the solution of linear systems of equations, Ax = b, on distributed-memory concurrent co...
AbstractThe solution of large sparse positive definite systems of equations typically involves four ...
In this paper we present an efficient dense matrix multi-plication algorithm for distributed memory ...
This paper describes a set of concurrent algorithms for matrix algebra, based on a library of collec...
As sequential computers seem to be approaching their limits in CPU speed there is increasing intere...
We discuss optimal communication and decomposition algorithms for a class of regular problems on con...
This paper represents the first attempt towards a decomposition-independent implementation of parall...
The Polymorphic Torus architecture is a reconfigurable, massively parallel finegrained system, whic...
Nonlinear matrix equations arise frequently in applied probability, especially in the numerical sol...
[[abstract]]In this paper we use hypercube computers for solving linear systems. First, the pivoting...