We present a new concurrent LU‐decomposition algorithm based on implicit pivoting of both rows and columns. This algorithm is, to a large extent, independent of the distribution of the matrix over the concurrent processes. As a result, it can be used in programs with dynamically varying data distributions. Another advantage is that most pivoting strategies are easily incorporated. We also introduce two new, intrinsically concurrent, pivoting strategies: multirow and multicolumn pivoting. With this program, we study the performance of concurrent LU‐decomposition as a function of data distribution and pivoting strategy. We show that LU‐decomposition with some pivoting strategies is both faster and numerically more stable than LU‐decomposit...
Abstract—Dense LU factorization is a prominent benchmark used to rank the performance of supercomput...
In this paper, we make efficient use of asynchronous communications on the LU decomposition algorit...
We present an out-of-core sparse nonsymmetric LU-factorization algorithm with partial pivoting. We h...
We present a new concurrent LU‐decomposition algorithm based on implicit pivoting of both rows and c...
On multicomputers the partial pivoting phase of the LU factorization has a peculiar load unbalancing...
In this paper we present a new parallel algorithm for the LU decomposition of a general sparse matri...
In this paper we present a new parallel algorithm for the LU decomposition of a general sparse matri...
In this paper, we make efficient use of pipelining on LU decomposition with pivoting and a column-sc...
LU decomposition is intensively used in various scientific and engineering computations. A parallel ...
AbstractLU decomposition is intensively used in various scientific and engineering computations. A p...
This thesis presents a parallel algorithm for the direct LU factorization of general unsymmetric spa...
Many linear algebra algorithms require explicit row/column swapping mainly when pivoting operations ...
This paper presents CALU, a Communication Avoiding algorithm for the LU factorization of dense matri...
This paper presents a comparative study of two data mapping schemes for parallel sparse LU factoriza...
We combine the idea of the direct LU factorization with the idea of the pivoting strategy in the usu...
Abstract—Dense LU factorization is a prominent benchmark used to rank the performance of supercomput...
In this paper, we make efficient use of asynchronous communications on the LU decomposition algorit...
We present an out-of-core sparse nonsymmetric LU-factorization algorithm with partial pivoting. We h...
We present a new concurrent LU‐decomposition algorithm based on implicit pivoting of both rows and c...
On multicomputers the partial pivoting phase of the LU factorization has a peculiar load unbalancing...
In this paper we present a new parallel algorithm for the LU decomposition of a general sparse matri...
In this paper we present a new parallel algorithm for the LU decomposition of a general sparse matri...
In this paper, we make efficient use of pipelining on LU decomposition with pivoting and a column-sc...
LU decomposition is intensively used in various scientific and engineering computations. A parallel ...
AbstractLU decomposition is intensively used in various scientific and engineering computations. A p...
This thesis presents a parallel algorithm for the direct LU factorization of general unsymmetric spa...
Many linear algebra algorithms require explicit row/column swapping mainly when pivoting operations ...
This paper presents CALU, a Communication Avoiding algorithm for the LU factorization of dense matri...
This paper presents a comparative study of two data mapping schemes for parallel sparse LU factoriza...
We combine the idea of the direct LU factorization with the idea of the pivoting strategy in the usu...
Abstract—Dense LU factorization is a prominent benchmark used to rank the performance of supercomput...
In this paper, we make efficient use of asynchronous communications on the LU decomposition algorit...
We present an out-of-core sparse nonsymmetric LU-factorization algorithm with partial pivoting. We h...