We investigate performance characteristics for the LU factorization of large matrices with various sparsity patterns. We consider supernodal right-looking parallel factorization on a bi-dimensional grid of processors, making use of static pivoting. We develop a performance model and we validate it using the implementation in SuperLU_DIST, the real matrices and the IBM Power3 machine at NERSC. We use this model to obtain performance bounds on parallel computers, to perform scalability analysis and to identify performance bottlenecks. We also discuss the role of load balance and data distribution in this approach
Abstract. We investigate several ways to improve the performance of sparse LU factorization with par...
. Solving large nonsymmetric sparse linear systems on distributed memory multiprocessors is an activ...
In this paper we present a new parallel algorithm for the LU decomposition of a general sparse matri...
We investigate performance characteristics for the LU factorization of large matrices with various s...
We present a simulation-based performance model to analyze a parallel sparse LU factorization algori...
We present a performance model to analyze a parallel sparse LU factorization algorithm on modern ca...
We present a performance model to analyze a parallel sparseLU factorization algorithm on modern cach...
This thesis presents a parallel algorithm for the direct LU factorization of general unsymmetric spa...
We present an out-of-core sparse nonsymmetric LU-factorization algorithm with partial pivoting. We h...
In this paper we present several improvements of widely used parallel LU factorization methods on sp...
Colloque avec actes et comité de lecture. internationale.International audienceThis paper describes ...
This paper presents a comparative study of two data mapping schemes for parallel sparse LU factoriza...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
Colloque avec actes et comité de lecture. internationale.International audienceIn this paper we pres...
Abstract. We investigate several ways to improve the performance of sparse LU factorization with par...
. Solving large nonsymmetric sparse linear systems on distributed memory multiprocessors is an activ...
In this paper we present a new parallel algorithm for the LU decomposition of a general sparse matri...
We investigate performance characteristics for the LU factorization of large matrices with various s...
We present a simulation-based performance model to analyze a parallel sparse LU factorization algori...
We present a performance model to analyze a parallel sparse LU factorization algorithm on modern ca...
We present a performance model to analyze a parallel sparseLU factorization algorithm on modern cach...
This thesis presents a parallel algorithm for the direct LU factorization of general unsymmetric spa...
We present an out-of-core sparse nonsymmetric LU-factorization algorithm with partial pivoting. We h...
In this paper we present several improvements of widely used parallel LU factorization methods on sp...
Colloque avec actes et comité de lecture. internationale.International audienceThis paper describes ...
This paper presents a comparative study of two data mapping schemes for parallel sparse LU factoriza...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
Colloque avec actes et comité de lecture. internationale.International audienceIn this paper we pres...
Abstract. We investigate several ways to improve the performance of sparse LU factorization with par...
. Solving large nonsymmetric sparse linear systems on distributed memory multiprocessors is an activ...
In this paper we present a new parallel algorithm for the LU decomposition of a general sparse matri...