We investigate performance characteristics for the LU factorization of large matrices with various sparsity patterns. We consider supernodal right-looking parallel factorization on a bi-dimensional grid of processors, making use of static pivoting. We develop a performance model and we validate it using the implementation in SuperLU-DIST, the real matrices and the IBM Power3 machine at NERSC. We use this model to obtain performance bounds on parallel computers, to perform scalability analysis and to identify performance bottlenecks. We also discuss the role of load balance and data distribution in this approach
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
In this paper we present a new parallel algorithm for the LU decomposition of a general sparse matri...
In this paper we present a static scheduling algorithm for parallel sparse LU factorization with st...
We investigate performance characteristics for the LU factorization of large matrices with various ...
We present a simulation-based performance model to analyze a parallel sparse LU factorization algori...
We present a performance model to analyze a parallel sparse LU factorization algorithm on modern ca...
We present a performance model to analyze a parallel sparseLU factorization algorithm on modern cach...
This thesis presents a parallel algorithm for the direct LU factorization of general unsymmetric spa...
We present an out-of-core sparse nonsymmetric LU-factorization algorithm with partial pivoting. We h...
This paper presents a comparative study of two data mapping schemes for parallel sparse LU factoriza...
Colloque avec actes et comité de lecture. internationale.International audienceThis paper describes ...
In this paper we present several improvements of widely used parallel LU factorization methods on sp...
Abstract. We investigate several ways to improve the performance of sparse LU factorization with par...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
Colloque avec actes et comité de lecture. internationale.International audienceIn this paper we pres...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
In this paper we present a new parallel algorithm for the LU decomposition of a general sparse matri...
In this paper we present a static scheduling algorithm for parallel sparse LU factorization with st...
We investigate performance characteristics for the LU factorization of large matrices with various ...
We present a simulation-based performance model to analyze a parallel sparse LU factorization algori...
We present a performance model to analyze a parallel sparse LU factorization algorithm on modern ca...
We present a performance model to analyze a parallel sparseLU factorization algorithm on modern cach...
This thesis presents a parallel algorithm for the direct LU factorization of general unsymmetric spa...
We present an out-of-core sparse nonsymmetric LU-factorization algorithm with partial pivoting. We h...
This paper presents a comparative study of two data mapping schemes for parallel sparse LU factoriza...
Colloque avec actes et comité de lecture. internationale.International audienceThis paper describes ...
In this paper we present several improvements of widely used parallel LU factorization methods on sp...
Abstract. We investigate several ways to improve the performance of sparse LU factorization with par...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
Colloque avec actes et comité de lecture. internationale.International audienceIn this paper we pres...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
In this paper we present a new parallel algorithm for the LU decomposition of a general sparse matri...
In this paper we present a static scheduling algorithm for parallel sparse LU factorization with st...