We present a simulation-based performance model to analyze a parallel sparse LU factorization algorithm on modern cached-based, high-end parallel architectures. We consider supernodal right-looking parallel factorization on a bi-dimensional grid of processors, that uses static pivoting. Our model characterizes the algorithmic behavior by taking into account the underlying processor speed, memory system performance, as well as the interconnect speed. The model is validated using the implementation in the SuperLU DIST linear system solver, the sparse matrices from real application, and an IBM POWER3 parallel machine. Our modeling methodology can be adapted to study performance of other types of sparse factorizations, such as Cholesky or QR, a...
. Solving large nonsymmetric sparse linear systems on distributed memory multiprocessors is an activ...
This paper presents a comparative study of two data mapping schemes for parallel sparse LU factoriza...
In this paper we present several improvements of widely used parallel LU factorization methods on sp...
We present a performance model to analyze a parallel sparse LU factorization algorithm on modern ca...
We present a performance model to analyze a parallel sparseLU factorization algorithm on modern cach...
We investigate performance characteristics for the LU factorization of large matrices with various ...
We investigate performance characteristics for the LU factorization of large matrices with various s...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
This thesis presents a parallel algorithm for the direct LU factorization of general unsymmetric spa...
We present an out-of-core sparse nonsymmetric LU-factorization algorithm with partial pivoting. We h...
Solving large sparse linear systems is at the heart of many application problems arising from scient...
Colloque avec actes et comité de lecture. internationale.International audienceThis paper describes ...
Sparse linear systems occur in areas such as finite element methods and statistics. These system...
Several fine grained parallel algorithms were developed and compared to compute the Cholesky factori...
. Solving large nonsymmetric sparse linear systems on distributed memory multiprocessors is an activ...
This paper presents a comparative study of two data mapping schemes for parallel sparse LU factoriza...
In this paper we present several improvements of widely used parallel LU factorization methods on sp...
We present a performance model to analyze a parallel sparse LU factorization algorithm on modern ca...
We present a performance model to analyze a parallel sparseLU factorization algorithm on modern cach...
We investigate performance characteristics for the LU factorization of large matrices with various ...
We investigate performance characteristics for the LU factorization of large matrices with various s...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and ...
This thesis presents a parallel algorithm for the direct LU factorization of general unsymmetric spa...
We present an out-of-core sparse nonsymmetric LU-factorization algorithm with partial pivoting. We h...
Solving large sparse linear systems is at the heart of many application problems arising from scient...
Colloque avec actes et comité de lecture. internationale.International audienceThis paper describes ...
Sparse linear systems occur in areas such as finite element methods and statistics. These system...
Several fine grained parallel algorithms were developed and compared to compute the Cholesky factori...
. Solving large nonsymmetric sparse linear systems on distributed memory multiprocessors is an activ...
This paper presents a comparative study of two data mapping schemes for parallel sparse LU factoriza...
In this paper we present several improvements of widely used parallel LU factorization methods on sp...