International audienceWe examine the send and receive mechanisms of MPI and show how to implement message passing robustly so that performance is not significantly affected by changes to the MPI system. We discuss this within the context of two different parallel algorithms for sparse Gaussian elimination: a multifrontal solver (MUMPS), and a supernodal one (SuperLU). The performance of our initial strategies based on simple MPI point-to-point communication primitives is very sensitive to the MPI system, particularly the way MPI buffers are used. Using nonblocking communication primitives improves the performance and robustness, but at the cost of increased code complexity
Parallelizing sparse irregular application on distributed memory systems poses serious scalability c...
Scalability to a large number of processes is one of the weaknesses of current MPI implementations. ...
MUMPS is a parallel sparse direct solver, using message passing (MPI) for parallelism. In this repor...
International audienceWe examine the send and receive mechanisms of MPI and show how to implement me...
We examine the send and receive mechanisms of MPI and how to implement message passing robustly so t...
We examine the mechanics of the send and receive mechanism of MPI and in particular how we can impl...
We examine the mechanics of the send and receive mechanism of MPI and in particular how we can imple...
In this report we describe the conversion of a simple Master-Worker parallel program from global blo...
Sparse matrix operations dominate the cost of many scientific applications. In parallel, the perform...
The Message Passing Interface (MPI) has been extremely successful as a portable way to program high-...
The Message Passing Interface (MPI) has been extremely successful as a portable way to program high-...
Communication overhead is one of the dominant factors affecting performance in high-end computing sy...
AbstractWe discuss issues in designing sparse (nearest neigh-bor) collective operations for communic...
International audienceOverlapping communications with computation is an efficient way to amortize th...
International audienceWith the growing number of cores and fast network like Infiniband, one of the ...
Parallelizing sparse irregular application on distributed memory systems poses serious scalability c...
Scalability to a large number of processes is one of the weaknesses of current MPI implementations. ...
MUMPS is a parallel sparse direct solver, using message passing (MPI) for parallelism. In this repor...
International audienceWe examine the send and receive mechanisms of MPI and show how to implement me...
We examine the send and receive mechanisms of MPI and how to implement message passing robustly so t...
We examine the mechanics of the send and receive mechanism of MPI and in particular how we can impl...
We examine the mechanics of the send and receive mechanism of MPI and in particular how we can imple...
In this report we describe the conversion of a simple Master-Worker parallel program from global blo...
Sparse matrix operations dominate the cost of many scientific applications. In parallel, the perform...
The Message Passing Interface (MPI) has been extremely successful as a portable way to program high-...
The Message Passing Interface (MPI) has been extremely successful as a portable way to program high-...
Communication overhead is one of the dominant factors affecting performance in high-end computing sy...
AbstractWe discuss issues in designing sparse (nearest neigh-bor) collective operations for communic...
International audienceOverlapping communications with computation is an efficient way to amortize th...
International audienceWith the growing number of cores and fast network like Infiniband, one of the ...
Parallelizing sparse irregular application on distributed memory systems poses serious scalability c...
Scalability to a large number of processes is one of the weaknesses of current MPI implementations. ...
MUMPS is a parallel sparse direct solver, using message passing (MPI) for parallelism. In this repor...