We present in this paper multi-thread and multi-process parallelizations of the Fast Multipole Method (FMM) for Laplace equation, for uniform and non uniform distributions. These parallelizations apply to the original FMM formulation and to our new matrix formulation with BLAS (Basic Linear Algebra Subprograms) routines. Differences between the multi-thread and the multi-process versions are detailed, and a hybrid MPI-thread approach enables to gain parallel efficiency and memory scalability over the pure MPI one on clusters of SMP nodes. On 128 processors, we obtain 85% (respectively 75%) parallel efficiency for uniform (respectively non uniform) distributions with up to 100 million particles
This thesis presents a top to bottom analysis on designing and implementing fast algorithms for curr...
Due to its O(N log N) complexity, the multilevel fast multipole algorithm (MLFMA) is one of the most...
International audienceAmong all the steps involved in DD simulations, the computation of the interna...
This thesis focuses on the Fast Multipole Method which hierarchically solves the N-body problem with...
International audienceThe multipole-to-local (M2L) operator is the most time-consuming part of the f...
International audienceIn a previous work, we have presented a new formulation of the uniform version...
The Fast Multipole Method allows the rapid evaluation of sums of radial basis functions centered at ...
The N-body problem appears in many computational physics simulations. At each time step the computat...
AbstractThis paper presents a parallel version of the fast multipole method (FMM). The FMM is a rece...
We present a new adaptive fast multipole algorithm and its parallel implementation. The algorithm is...
Solving an N-body problem, electrostatic or gravitational, is a crucial task and the main computatio...
International audienceThe Fast Multipole Method (FMM) is considered as one of the top ten algorithms...
We report on a parallel version of the Fast Multipole Method (FMM) implemented in the classical mole...
<b>Invited Lecture at the SIAM <i>"Encuentro Nacional de Ingeniería Matemática,"</i> at Pontificia U...
Cataloged from PDF version of article.Due to its O(NlogN) complexity, the multilevel fast multipole ...
This thesis presents a top to bottom analysis on designing and implementing fast algorithms for curr...
Due to its O(N log N) complexity, the multilevel fast multipole algorithm (MLFMA) is one of the most...
International audienceAmong all the steps involved in DD simulations, the computation of the interna...
This thesis focuses on the Fast Multipole Method which hierarchically solves the N-body problem with...
International audienceThe multipole-to-local (M2L) operator is the most time-consuming part of the f...
International audienceIn a previous work, we have presented a new formulation of the uniform version...
The Fast Multipole Method allows the rapid evaluation of sums of radial basis functions centered at ...
The N-body problem appears in many computational physics simulations. At each time step the computat...
AbstractThis paper presents a parallel version of the fast multipole method (FMM). The FMM is a rece...
We present a new adaptive fast multipole algorithm and its parallel implementation. The algorithm is...
Solving an N-body problem, electrostatic or gravitational, is a crucial task and the main computatio...
International audienceThe Fast Multipole Method (FMM) is considered as one of the top ten algorithms...
We report on a parallel version of the Fast Multipole Method (FMM) implemented in the classical mole...
<b>Invited Lecture at the SIAM <i>"Encuentro Nacional de Ingeniería Matemática,"</i> at Pontificia U...
Cataloged from PDF version of article.Due to its O(NlogN) complexity, the multilevel fast multipole ...
This thesis presents a top to bottom analysis on designing and implementing fast algorithms for curr...
Due to its O(N log N) complexity, the multilevel fast multipole algorithm (MLFMA) is one of the most...
International audienceAmong all the steps involved in DD simulations, the computation of the interna...