This thesis investigates possible optimization on an efficient implementation of the multilevel fast multipole algorithm (MLFMA), which is intended for solving integral equations for large problems. Though MLFMA is not inherently parallel due to its tree-like computational structure, if carefully optimized, it is suitable for parallelization as the throughput and computation power becomes higher on current GPU accelerators. By dividing problems into hierarchical multilevel groups, the MLFMA can be distributed to supercomputers like the Blue Waters, utilizing massive computing resources and balancing the workload. For solving large problems with stability and fast convergence rate, several different iterative solvers are written usin...
We present fast and accurate solutions of large-scale scattering problems involving three-dimensiona...
Among the algorithms that are likely to play a major role in future exascale computing, the fast mul...
In this paper, we analyze the communication pattern and study the scalability of a distributed memor...
This thesis investigates possible optimization on an efficient implementation of the multilevel fas...
Cataloged from PDF version of article.Due to its O(NlogN) complexity, the multilevel fast multipole ...
This paper investigates the scalability of the parallel multilevel fast multipole algorithm (MLFMA)....
We present two memory-reduction methods for the parallel multilevel fast multipole algorithm (MLFMA)...
Due to its O(N log N) complexity, the multilevel fast multipole algorithm (MLFMA) is one of the most...
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems Rio Yokota...
A hierarchical parallelisation of the multilevel fast multipole algorithm (MLFMA) for the efficient ...
The Fast Multipole Method allows the rapid evaluation of sums of radial basis functions centered at ...
In this paper we wish to focus on some recent advances in the Multilevel Fast Multipole Algorithm (M...
The development of a scalable parallel multilevel fast multipole algorithm (MLFMA) for three dimensi...
Cataloged from PDF version of article.We present a novel hierarchical partitioning strategy for the...
This paper reviews recent advances in large-scale computational electromagnetics using frequency dom...
We present fast and accurate solutions of large-scale scattering problems involving three-dimensiona...
Among the algorithms that are likely to play a major role in future exascale computing, the fast mul...
In this paper, we analyze the communication pattern and study the scalability of a distributed memor...
This thesis investigates possible optimization on an efficient implementation of the multilevel fas...
Cataloged from PDF version of article.Due to its O(NlogN) complexity, the multilevel fast multipole ...
This paper investigates the scalability of the parallel multilevel fast multipole algorithm (MLFMA)....
We present two memory-reduction methods for the parallel multilevel fast multipole algorithm (MLFMA)...
Due to its O(N log N) complexity, the multilevel fast multipole algorithm (MLFMA) is one of the most...
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems Rio Yokota...
A hierarchical parallelisation of the multilevel fast multipole algorithm (MLFMA) for the efficient ...
The Fast Multipole Method allows the rapid evaluation of sums of radial basis functions centered at ...
In this paper we wish to focus on some recent advances in the Multilevel Fast Multipole Algorithm (M...
The development of a scalable parallel multilevel fast multipole algorithm (MLFMA) for three dimensi...
Cataloged from PDF version of article.We present a novel hierarchical partitioning strategy for the...
This paper reviews recent advances in large-scale computational electromagnetics using frequency dom...
We present fast and accurate solutions of large-scale scattering problems involving three-dimensiona...
Among the algorithms that are likely to play a major role in future exascale computing, the fast mul...
In this paper, we analyze the communication pattern and study the scalability of a distributed memor...