A combination of hierarchical tree-like data structures and data access patterns from fast multipole methods and hierarchical low-rank approximation of linear operators from H-matrix methods appears to form an algorithmic path forward for efficient implementation of many linear algebraic operations of scientific computing at the exascale. The combination provides asymptotically optimal computational and communication complexity and applicability to large classes of operators that commonly arise in scientific computing applications. A convergence of the mathematical theories of the fast multipole and H-matrix methods has been underway for over a decade. We recap this mathematical unification and describe implementation aspects of a hybrid...
We present a novel hierarchical partitioning strategy for the efficient parallelization of the multi...
Diagonal translation operators form the core of the dynamic multilevel fast multipole algorithm (MLF...
In this paper, we analyze the communication pattern and study the scalability of a distributed memor...
Many problems in mathematical physics and engineering involve solving linear systems Ax = b which ar...
We present an overview of the Fast Multipole Method, explain the use of optimal data structures and...
A hierarchical parallelisation of the multilevel fast multipole algorithm (MLFMA) for the efficient ...
Dense linear algebra computations are essential to nearly every problem in scientific computing and ...
The Fast Multipole Method (FMM) is well known to possess a bottleneck arising from decreasing worklo...
This paper investigates the parallel, distributed-memory computation of the translation operator wit...
Due to its O(N log N) complexity, the multilevel fast multipole algorithm (MLFMA) is one of the most...
Large-scale electromagnetics problems can be solved efficiently with the multilevel fast multipole a...
In this paper we study the performance of two classical dense linear algebra algorithms, the LU and ...
Due to its O(NlogN) complexity, the multilevel fast multipole algorithm (MLFMA) is one of the most p...
This study focuses on the performance of two classical dense linear algebra algorithms, the LU and t...