This thesis presents a top-to-bottom analysis of designing and implementing fast algorithms for current and future systems. We present new analysis, algorithmic techniques, and implementations of the Fast Multipole Method (FMM) for solving N-body problems. We target the FMM because it is broadly applicable to a variety of scientific particle simulations used to study electromagnetic, fluid, and gravitational phenomena, among others. Importantly, the FMM has asymptotically optimal time complexity with guaranteed approximation accuracy. As such, it is among the most attractive solutions for scalable particle simulation on future extreme-scale systems. We specifically address two key challenges. The first challenge is how to engineer fast cod...
Algorithmic improvements to the parallel, distributed-memory multilevel fast multipole algorithm (ML...
The brain is probably the most complex organ in the human body. To understand processes such as lear...
We present a new adaptive fast multipole algorithm and its parallel implementation. The algorithm is...
<b>Invited Lecture at the SIAM <i>"Encuentro Nacional de Ingeniería Matemática,"</i> at Pontificia U...
Among the algorithms that are likely to play a major role in future exascale computing, the fast mul...
We present new analysis, algorithmic techniques, and implementations of the Fast Multipole Method (F...
The N-body problem appears in many computational physics simulations. At each time step the computat...
Solving an N-body problem, electrostatic or gravitational, is a crucial task and the main computatio...
A significant and computationally most demanding part of molecular dynamics simulations is the calcu...
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems Rio Yokota...
An important and computationally demanding part of molecular dynamics simulations is the calculation...
This paper presents a parallel version of the fast multipole method (FMM). The FMM is a rece...
The Fast Multipole Method allows the rapid evaluation of sums of radial basis functions centered at ...
Fast summation methods like the FMM are the backbone of a multitude of simulations in MD, astrophysi...
Solving an N-body problem, electrostatic or gravitational, is a crucial task and the main computatio...