It has been shown that fast multipole methods can achieve good scalability on multi-core architectures. We have for an adaptive single-threaded fast multipole method implemented multithreading support via the OpenMP API. The downward- and upward pass in the fast multipole method are parallelized, and the multithreaded implementation achieves on a quad-core architecture for uniform distributions a 6.6x speedup and a non-uniform distribution a 4.2x speedup. The lower speedup for the non-uniform distributions results from poor load balancing caused by higher variance in connectivity. We conclude that future research in how connectivity affects parallel performance is needed
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems Rio Yokota...
International audienceLearn about the fast multipole method (FMM) and its optimization on NVIDIA GPU...
<b>Invited Lecture at the SIAM <i>"Encuentro Nacional de Ingeniería Matemática,"</i> at Pontificia U...
It has been shown that fast multipole methods can achieve good scalability on multi-core architectur...
We present parallel versions of a representative N-body application that uses Greengard and Rokhlin&...
In the last two decades, physical constraints in chip design have spawned a paradigm shift in comput...
Among the algorithms that are likely to play a major role in future exascale computing, the fast mul...
This work presents the first extensive study of single- node performance optimization, tuning, and a...
In this paper, we analyze the communication pattern and study the scalability of a distributed memor...
We present new analysis, algorithmic techniques, and implementations of the Fast Multipole Method (F...
We present efficient algorithms to build data structures and the lists needed for fast multipole met...
Abstract. We discuss an implementation of adaptive fast multipole meth-ods targeting hybrid multicor...
This paper describes experiments with two paral-lel implementations of the Fast Multipole Method { o...
The Fast Multipole Method (FMM) is well known to possess a bottleneck arising from decreasing worklo...
We present a new adaptive fast multipole algorithm and its parallel implementation. The algorithm is...
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems Rio Yokota...
International audienceLearn about the fast multipole method (FMM) and its optimization on NVIDIA GPU...
<b>Invited Lecture at the SIAM <i>"Encuentro Nacional de Ingeniería Matemática,"</i> at Pontificia U...
It has been shown that fast multipole methods can achieve good scalability on multi-core architectur...
We present parallel versions of a representative N-body application that uses Greengard and Rokhlin&...
In the last two decades, physical constraints in chip design have spawned a paradigm shift in comput...
Among the algorithms that are likely to play a major role in future exascale computing, the fast mul...
This work presents the first extensive study of single- node performance optimization, tuning, and a...
In this paper, we analyze the communication pattern and study the scalability of a distributed memor...
We present new analysis, algorithmic techniques, and implementations of the Fast Multipole Method (F...
We present efficient algorithms to build data structures and the lists needed for fast multipole met...
Abstract. We discuss an implementation of adaptive fast multipole meth-ods targeting hybrid multicor...
This paper describes experiments with two paral-lel implementations of the Fast Multipole Method { o...
The Fast Multipole Method (FMM) is well known to possess a bottleneck arising from decreasing worklo...
We present a new adaptive fast multipole algorithm and its parallel implementation. The algorithm is...
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems Rio Yokota...
International audienceLearn about the fast multipole method (FMM) and its optimization on NVIDIA GPU...
<b>Invited Lecture at the SIAM <i>"Encuentro Nacional de Ingeniería Matemática,"</i> at Pontificia U...