In this paper, we present two new parallel formulations of the Barnes-Hut method. These parallel formulations are especially suited for simulations with irregular particle densities. We first present a parallel formulation that uses a static partitioning of the domain and assignment of subdomains to processors. We demonstrate that this scheme delivers acceptable load balance, and coupled with two collective communication operations, it yields good performance. We present a second parallel formulation which combines static decomposition of the domain with an assignment of subdomains to processors based on Morton ordering. This alleviates the load imbalance inherent in the first scheme. The second parallel formulation is inspired by two curre...
The O(N) hierarchical N-body algorithms and Massively Parallel Processors allow particle systems of ...
Parallel computer programs are used to speed up the calculation of computationally-demanding scienti...
Many physical models require the simulation of a large number ($N$) of particles interacting throug...
In this paper, we present two new parallel formulations of the Barnes-Hut method. These parallel for...
We describe the design of several portable and efficient parallel implementations of adaptive N-body...
The efficient parallelization of fast multipole-based algorithms for the N-body problem is one of th...
We present an ecient and provably good partitioning and load balancing algorithm for parallel adapti...
We discuss the performance of direct summation codes used in the simulation of astrophysical stellar...
N-body methods simulate the evolution of systems of particles (or bodies). They are critical for sci...
Hierarchical N-body methods, which are based on a fundamental insight into the nature of many physic...
We present a data-parallel formulation of an adaptive version of Anderson's method for N-body partic...
Simulations of interacting particles are common in science and engineering, appearing in such divers...
We present a performance analysis of different parallelization schemes for direct codes used in the ...
feasible implementation of these algorithms. The execution codes have been written in NESL, a parall...
The O(N) hierarchical N–body algorithms and Massively Parallel Processors allow particle systems of ...
The O(N) hierarchical N-body algorithms and Massively Parallel Processors allow particle systems of ...
Parallel computer programs are used to speed up the calculation of computationally-demanding scienti...
Many physical models require the simulation of a large number ($N$) of particles interacting throug...
In this paper, we present two new parallel formulations of the Barnes-Hut method. These parallel for...
We describe the design of several portable and efficient parallel implementations of adaptive N-body...
The efficient parallelization of fast multipole-based algorithms for the N-body problem is one of th...
We present an ecient and provably good partitioning and load balancing algorithm for parallel adapti...
We discuss the performance of direct summation codes used in the simulation of astrophysical stellar...
N-body methods simulate the evolution of systems of particles (or bodies). They are critical for sci...
Hierarchical N-body methods, which are based on a fundamental insight into the nature of many physic...
We present a data-parallel formulation of an adaptive version of Anderson's method for N-body partic...
Simulations of interacting particles are common in science and engineering, appearing in such divers...
We present a performance analysis of different parallelization schemes for direct codes used in the ...
feasible implementation of these algorithms. The execution codes have been written in NESL, a parall...
The O(N) hierarchical N–body algorithms and Massively Parallel Processors allow particle systems of ...
The O(N) hierarchical N-body algorithms and Massively Parallel Processors allow particle systems of ...
Parallel computer programs are used to speed up the calculation of computationally-demanding scienti...
Many physical models require the simulation of a large number ($N$) of particles interacting throug...