We consider the problem of communication avoidance in computing interactions between a set of particles in scenarios with and without a cutoff radius for interaction. Our strategy, which we show to be optimal in communication, divides the work in the iteration space rather than simply dividing the particles over processors, so more than one processor may be responsible for computing updates to a single particle. Similar to a force decomposition in molecular dynamics, this approach requires up to √p times more memory than a particle decomposition, but reduces communication costs by factors up to √p and is often faster in practice than a particle decomposition [1]. We examine a generalized force decomposition algorithm that tolerates the memo...
O(N) algorithms for N-body simulations enable the simulation of particle systems with up to 100 mill...
We present new analysis, algorithmic techniques, and implementations of the Fast Multipole Method (F...
This paper initiates the study of communication complexity when the processors have limited work spa...
We consider the problem of communication avoidance in computing interactions between a set of partic...
Traditional particle simulation methods are used to calculate pair wise potentials, but some problem...
Simulations of interacting particles are common in science and engineering, appearing in such divers...
In parallel computing environments from multicore systems to cloud computers and supercomputers, dat...
Molecular dynamics simulations (MDS), no matter in which form, have always spent a lot of effort on ...
We introduce a new class of parallel algorithms for the exact computation of systems with pairwise m...
A new, innovative approach to solving the N-body problem is proposed, called self-scheduling N-body ...
Molecular mechanics and dynamics are becoming widely used to perform simulations of molecular system...
In today's MD simulations the scaling bottleneck is shifted more and more from computation towards c...
Molecular mechanics and dynamics are becoming widely used to perform simulations of molecular system...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Many physical models require the simulation of a large number ($N$) of particles interacting throug...
O(N) algorithms for N-body simulations enable the simulation of particle systems with up to 100 mill...
We present new analysis, algorithmic techniques, and implementations of the Fast Multipole Method (F...
This paper initiates the study of communication complexity when the processors have limited work spa...
We consider the problem of communication avoidance in computing interactions between a set of partic...
Traditional particle simulation methods are used to calculate pair wise potentials, but some problem...
Simulations of interacting particles are common in science and engineering, appearing in such divers...
In parallel computing environments from multicore systems to cloud computers and supercomputers, dat...
Molecular dynamics simulations (MDS), no matter in which form, have always spent a lot of effort on ...
We introduce a new class of parallel algorithms for the exact computation of systems with pairwise m...
A new, innovative approach to solving the N-body problem is proposed, called self-scheduling N-body ...
Molecular mechanics and dynamics are becoming widely used to perform simulations of molecular system...
In today's MD simulations the scaling bottleneck is shifted more and more from computation towards c...
Molecular mechanics and dynamics are becoming widely used to perform simulations of molecular system...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Many physical models require the simulation of a large number ($N$) of particles interacting throug...
O(N) algorithms for N-body simulations enable the simulation of particle systems with up to 100 mill...
We present new analysis, algorithmic techniques, and implementations of the Fast Multipole Method (F...
This paper initiates the study of communication complexity when the processors have limited work spa...