We investigate the performance gains from hyper-systolic implementations of n²-loop problems on the massively parallel computer Quadrics, exploiting its three-dimensional interprocessor connectivity. For illustration we study the communication aspects of an exact molecular dynamics simulation of n particles with Coulomb (or gravitational) interactions. We compare the interprocessor communication costs of the standard-systolic and the hyper-systolic approaches for various granularities. We predict gain factors as large as three on the Q4 and eight on the QH4, and we measure the actual performance on these machine configurations. We conclude that it appears feasible to investigate the thermodynamics of a full gravitating n-body problem with O(16.000)...
O(N) algorithms for N-body simulations enable the simulation of particle systems with up to 100 mill...
The O(N) hierarchical N-body algorithms and massively parallel processors allow particle systems of...
We discuss the performance of direct summation codes used in the simulation of astrophysical stellar...
We introduce a new class of parallel algorithms for the exact computation of systems with pairwise m...
Hyper-systolic algorithms represent a new class of parallel computing structures. Because of their r...
Direct-summation N-body algorithms compute the gravitational interaction between stars in an exact w...
We present a performance analysis of different parallelization schemes for direct codes used in the ...
The O(N) hierarchical N-body algorithms and Massively Parallel Processors allow particle systems of ...
Basic Linear Algebra Subroutines (BLAS-3) [1] are building blocks to solve a lot of numerical prob...
We present a new implementation of the numerical integration of the classical, gravitational, N-body...
This work considers the organization and performance of computations on parallel computers of tree...
The O(N) hierarchical N-body algorithms and Massively Parallel Processors allow particle systems of ...
The gravitational N-body algorithm of Barnes and Hut [1] has been successfully implemented on a hype...
Hybrid computational architectures based on the joint power of Central Processing Units (CPUs) and G...