This paper describes experiments with two paral-lel implementations of the Fast Multipole Method { one using the shared memory programming model (OpenMP), the other the message passing model (MPI) { on the SGI Altix 3700 at Oak Ridge Na-tional Laboratory. The purpose of our experiments was to test our hypothesis that direct access to shared memory enabled by hardware-based cache coherence is a better match for the ne-grained communica-tion requirements of the tree-code computations of the algorithm, while the message passing version's gather/scatter approach is more ecient for the more coarse-grained interaction computation. After providing some context for the discussion with a description of the algorithm and our paral-lelizations, ...
Present and future multi-core computational system architecture attracts researchers to utilize this...
To design effective large-scale multiprocessors, designers need to understand the characteristics of...
September 24, 1993This work was performed while Kaushik Ghosh was on an internship at Kendall Square...
In this paper, we analyze the communication pattern and study the scalability of a distributed memor...
Among the algorithms that are likely to play a major role in future exascale computing, the fast mul...
Over the last few decades, Message Passing Interface (MPI) has become the parallel-communication sta...
The computational solution of large-scale linear systems of equations necessitates the use of fast a...
It has been shown that fast multipole methods can achieve good scalability on multi-core architectur...
In today's MD simulations the scaling bottleneck is shifted more and more from computation towards c...
In the last two decades, physical constraints in chip design have spawned a paradigm shift in comput...
Shared-memory and message-passing are two op- posite models to develop parallel computations. The sh...
The majority of current HPC applications are composed of complex and irregular data structures that ...
This dissertation considers the use of data prefetching and an alternative mechanism, data forwardin...
Two paradigms for distributed shared memory on loosely‐coupled computing systems are compared: the s...
Shared memory is the most popular parallel programming model for multi-core processors, while messag...
Present and future multi-core computational system architecture attracts researchers to utilize this...
To design effective large-scale multiprocessors, designers need to understand the characteristics of...
September 24, 1993This work was performed while Kaushik Ghosh was on an internship at Kendall Square...
In this paper, we analyze the communication pattern and study the scalability of a distributed memor...
Among the algorithms that are likely to play a major role in future exascale computing, the fast mul...
Over the last few decades, Message Passing Interface (MPI) has become the parallel-communication sta...
The computational solution of large-scale linear systems of equations necessitates the use of fast a...
It has been shown that fast multipole methods can achieve good scalability on multi-core architectur...
In today's MD simulations the scaling bottleneck is shifted more and more from computation towards c...
In the last two decades, physical constraints in chip design have spawned a paradigm shift in comput...
Shared-memory and message-passing are two op- posite models to develop parallel computations. The sh...
The majority of current HPC applications are composed of complex and irregular data structures that ...
This dissertation considers the use of data prefetching and an alternative mechanism, data forwardin...
Two paradigms for distributed shared memory on loosely‐coupled computing systems are compared: the s...
Shared memory is the most popular parallel programming model for multi-core processors, while messag...
Present and future multi-core computational system architecture attracts researchers to utilize this...
To design effective large-scale multiprocessors, designers need to understand the characteristics of...
September 24, 1993This work was performed while Kaushik Ghosh was on an internship at Kendall Square...