Performing BMMC Permutations Efficiently on Distributed-Memory Multiprocessors with MPI

Cormen, Thomas H

Open PDF

Open link

Publication date

May 1997

Publisher

Dartmouth Digital Commons

Language

English

Abstract

This paper presents an architecture-independent method for performing BMMC permutations on multiprocessors with distributed memory. All interprocessor communication uses the MPI function MPI_Sendrecv_replace(). The number of elements and number of processors must be powers of 2, with at least one element per processor, and there is no inherent upper bound on the ratio of elements per processor. Our method transmits only data without transmitting any source or target indices, which conserves network bandwidth. When data is transmitted, the source and target processors implicitly agree on each other\u27s identity and the indices of the elements being transmitted. A C-callable implementation of our method is available from Netlib. The implemen...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Performing BMMC Permutations Efficiently on Distributed-Memory Multiprocessors with MPI

Abstract

Extracted data

Performing BMMC Permutations Efficiently on Distributed-Memory Multiprocessors with MPI

Abstract

Extracted data

Related items

Related items