We have implemented eight of the MPI collective routines using MPI point-to-point communication routines with algorithms designed to be efficient for large messages. The performance of our implementations of these collective routines is compared with the vendor implementations on the Cray T3E-600, the Cray Origin 2000 and the IBM SP. Many of our implementations significantly outperformed the vendor implementations on the T3E and the Origin 2000. On the SP, only our implementation of the broadcast significantly outperformed IBM's implementation.

Keywords: MPI; Collective Communication Routines for Large Messages; Cray T3E; Origin 2000; IBM SP.

1 Introduction

Today, MPI [15] is probably the most used message passing library for program...
The performance of collective communication is critical to the overall system performance. In gene...
The performance of MPI implementation operations still presents critical issues for high performance...
The IBM SP-2 has become a popular MPP for the scientific community. Its programming environment includes se...
We evaluate the architectural support of collective communication operations on the IBM SP2, Cray T3...
The Message Passing Interface standard, released in April 1994 by the MPI Forum [2], defines a set of...
Collective communication is an important subset of the Message Passing Interface. Improving the perform...
We discuss the design and high-performance implementation of collective communications operations on...
In this paper the parallel benchmark code PSTSWM is used to evaluate the performance of the vendor-s...
We give an overview of the algorithms and implementations in the high-performance MPI libraries MPI/...
Many parallel applications from scientific computing use MPI collective communication operations to ...
In order for collective communication routines to achieve high performance on different platforms, t...
Parallel computing on clusters of workstations and personal computers has very high potential, since...