We give an overview of the algorithms and implementations in the high-performance MPI libraries MPI/SX and MPI/ES of some of the most important collective operations of MPI (the Message Passing Interface). The infrastructure of MPI/SX makes it easy to incorporate new algorithms and algorithms for common special cases (e.g. a single SX node, or a single MPI process per SX node). Algorithms that are among the best known are employed, and special hardware features of the SX architecture and Internode Crossbar Switch (IXS) are exploited wherever possible. We discuss in more detail the implementation of MPI Barrier, MPI Bcast, the MPI reduction collectives, MPI Alltoall, and the gather/scatter collectives. Performance figures and comparisons to ...
In this article we recount the sequence of steps by which MPICH, a high-performance, portable implem...
Many parallel applications from scientific computing use MPI collective communication operations to ...
Accelerators have revolutionised the high performance computing (HPC) community. Despite their advan...
We have implemented eight of the MPI collective routines using MPI point-to-point communication rou...
This work presents and evaluates algorithms for MPI collective communication operations on high perf...
In order for collective communication routines to achieve high performance on different platforms, t...
The performance of collective communication operations is one of the deciding factors in the overa...
Collective communication is an important subset of Message Passing Interface. Improving the perform...
The performance of MPI implementation operations still presents critical issues for high performance...
In this article we recount the sequence of steps by which MPICH, a high-performance, portable implem...
Further performance improvements of parallel simulation applications will not be reached by simply s...
We have implemented eight of the MPI collective routines using MPI point-to-point communication rou...
This paper presents an implementation of the Message Passing Interface called PACX-MPI. The major go...
In exascale computing era, applications are executed at larger scale than ever before, whichresults ...
The message passing interface standard released in April 1994 by the MPI Forum [2], defines a set of...
In this article we recount the sequence of steps by which MPICH, a high-performance, portable implem...
Many parallel applications from scientific computing use MPI collective communication operations to ...
Accelerators have revolutionised the high performance computing (HPC) community. Despite their advan...
We have implemented eight of the MPI collective routines using MPI point-to-point communication rou...
This work presents and evaluates algorithms for MPI collective communication operations on high perf...
In order for collective communication routines to achieve high performance on different platforms, t...
The performance of collective communication operations is one of the deciding factors in the overa...
Collective communication is an important subset of Message Passing Interface. Improving the perform...
The performance of MPI implementation operations still presents critical issues for high performance...
In this article we recount the sequence of steps by which MPICH, a high-performance, portable implem...
Further performance improvements of parallel simulation applications will not be reached by simply s...
We have implemented eight of the MPI collective routines using MPI point-to-point communication rou...
This paper presents an implementation of the Message Passing Interface called PACX-MPI. The major go...
In exascale computing era, applications are executed at larger scale than ever before, whichresults ...
The message passing interface standard released in April 1994 by the MPI Forum [2], defines a set of...
In this article we recount the sequence of steps by which MPICH, a high-performance, portable implem...
Many parallel applications from scientific computing use MPI collective communication operations to ...
Accelerators have revolutionised the high performance computing (HPC) community. Despite their advan...