Clusters of several thousand nodes interconnected with InfiniBand, an emerging high-performance interconnect, have already appeared in the Top 500 list. The next-generation InfiniBand clusters are expected to be even larger, with tens of thousands of nodes. A high-performance, scalable MPI design is crucial for MPI applications in order to exploit the massive potential for parallelism in these very large clusters. MVAPICH is a popular implementation of MPI over InfiniBand based on its reliable connection-oriented model. The requirement of this model to make communication buffers available for each connection imposes a memory scalability problem. In order to mitigate this issue, the latest InfiniBand standard includes a new feature calle...
The performance of MPI implementation operations still presents critical issues for high performance...
Many existing MPI-2 one-sided communication implementations are built on top of MPI send/receive op...
The rapid growth of supercomputing systems, both in scale and complexity, has been accompan...
In the area of cluster computing, InfiniBand is becoming increasingly popular due to its open standa...
With the Top500 list from June 2004, cluster systems exceeded not only the 50 % threshold in number ...
Recently, InfiniBand Architecture (IBA) has been proposed as the next generation interconnect for I/...
Although InfiniBand Architecture is relatively new in the high performance computing area, it offers ...
The MPI_Barrier-collective operation, as a part of the MPI-1.1 standard, is extremely important for ...
Fast and scalable process startup is one of the major challenges in parallel computing over large sc...
The performance of collective communication operations is one of the deciding factors in the overa...
Looking at the TOP 500 list of supercomputers we can see that different architectures and networking...
This work presents and evaluates algorithms for MPI collective communication operations on high perf...
For several years, MPI has been the de facto standard for writing parallel applications. One of the ...
Large scale InfiniBand clusters are becoming increasingly popular, as reflected by the TOP 500 Super...