In the area of cluster computing, InfiniBand is becoming increasingly popular due to its open standard and high performance. However, even with InfiniBand, network bandwidth can still become the performance bottleneck for some of today’s most demanding applications. In this paper, we study how to overcome the bandwidth bottleneck by using multirail networks. We present different ways of setting up multirail networks with InfiniBand and propose a unified MPI design that can support all of these approaches. We also discuss various important design issues and provide in-depth discussions of different policies for using multirail networks, including an adaptive striping scheme that can dynamically change the striping paramete...
Multicore processors have not only reintroduced Non-Uniform Memory Access (NUM...
With the Top500 list from June 2004, cluster systems exceeded not only the 50 % threshold in number ...
The MPI_Barrier() call can be crucial for several applications and has been the target of different opti...
Clusters of several thousand nodes interconnected with InfiniBand, an emerging high-performance inte...
The MPI_Barrier-collective operation, as a part of the MPI-1.1 standard, is extremely important for ...
The performance of collective communication operations is one of the deciding factors in the overa...
The performance of MPI implementation operations still presents critical issues for high performance...
Large scale InfiniBand clusters are becoming increasingly popular, as reflected by the TOP 500 Super...
Recently, InfiniBand Architecture (IBA) has been proposed as the next generation interconnect for I/...
We present a microbenchmark suite to evaluate InfiniBand™ implementations with regard to single ...
A recent trend in high performance computing shows a rising number of cores per compute node, while ...
uDAPL is a portable and platform independent communication library that provides RDMA as well as sen...
Network congestion is an important factor affecting the performance of large scale jobs in ...