Scalability to large number of processes is one of the weaknesses of current MPI implementations. Standard implementations are able to scale to hundreds of nodes, but not beyond. The main problem in these implementations is that they assume some resources (for both data and control-data) will always be available to receive/process unexpected messages. As we will show, this is not always true, especially in short-memory machines like the BG/L that has 64K nodes but each node only has 512Mbytes of memory. The objective of this paper is to present one algorithm that improves the robustness of MPI implementations for short-memory MPPs, taking care of data and control-data reception, the system will scale up to any number of nodes. The proposed ...
Collective communications occupy 20-90% of total execution times in many MPI applications. In this p...
Reliability is increasingly becoming a challenge for highperformance computing (HPC) systems with th...
InfiniBand (IB) is a popular network technology for modern high-performance computing systems. MPI i...
Scalability to a large number of processes is one of the weaknesses of current MPI implementations. ...
Scalability to a large number of processes is one of the weaknesses of current MPI implementations. ...
Scalability to a large number of processes is one of the weaknesses of current MPI implementations. ...
With processor speeds no longer doubling every 18-24 months owing to the exponential increase in pow...
This paper presents a portable optimization for MPI communications, called PRAcTICaL-MPI (Portable A...
The processing of massive amounts of data on clusters with finite amount of memory has become an imp...
Many parallel applications from scientific computing use MPI collective communication operations to ...
The most widely used node type in high-performance computing nowadays is a 2-socket server node. The...
We examine the mechanics of the send and receive mechanism of MPI and in particular how we can impl...
This work presents and evaluates algorithms for MPI collective communication operations on high perf...
We examine the mechanics of the send and receive mechanism of MPI and in particular how we can imple...
In exascale computing era, applications are executed at larger scale than ever before, whichresults ...
Collective communications occupy 20-90% of total execution times in many MPI applications. In this p...
Reliability is increasingly becoming a challenge for highperformance computing (HPC) systems with th...
InfiniBand (IB) is a popular network technology for modern high-performance computing systems. MPI i...
Scalability to a large number of processes is one of the weaknesses of current MPI implementations. ...
Scalability to a large number of processes is one of the weaknesses of current MPI implementations. ...
Scalability to a large number of processes is one of the weaknesses of current MPI implementations. ...
With processor speeds no longer doubling every 18-24 months owing to the exponential increase in pow...
This paper presents a portable optimization for MPI communications, called PRAcTICaL-MPI (Portable A...
The processing of massive amounts of data on clusters with finite amount of memory has become an imp...
Many parallel applications from scientific computing use MPI collective communication operations to ...
The most widely used node type in high-performance computing nowadays is a 2-socket server node. The...
We examine the mechanics of the send and receive mechanism of MPI and in particular how we can impl...
This work presents and evaluates algorithms for MPI collective communication operations on high perf...
We examine the mechanics of the send and receive mechanism of MPI and in particular how we can imple...
In exascale computing era, applications are executed at larger scale than ever before, whichresults ...
Collective communications occupy 20-90% of total execution times in many MPI applications. In this p...
Reliability is increasingly becoming a challenge for highperformance computing (HPC) systems with th...
InfiniBand (IB) is a popular network technology for modern high-performance computing systems. MPI i...