An MPI library performs two tasks: computation on behalf of the application, and communication, in the form of sending and receiving messages among the processes that form the application. Efficient communication is key to a high-performance MPI library, and specialized interconnect technologies have been a common way to achieve it. However, these custom technologies lack the portability and simplicity of a generic communication solution such as TCP over Ethernet. This thesis first shows that, although TCP is a higher-overhead protocol than UDP, it performs better as a messaging medium because UDP incurs library-level reliability overheads. Then, the thesis presents a technique to separate computation and communic...
We describe the design and implementation of MPI-NP, a Myrinet communication system tailored to sup...
Message Passing Interface[2] is the de facto standard for multicomputer and cluster message passing;...
This paper describes an Operating System approach to the problem of delivering low latency high band...
This paper describes the basic concepts of our solution to improve the performance of Ethernet Commu...
This paper compares the performance of Myrinet and Ethernet as a communication substrate for MPI li...
This work covers the implementation of a raw Ethernet communication module for the Open MPI message ...
Compiled communication has recently been proposed to improve communication performance for clusters ...
There is a need for systems to provide additional processing to extract useful information from the ...
Parallel computing on clusters of workstations and personal computers has very high potential, since...
Modern high-speed interconnection networks are designed with capabilities to support commun...
Parallel computing on clusters of workstations and personal computers has very high potential, sinc...
Message Passing Interface is widely used for Parallel and Distributed Computing. MPICH and LAM are p...