In this work we analyze the communication load imbalance generated by irregular-data applications running on a multi-node cluster. We evaluate experimental approaches to reducing this imbalance using a hybrid MPI+OpenMP programming model. The optimizations considered are computation-communication overlap, issuing communications in parallel, and a new proposal based on message fragmentation that takes advantage of the eager protocol. Performance results show that overlapped versions benefit greatly from message fragmentation because it avoids switching to the rendezvous protocol. However, non-overlapped versions showed better performance than overlapped ones. To also evaluate the impact of network latency, the ...
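The message fragmentation idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `fragment_message` and the constant `ASSUMED_EAGER_LIMIT` are hypothetical, and the 16 KiB value is only a placeholder, since the real eager limit depends on the MPI implementation and the network. The sketch only computes the (offset, length) pairs of the fragments; in an actual MPI code each fragment would presumably be handed to a nonblocking send such as `MPI_Isend`.

```python
from typing import List, Tuple

# Assumed eager limit in bytes. This is a placeholder: real limits are
# implementation- and network-dependent.
ASSUMED_EAGER_LIMIT = 16 * 1024


def fragment_message(
    total_bytes: int, eager_limit: int = ASSUMED_EAGER_LIMIT
) -> List[Tuple[int, int]]:
    """Split a message into (offset, length) fragments.

    Every fragment length is at most eager_limit, so each piece stays
    small enough to be eligible for the eager protocol under the
    assumed limit.
    """
    if total_bytes < 0:
        raise ValueError("total_bytes must be non-negative")
    if eager_limit <= 0:
        raise ValueError("eager_limit must be positive")

    fragments = []
    offset = 0
    while offset < total_bytes:
        # The last fragment may be shorter than the limit.
        length = min(eager_limit, total_bytes - offset)
        fragments.append((offset, length))
        offset += length
    return fragments


if __name__ == "__main__":
    # A 40 KiB message split under the assumed 16 KiB limit.
    for offset, length in fragment_message(40 * 1024):
        print(offset, length)
```

The trade-off in such a scheme is that more, smaller messages add per-message overhead, so whether fragmentation pays off depends on the measured cost of the rendezvous handshake on the target system.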
Cluster computing has emerged as a primary and cost-effective platform for running parallel applicat...
The performance of MPI implementation operations still presents critical issues for high performance...
In the early years of parallel computing research, significant theoretical studies were done on inte...
In modern MPI applications, communication between separate computational nodes quickly adds up to a s...
Heterogeneity is becoming quite common in distributed parallel computing systems, both in processor ...
Abstract—Cluster computing has emerged as a primary and cost-effective platform for running parallel...
With the advent of cheap and powerful hardware for workstations and networks, a new cluster-based ...
Conventional wisdom suggests that the most efficient use of modern computing clusters employs techni...
This paper demonstrates that the one-sided communication used in languages like UPC can provide a signifi...
With the current continuation of Moore’s law and the presumed end of improved single core performanc...
The original publication can be found at www.springerlink.com. This paper gives an overview of two rel...
The performance evaluation of multiprocessor interconnects cannot be divorced from issues of traffic...
This work presents and evaluates algorithms for MPI collective communication operations on high perf...
Although logically available, applications may not exploit enough instantaneous communication concur...