In this work we analyze the communication load imbalance generated by irregular-data applications running in a multi-node cluster. Experimental approaches to diminish communication load imbalance are evaluated using a hybrid programming model MPI+OpenMP including certain optimizations like computation-communication overlap, issuing communications in parallel and a new proposal based on message fragmentation in order to take advantage of the eager-protocol. Performance results show that overlapped versions can obtain a great benefit of this optimization because it avoids switching to rendez-vous protocols. However, non-overlapped versions showed better performance than overlapped ones. To evaluate also the impact due to network latency, the ...
With the current continuation of Moore’s law and the presumed end of improved single core performanc...
This paper demonstrates the one-sided communication used in languages like UPC can provide a signifi...
International audienceCommunication performance is a critical issue in HPC applications, and many so...
In this work we analyze the communication load imbalance generated by irregular-data applications ru...
International audienceOverlapping communications with computations in distributed applications shoul...
Cluster computing has emerged as a primary and cost-effective platform for running parallel applicat...
Abstract—Cluster computing has emerged as a primary and cost-effective platform for running parallel...
This work presents and evaluates algorithms for MPI collective communication operations on high perf...
International audienceTo amortize the cost of MPI communications, distributed parallel HPC applicati...
In modern MPI applications, communication between separate computational nodes quickly add up to a s...
International audienceTo amortize the cost of MPI communications, distributed parallel HPC applicati...
International audienceThis paper focuses on message transfers across multiple heterogeneous high-per...
Conventional wisdom suggests that the most efficient use of modern computing clusters employs techni...
International audienceThe current trend in clusters architecture leads toward a massive use of multi...
In exascale computing era, applications are executed at larger scale than ever before, whichresults ...
With the current continuation of Moore’s law and the presumed end of improved single core performanc...
This paper demonstrates the one-sided communication used in languages like UPC can provide a signifi...
International audienceCommunication performance is a critical issue in HPC applications, and many so...
In this work we analyze the communication load imbalance generated by irregular-data applications ru...
International audienceOverlapping communications with computations in distributed applications shoul...
Cluster computing has emerged as a primary and cost-effective platform for running parallel applicat...
Abstract—Cluster computing has emerged as a primary and cost-effective platform for running parallel...
This work presents and evaluates algorithms for MPI collective communication operations on high perf...
International audienceTo amortize the cost of MPI communications, distributed parallel HPC applicati...
In modern MPI applications, communication between separate computational nodes quickly add up to a s...
International audienceTo amortize the cost of MPI communications, distributed parallel HPC applicati...
International audienceThis paper focuses on message transfers across multiple heterogeneous high-per...
Conventional wisdom suggests that the most efficient use of modern computing clusters employs techni...
International audienceThe current trend in clusters architecture leads toward a massive use of multi...
In exascale computing era, applications are executed at larger scale than ever before, whichresults ...
With the current continuation of Moore’s law and the presumed end of improved single core performanc...
This paper demonstrates the one-sided communication used in languages like UPC can provide a signifi...
International audienceCommunication performance is a critical issue in HPC applications, and many so...