International audienceThe current trend in clusters architecture leads toward a massive use of multicore chips. This hardware evolution raises bottleneck issues at the network interface level. The use of multiple parallel networks allows to overcome this problem as it provides an higher aggregate bandwidth. But this bandwidth remains theoretical as only a few communication libraries are able to exploit multiple networks. In this paper, we present an optimization strategy for the NewMadeleine communication library. This strategy is able to efficiently exploit parallel interconnect links. By sampling each network's capabilities, it is possible to estimate a transfer duration a priori. Splitting messages and sending chunks of messages over par...
International audienceAs the number of cores per node increases in modern clusters, intra-node commu...
International audienceRunning parallel applications on clusters with high-speed local networks requi...
International audienceCurrent generations of NUMA node clusters feature multicore or manycore proces...
International audienceThis paper focuses on message transfers across multiple heterogeneous high-per...
International audienceCommunication performance is a critical issue in HPC applications, and many so...
International audienceThe current trend in clusters leads towards an increase of the number of cores...
International audienceThis paper describes how the NewMadeleine communication library has been integ...
International audienceMulticore processors have not only reintroduced Non-Uniform Memory Access (NUM...
International audienceCommunication libraries have dramatically made progress over the fifteen years...
International audienceAlthough processors become massively multicore and therefore new programming m...
In the area of cluster computing, InfiniBand is becoming increasingly popular due to its open standa...
International audienceSince the advent of multi-core processors, the physionomy of typical clusters ...
AbstractModern multi-core design will continue Moore's law and facilitate platforms for both wired a...
This paper introduces the new version of the Madeleine portable multi-protocol communication library...
In this work we analyze the communication load imbalance generated by irregular-data applications ru...
International audienceAs the number of cores per node increases in modern clusters, intra-node commu...
International audienceRunning parallel applications on clusters with high-speed local networks requi...
International audienceCurrent generations of NUMA node clusters feature multicore or manycore proces...
International audienceThis paper focuses on message transfers across multiple heterogeneous high-per...
International audienceCommunication performance is a critical issue in HPC applications, and many so...
International audienceThe current trend in clusters leads towards an increase of the number of cores...
International audienceThis paper describes how the NewMadeleine communication library has been integ...
International audienceMulticore processors have not only reintroduced Non-Uniform Memory Access (NUM...
International audienceCommunication libraries have dramatically made progress over the fifteen years...
International audienceAlthough processors become massively multicore and therefore new programming m...
In the area of cluster computing, InfiniBand is becoming increasingly popular due to its open standa...
International audienceSince the advent of multi-core processors, the physionomy of typical clusters ...
AbstractModern multi-core design will continue Moore's law and facilitate platforms for both wired a...
This paper introduces the new version of the Madeleine portable multi-protocol communication library...
In this work we analyze the communication load imbalance generated by irregular-data applications ru...
International audienceAs the number of cores per node increases in modern clusters, intra-node commu...
International audienceRunning parallel applications on clusters with high-speed local networks requi...
International audienceCurrent generations of NUMA node clusters feature multicore or manycore proces...