International audienceThe current trend in clusters leads towards an increase of the number of cores per node. As a result, an increasing number of parallel applications is mixing message passing and multithreading as an attempt to better match the underlying architecture's structure. This naturally raises the problem of designing efficient, multithreaded implementations of MPI. In this paper, we present the design of a multithreaded communication engine able to exploit idle cores to speed up communications in two ways: it can move CPU-intensive operations out of the critical path (e.g. PIO transfers offload), and is able to let rendezvous transfers progress asynchronously. We have implemented these methods in the PM2 software suite, evalua...
Frank Cappello (Rapporteur), Thierry Priol (Rapporteur), Françoise Baude (Examinatrice), Jacques Bri...
La tendance actuelle des constructeurs pour le calcul scientifique est à l'utilisation de grappes de...
Many-core architectures, such as the Intel Xeon Phi, provide dozens of cores and hundreds of hardwar...
International audienceThe current trend in clusters leads towards an increase of the number of cores...
International audienceRecent cluster architectures include dozens of cores per node, with all cores ...
International audienceThe current trend in clusters architecture leads toward a massive use of multi...
International audienceAlthough processors become massively multicore and therefore new programming m...
International audienceSince the advent of multi-core processors, the physionomy of typical clusters ...
International audienceTo amortize the cost of MPI collective operations, non-blocking collectives ha...
International audienceRecent cluster architectures include dozens of cores per node, with all cores ...
In exascale computing era, applications are executed at larger scale than ever before, whichresults ...
International audienceThis paper describes how the NewMadeleine communication library has been integ...
Abstract. With the ever-increasing numbers of cores per node on HPC systems, applications are increa...
International audienceNon-blocking collectives have been proposed so as to allow communications to b...
Abstract: Multicore is an integrated circuit chip that uses two or more computational engines (cores...
Frank Cappello (Rapporteur), Thierry Priol (Rapporteur), Françoise Baude (Examinatrice), Jacques Bri...
La tendance actuelle des constructeurs pour le calcul scientifique est à l'utilisation de grappes de...
Many-core architectures, such as the Intel Xeon Phi, provide dozens of cores and hundreds of hardwar...
International audienceThe current trend in clusters leads towards an increase of the number of cores...
International audienceRecent cluster architectures include dozens of cores per node, with all cores ...
International audienceThe current trend in clusters architecture leads toward a massive use of multi...
International audienceAlthough processors become massively multicore and therefore new programming m...
International audienceSince the advent of multi-core processors, the physionomy of typical clusters ...
International audienceTo amortize the cost of MPI collective operations, non-blocking collectives ha...
International audienceRecent cluster architectures include dozens of cores per node, with all cores ...
In exascale computing era, applications are executed at larger scale than ever before, whichresults ...
International audienceThis paper describes how the NewMadeleine communication library has been integ...
Abstract. With the ever-increasing numbers of cores per node on HPC systems, applications are increa...
International audienceNon-blocking collectives have been proposed so as to allow communications to b...
Abstract: Multicore is an integrated circuit chip that uses two or more computational engines (cores...
Frank Cappello (Rapporteur), Thierry Priol (Rapporteur), Françoise Baude (Examinatrice), Jacques Bri...
La tendance actuelle des constructeurs pour le calcul scientifique est à l'utilisation de grappes de...
Many-core architectures, such as the Intel Xeon Phi, provide dozens of cores and hundreds of hardwar...