International audienceNon-blocking collectives have been proposed so as to allow communications to be overlapped with computation in order to amor-tize the cost of MPI collective operations. To obtain a good overlap ratio, communications and computation have to run in parallel. To achieve this, different hardware and software techniques exists. Dedicated some cores to run progress threads is one of them. However, some CPUs provide Simultaneous Multi-Threading, which is the ability for a core to have multiple hardware threads running simultaneously, sharing the same arithmetic units. Our idea is to use them to run progress threads to avoid dedicated cores allocation. We have run benchmarks on Haswell processors , using its Hyper-Threading ca...
Supercomputers used in HPC are composed of severals inter-connected machines. Usually, they are prog...
Supercomputers used in HPC are composed of severals inter-connected machines. Usually, they are prog...
Abstract—With the increasing prominence of many-core archi-tectures and decreasing per-core resource...
International audienceNon-blocking collectives have been proposed so as to allow communications to b...
International audienceTo amortize the cost of MPI collective operations, non-blocking collectives ha...
International audienceTo amortize the cost of MPI collective operations, nonblocking collectives hav...
International audienceTo amortize the cost of MPI collective operations, nonblocking collectives hav...
International audienceTo amortize the cost of MPI collective operations, non-blocking collectives ha...
Les supercalculateurs utilisés dans le HPC sont constitués de plusieurs machines inter-connectées. G...
Les supercalculateurs utilisés dans le HPC sont constitués de plusieurs machines inter-connectées. G...
Les supercalculateurs utilisés dans le HPC sont constitués de plusieurs machines inter-connectées. G...
This talk discusses optimized collective algorithms and the benefits of leveraging independent hardw...
International audienceBy allowing computation/communication overlap, MPI nonblocking collectives (NB...
To amortize the cost of MPI collective operations, non-blocking collectives have been proposed so a...
Supercomputers used in HPC are composed of severals inter-connected machines. Usually, they are prog...
Supercomputers used in HPC are composed of severals inter-connected machines. Usually, they are prog...
Supercomputers used in HPC are composed of severals inter-connected machines. Usually, they are prog...
Abstract—With the increasing prominence of many-core archi-tectures and decreasing per-core resource...
International audienceNon-blocking collectives have been proposed so as to allow communications to b...
International audienceTo amortize the cost of MPI collective operations, non-blocking collectives ha...
International audienceTo amortize the cost of MPI collective operations, nonblocking collectives hav...
International audienceTo amortize the cost of MPI collective operations, nonblocking collectives hav...
International audienceTo amortize the cost of MPI collective operations, non-blocking collectives ha...
Les supercalculateurs utilisés dans le HPC sont constitués de plusieurs machines inter-connectées. G...
Les supercalculateurs utilisés dans le HPC sont constitués de plusieurs machines inter-connectées. G...
Les supercalculateurs utilisés dans le HPC sont constitués de plusieurs machines inter-connectées. G...
This talk discusses optimized collective algorithms and the benefits of leveraging independent hardw...
International audienceBy allowing computation/communication overlap, MPI nonblocking collectives (NB...
To amortize the cost of MPI collective operations, non-blocking collectives have been proposed so a...
Supercomputers used in HPC are composed of severals inter-connected machines. Usually, they are prog...
Supercomputers used in HPC are composed of severals inter-connected machines. Usually, they are prog...
Supercomputers used in HPC are composed of severals inter-connected machines. Usually, they are prog...
Abstract—With the increasing prominence of many-core archi-tectures and decreasing per-core resource...