Abstract. Over the last decade, the Message Passing Interface (MPI) has become a very successful parallel programming environment for distributed memory architectures such as clusters. However, the architecture of cluster nodes is currently evolving from small symmetric shared memory multiprocessors towards massively multicore, Non-Uniform Memory Access (NUMA) hardware. Although regular MPI implementations use numerous optimizations to realize zero-copy, cache-oblivious data transfers within shared-memory nodes, they might prevent applications from achieving most of the hardware's performance simply because the scheduling of heavyweight processes is not flexible enough to dynamically fit the underlying hardware topology. This explains wh...
Communication hardware and software have a significant impact on the performance of clusters and sup...
Parallel computing on clusters of workstations and personal computers has very high potential, since...
Abstract—With the increasing prominence of many-core architectures and decreasing per-core resource...
Message-Passing Interface (MPI) has become a standard for parallel application...
The Message Passing Interface (MPI) is widely used to write sophisticated parallel applications rang...
The symmetric multiprocessing (SMP) cluster system, which consists of shared memory nodes with sever...
The mixing of shared memory and message passing programming models within a single application has o...
Many-core architectures, such as the Intel Xeon Phi, provide dozens of cores and hundreds of hardwar...
In the exascale computing era, applications are executed at a larger scale than ever before, which results ...
Proceedings of: First International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2014...
With the end of Dennard scaling, future high performance computers are expected to consist of distri...
Holistic tuning and optimization of hybrid MPI and OpenMP applications is becoming a focus for paralle...
Supercomputing applications rely on strong scaling to achieve faster results on a larger number of p...
MPI is a message-passing standard widely used for developing high-performance parallel applications....