Process placement, also called topology mapping, is a well-known strategy for improving parallel program execution by reducing the communication cost between processes. It requires two inputs: the topology of the target machine and a measure of the affinity between processes. In the literature, the dominant affinity measure is the communication matrix, which describes the amount of communication between processes. The goal of this paper is to study the accuracy of the communication matrix as a measure of affinity. We ran an extensive set of tests on two fat-tree machines and a 3D-torus machine to evaluate several hypotheses that are often made in the literature and to discuss their validity. First, we check the corr...
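To make the two inputs concrete, here is a minimal sketch of how a communication matrix and a machine topology can be combined to score a placement. It assumes a hop-weighted cost (communication volume times node distance), which is one common model in the mapping literature and is not stated in the abstract as the paper's own metric. The matrices `comm` and `dist`, the function names, and all values are hypothetical.

```python
from itertools import permutations


def hop_bytes(comm, dist, mapping):
    """Sum, over process pairs, of communication volume times core distance.

    comm[i][j]  : volume exchanged between processes i and j (symmetric)
    dist[a][b]  : distance between cores a and b in the machine topology
    mapping[i]  : core on which process i is placed
    """
    n = len(comm)
    return sum(
        comm[i][j] * dist[mapping[i]][mapping[j]]
        for i in range(n)
        for j in range(i + 1, n)
    )


def best_mapping(comm, dist):
    """Exhaustive search over placements; only viable for a few processes."""
    n = len(comm)
    return min(permutations(range(n)), key=lambda m: hop_bytes(comm, dist, m))


# Hypothetical communication matrix: processes 0-1 and 2-3 talk heavily.
comm = [
    [0, 100, 1, 1],
    [100, 0, 1, 1],
    [1, 1, 0, 100],
    [1, 1, 100, 0],
]

# Hypothetical topology: cores 0-1 share a node, cores 2-3 share another.
dist = [
    [0, 1, 4, 4],
    [1, 0, 4, 4],
    [4, 4, 0, 1],
    [4, 4, 1, 0],
]

identity = (0, 1, 2, 3)   # heavy pairs stay on the same node
scattered = (0, 2, 1, 3)  # heavy pairs are split across nodes

cost_identity = hop_bytes(comm, dist, identity)
cost_scattered = hop_bytes(comm, dist, scattered)
cost_best = hop_bytes(comm, dist, best_mapping(comm, dist))
```

Under this cost model the placement that keeps heavily communicating processes on the same node scores lower than the scattered one. Whether such a score actually predicts execution time is the kind of hypothesis the abstract says the paper sets out to test.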
MATCHING COMMUNICATION PATTERN WITH UNDERLYING HARDWARE ARCHITECTUR
The scaling of semiconductor technologies is leading to processors with increasing numbers of cores....
Communication is a necessary but overhead-inducing component of parallel programming. Its impact on ...
Process mapping (or process placement) is a useful algorithmic technique to op...
This paper studies the influence that task placement may have on the performance of applica...
Due to the advent of modern hardware architectures of high-performance comput-...
Petascale machines with hundreds of thousands of cores are being built. These machines have varying ...
Efficiently programming shared-memory machines is a difficult challenge becaus...
Programming multicore or manycore architectures efficiently is a challenge bec...
Applications’ performance is influenced by the mapping of processes to computi...
Interconnection networks in parallel platforms can be made of thousands of nod...
Current generations of NUMA node clusters feature multicore or manycore proces...
In the early years of parallel computing research, significant theoretical studies were done on inte...
The Message Passing Interface (MPI) standard defines virtual topologies that can be applied to syste...
A Resource and Job Management System (RJMS) is a crucial system software part ...