The mapping of the tasks of a parallel program onto the nodes of a parallel computing system has a significant impact on application performance. In this paper we propose an optimization framework to solve the mapping problem, which takes into account the communication matrix of the application and a cost matrix that depends on the topology of the parallel system. This cost matrix can be a distance matrix (the classic approach) or can reflect other considerations. We propose a novel definition of the cost criterion, applicable to torus networks, that tries to distribute traffic evenly over the different axes: the Traffic Distribution criterion. As the mapping problem is a particular instance of the Quadratic Assignment Problem, ...
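The classic distance-based formulation mentioned above can be illustrated with a minimal sketch. It assumes the usual Quadratic Assignment objective, the sum over task pairs of communication volume times the cost between the nodes they are mapped to, and uses hop distance on a torus as the cost matrix. The function names (`torus_distance`, `node_coords`, `mapping_cost`, `best_mapping`) are hypothetical, the exhaustive search is only a stand-in for a real optimizer on tiny instances, and the Traffic Distribution criterion is not shown because the excerpt does not define it.

```python
from itertools import permutations


def torus_distance(a, b, dims):
    """Hop count between nodes a and b on a torus (wrap-around on every axis)."""
    return sum(min(abs(x - y), d - abs(x - y)) for x, y, d in zip(a, b, dims))


def node_coords(dims):
    """Enumerate node coordinates of the torus in row-major order."""
    coords = [()]
    for d in dims:
        coords = [c + (i,) for c in coords for i in range(d)]
    return coords


def mapping_cost(comm, cost, mapping):
    """QAP objective: sum of comm[i][j] * cost[node of i][node of j]."""
    n = len(comm)
    return sum(
        comm[i][j] * cost[mapping[i]][mapping[j]]
        for i in range(n)
        for j in range(n)
    )


def best_mapping(comm, cost):
    """Exhaustive search over all task-to-node permutations (tiny cases only)."""
    n = len(comm)
    return min(permutations(range(n)), key=lambda m: mapping_cost(comm, cost, m))


# Assumed toy instance: 4 tasks on a 2x2 torus.
dims = (2, 2)
nodes = node_coords(dims)
cost = [[torus_distance(a, b, dims) for b in nodes] for a in nodes]

# Tasks 0 and 3 exchange heavy traffic, as do tasks 1 and 2.
comm = [
    [0, 0, 0, 10],
    [0, 0, 10, 0],
    [0, 10, 0, 0],
    [10, 0, 0, 0],
]

identity = (0, 1, 2, 3)
best = best_mapping(comm, cost)
print(mapping_cost(comm, cost, identity), mapping_cost(comm, cost, best))
```

In this toy instance the identity mapping places each heavily communicating pair on diagonally opposite nodes, while the best permutation places them on adjacent nodes. Exhaustive search grows factorially with the number of tasks, so realistic instances need a heuristic or an approximate QAP solver.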
Communication and topology aware process mapping is a powerful approach to reduce communication time...
The assignment of processes to processors (the mapping problem) is one of the major factors affectin...
Machines with distributed memory have the mapping problem—assigning processes to processors. In this...
Abstract. Static mapping is the assignment of parallel processes to the processing elements (PEs) of...
Petascale machines with hundreds of thousands of cores are being built. These machines have varying ...
This work explains a heuristic algorithm, task self mapping algorithm (TSMA), for mapping parallel a...
The mapping problem has been studied extensively. However, algorithms which were designed to map a p...
The orchestration of communication of distributed memory parallel applications on a parallel compute...
To do later. Keywords: Parallel environment, Distributed-memory machines, Load-balancing, Mapping...
Abstract. We present a new method for mapping applications' MPI tasks to cores of a parallel comput...
In order to minimize the execution time of a parallel application running on a heterogeneously distr...
The need for high-performance computing together with the increasing trend from single processor to ...
Abstract. The mapping problem involves the assignment of a set of tasks of a parallel application o...
Network contention has an increasingly adverse effect on the performance of parallel applications wi...
A fundamental issue affecting the performance of a parallel application running on message-passing p...