International audienceConsidering the large number of processors and the size of the interconnection networks on exascale-capable supercomputers, mapping concurrently executable and communicating tasks of an application is a complex problem that needs to be dealt with care. For parallel applications, the communication overhead can be a significant bottleneck on scalability. Topology-aware task-mapping methods that map the tasks to the processors (i.e., cores) by exploiting the underlying network information are very effective to avoid, or at worst bend, this limitation. We propose novel, efficient, and effective task mapping algorithms employing a graph model. The experiments show that the methods are faster than the existing approaches pro...
A fundamental issue affecting the performance of a parallel application running on message-passing p...
International audienceDue to the advent of modern hardware architectures of high-performance comput-...
Network contention has an increasingly adverse effect on the performance of parallel applications wi...
International audienceConsidering the large number of processors and the size of the interconnection...
Abhinav Bhatele, Ph.D. student at the Parallel Programming Lab at the University of Illinois present...
Petascale machines with hundreds of thousands of cores are being built. These machines have varying ...
Abstract As the scale of supercomputers grows, so does the size of the in-terconnect network. Topolo...
Topology aware mapping has started to attain interest again by the development of supercomputers who...
Abstract—We present a new method for mapping applica-tions ’ MPI tasks to cores of a parallel comput...
International audienceCommunication between tasks and load imbalance have been identified as a major...
To execute a parallel program on a multicomputer system, the tasks of the program have to be mapped ...
To execute a parallel program on a multicomputer system, the tasks of the program have to be mapped ...
Obtaining the best performance from a parallel program involves four important steps: 1. Choice of t...
The orchestration of communication of distributed memory parallel applications on a parallel compute...
Large-scale multiprocessor computers have numerous communicating components, and therefore place gre...
A fundamental issue affecting the performance of a parallel application running on message-passing p...
International audienceDue to the advent of modern hardware architectures of high-performance comput-...
Network contention has an increasingly adverse effect on the performance of parallel applications wi...
International audienceConsidering the large number of processors and the size of the interconnection...
Abhinav Bhatele, Ph.D. student at the Parallel Programming Lab at the University of Illinois present...
Petascale machines with hundreds of thousands of cores are being built. These machines have varying ...
Abstract As the scale of supercomputers grows, so does the size of the in-terconnect network. Topolo...
Topology aware mapping has started to attain interest again by the development of supercomputers who...
Abstract—We present a new method for mapping applica-tions ’ MPI tasks to cores of a parallel comput...
International audienceCommunication between tasks and load imbalance have been identified as a major...
To execute a parallel program on a multicomputer system, the tasks of the program have to be mapped ...
To execute a parallel program on a multicomputer system, the tasks of the program have to be mapped ...
Obtaining the best performance from a parallel program involves four important steps: 1. Choice of t...
The orchestration of communication of distributed memory parallel applications on a parallel compute...
Large-scale multiprocessor computers have numerous communicating components, and therefore place gre...
A fundamental issue affecting the performance of a parallel application running on message-passing p...
International audienceDue to the advent of modern hardware architectures of high-performance comput-...
Network contention has an increasingly adverse effect on the performance of parallel applications wi...