In this paper, we present a topology-aware load balancing algorithm for parallel multi-core machines and its proof of asymptotic convergence to an optimal solution. The algorithm, named HwTopoLB, aims to improve application performance by reducing core idleness and communication delays. HwTopoLB was designed taking into account the properties of current parallel systems composed of multi-core compute nodes, namely their network interconnection and their complex, hierarchical core topology. The latter comprises multiple levels of cache and a memory subsystem with a NUMA design. These systems provide high processing power at the expense of asymmetric communication costs, which can hamper the performance of paralle...
This thesis presents our research to provide performance portability and scalability to complex scie...
Abstract: In this paper we consider a new approach to load balancing for parallel systems. Today’s p...
The star network is one of the promising interconnection networks for future high speed parallel com...
Current multi-core machines feature a complex and hierarchical core topology, ...
Multi-core compute nodes with non-uniform memory access (NUMA) are now a commo...
Abstract: Multi-core compute nodes with non-uniform memory access (NUMA) are now a common architectu...
Programming multicore or manycore architectures is a hard challenge particular...
The evolution of massively parallel supercomputers makes palpable two issues in...