Reduction is a core operation in parallel computing. Optimizing its cost has a high potential impact on the application execution time, particularly in MPI and MapReduce computations. In this paper, we propose an optimal algorithm for scheduling associative reductions. We focus on the case where communications and computations can be overlapped to fully exploit resources. Our algorithm greedily builds a spanning tree by starting from the sink and by adding a parent at each iteration. Bounds on the completion time of optimal schedules are then characterized. To show the algorithm extensibility, we adapt it to model variations in which either communication or computation resources are limited. Moreover, we study two specific spanning trees: w...
This thesis focus on the problem of scheduling the tasks of a parallel application taking into accou...
The classical redistribution problem aims at optimally scheduling communications when moving from an...
This thesis mainly deals with the mapping and the scheduling of applications on large heteroge- neou...
Reduction is a core operation in parallel computing. Optimizing its cost has a high potential impact...
Heterogeneous computing systems became a popular and powerful platform, containing several heterogen...
In this "Licence to supervise research" (HDR in French) thesis, we present a study of distributed as...
This paper investigates co-scheduling algorithms for processing a set of parallel applications. Inst...
In this thesis we study the behavior of parallel applications represented by a precedence graph. The...
Abstract—Collective communications are ubiquitous in parallel applications. We present two new algor...
Scientific applications are commonly modeled as the processing of directed acyclicgraphs of tasks, a...
Scientific workloads are often described as directed acyclic task graphs. In this paper, we focus o...
This paper is devoted to mapping iterative algorithms onto heterogeneous clusters. The application d...
Multiple applications that execute concurrently on heterogeneous platforms compete for CPU and netwo...
Applications in many scientific and engineering domains are structured in large numbers of independe...
In few years, parallel computers have been designed and have been widely developped. The main techni...
This thesis focus on the problem of scheduling the tasks of a parallel application taking into accou...
The classical redistribution problem aims at optimally scheduling communications when moving from an...
This thesis mainly deals with the mapping and the scheduling of applications on large heteroge- neou...
Reduction is a core operation in parallel computing. Optimizing its cost has a high potential impact...
Heterogeneous computing systems became a popular and powerful platform, containing several heterogen...
In this "Licence to supervise research" (HDR in French) thesis, we present a study of distributed as...
This paper investigates co-scheduling algorithms for processing a set of parallel applications. Inst...
In this thesis we study the behavior of parallel applications represented by a precedence graph. The...
Abstract—Collective communications are ubiquitous in parallel applications. We present two new algor...
Scientific applications are commonly modeled as the processing of directed acyclicgraphs of tasks, a...
Scientific workloads are often described as directed acyclic task graphs. In this paper, we focus o...
This paper is devoted to mapping iterative algorithms onto heterogeneous clusters. The application d...
Multiple applications that execute concurrently on heterogeneous platforms compete for CPU and netwo...
Applications in many scientific and engineering domains are structured in large numbers of independe...
In few years, parallel computers have been designed and have been widely developped. The main techni...
This thesis focus on the problem of scheduling the tasks of a parallel application taking into accou...
The classical redistribution problem aims at optimally scheduling communications when moving from an...
This thesis mainly deals with the mapping and the scheduling of applications on large heteroge- neou...