Reduction is a core operation in parallel computing that combines distributed elements into a single result. Optimizing its cost may greatly reduce application execution time, notably in MPI and MapReduce computations. In this paper, we propose an algorithm for scheduling associative reductions. We focus on the case where communications and computations can be overlapped to fully exploit resources. Our algorithm greedily builds a spanning tree, starting from the root and adding one child at each iteration. Bounds on the completion time of optimal schedules are then characterized. To show the algorithm's extensibility, we adapt it to model variations in which either communication or computation resources are limite...
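As a rough illustration of the tree-building idea described in the abstract, the sketch below grows a spanning tree from the root, one child per iteration, and then reduces values along that tree. The selection rule (`depth + number of children`) and the names `build_greedy_tree` and `tree_reduce` are assumptions made for this example only; the abstract does not state the paper's actual criterion or cost model, and the sketch does not model communication/computation overlap.

```python
from typing import Callable, Dict, List, TypeVar

T = TypeVar("T")


def build_greedy_tree(n: int, root: int = 0) -> Dict[int, List[int]]:
    """Grow a spanning tree over nodes 0..n-1, adding one child per iteration.

    Placeholder greedy rule (an assumption, not the paper's criterion):
    attach the next node to the tree node minimizing depth + child count,
    breaking ties in favor of the node that joined the tree first.
    """
    children: Dict[int, List[int]] = {root: []}
    depth: Dict[int, int] = {root: 0}
    for node in range(n):
        if node == root:
            continue
        # min() scans the dict in insertion order, so ties go to older nodes.
        parent = min(children, key=lambda p: depth[p] + len(children[p]))
        children[parent].append(node)
        children[node] = []
        depth[node] = depth[parent] + 1
    return children


def tree_reduce(
    children: Dict[int, List[int]],
    values: List[T],
    op: Callable[[T, T], T],
    node: int = 0,
) -> T:
    """Combine the value at `node` with the reduced results of its subtrees.

    Values are combined in tree order, so matching a left-to-right fold is
    only guaranteed here when `op` is commutative as well as associative.
    """
    acc = values[node]
    for child in children[node]:
        acc = op(acc, tree_reduce(children, values, op, child))
    return acc


if __name__ == "__main__":
    tree = build_greedy_tree(6)
    values = [1, 2, 3, 4, 5, 6]
    total = tree_reduce(tree, values, lambda a, b: a + b)
    assert total == sum(values)
    print(tree, total)
```

With this placeholder rule the root keeps receiving children while deeper nodes fill in, which gives a shallow, bushy tree rather than a chain; a real scheduler would replace the rule with one derived from the communication and computation costs.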
In this paper, we present an algorithm that builds optimal schedules for compl...
This paper considers the problem of scheduling a tree-structured task system in a distribute...
We present here an n^(τ+1) algorithm for optimally scheduling a dag of n nodes on a multiproces...
Reduction is a core operation in parallel computing. Optimizing its cost has a high potential impact...
Collective communications are ubiquitous in parallel applications. We present two new algor...
This paper addresses the problem of designing a parallel reduction architecture for applicative lang...
Abstract: "This paper studies the relationship between parallel computation cost and communication c...
This thesis explores a fundamental issue in large-scale parallel computing: how to schedule tasks on...
We consider a family of jobs that are organized as a task-tree which, in particular, capture...
Multiple applications that execute concurrently on heterogeneous platforms com...
Consider a scheduling problem of parallel computations in multiprocessor systems. Let a parallel pro...
We study the problem of scheduling a parallel computation so as to minimize the maximum numb...
The dissertation studies how distributed devices that are disconnected for long and unknown periods ...
This paper tackles the problem of parallelizing heterogeneous computational tasks across a number of...