This paper studies the performance of parallel stream computations on a multiprocessor architecture using a work-stealing strategy. Incoming tasks are split in a number of jobs allocated to the processors and whenever a processor becomes idle, it steals a fraction (typically half) of the jobs from a busy processor. We propose a new model for the performance analysis of such parallel stream computations. This model takes into account both the algorithmic behavior of work-stealing as well as the hardware constraints of the architecture (synchronizations and bus contentions). Then, we show that this model can be solved using a recursive formula. We further show that this recursive analytical approach is more efficient than the classic global b...
International audienceWe study the impact of communication latency on the classical Work Stealing lo...
This paper addresses the problem of efficiently supporting parallelism within a managed runtime. A p...
We study the problem of minimizing total completion time on parallel machines subject to varying pro...
International audienceThis paper studies the performance of parallel stream computations on a multip...
We study the problem of parallel stream computations on a multiprocessor architecture. Modelling the...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
This paper investigates a variant of the work-stealing algorithm that we call the localized work-ste...
In this paper, we consider a generic model of computational grids, seen as several clusters of homog...
This paper studies the data locality of the work-stealing scheduling algorithm on hardware-controlle...
We present an adaptive work-stealing thread scheduler, A-STEAL, for fork-join multithreaded jobs, li...
Blumofe and Leiserson [6] gave the first provably good work-stealing work scheduler for mul-tithread...
Blumofe and Leiserson [6] gave the first provably good work-stealing work scheduler for mul-tithread...
Load balancing is a technique which allows efficient parallelization of irregular workloads, and a k...
International audienceIn this paper, we consider a generic model of computational grids, seen as sev...
The rise of many-core processor architectures in the market answers to a constantly growing need of ...
International audienceWe study the impact of communication latency on the classical Work Stealing lo...
This paper addresses the problem of efficiently supporting parallelism within a managed runtime. A p...
We study the problem of minimizing total completion time on parallel machines subject to varying pro...
International audienceThis paper studies the performance of parallel stream computations on a multip...
We study the problem of parallel stream computations on a multiprocessor architecture. Modelling the...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
This paper investigates a variant of the work-stealing algorithm that we call the localized work-ste...
In this paper, we consider a generic model of computational grids, seen as several clusters of homog...
This paper studies the data locality of the work-stealing scheduling algorithm on hardware-controlle...
We present an adaptive work-stealing thread scheduler, A-STEAL, for fork-join multithreaded jobs, li...
Blumofe and Leiserson [6] gave the first provably good work-stealing work scheduler for mul-tithread...
Blumofe and Leiserson [6] gave the first provably good work-stealing work scheduler for mul-tithread...
Load balancing is a technique which allows efficient parallelization of irregular workloads, and a k...
International audienceIn this paper, we consider a generic model of computational grids, seen as sev...
The rise of many-core processor architectures in the market answers to a constantly growing need of ...
International audienceWe study the impact of communication latency on the classical Work Stealing lo...
This paper addresses the problem of efficiently supporting parallelism within a managed runtime. A p...
We study the problem of minimizing total completion time on parallel machines subject to varying pro...