This is the full version of the paper, which includes more detailed proofs than the conference version.International audienceWork stealing has proven to be an effective method for scheduling parallel programs on multicore computers. To achieve high performance, work stealing distributes tasks between concurrent queues, called deques, which are assigned to each processor. Each processor operates on its deque locally except when performing load balancing via steals. Unfortunately, concurrent deques suffer from two limitations: 1) local deque operations require expensive memory fences in modern weak-memory architectures, 2) they can be very difficult to extend to support various optimizations and flexible forms of task distribution strategies ...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Computational task DAGs are executed on parallel computers by a task scheduling algorithm. Intellige...
In this paper we propose new insights into the problem of concurrently scheduling threads through ma...
This is the full version of the paper, which includes more detailed proofs than the conference versi...
Work-stealing is an efficient method to implement load balancing in fine-grained task parallelism. T...
Load balancing is a technique which allows efficient parallelization of irregular workloads, and a k...
International audienceChase and Lev's concurrent deque is a key data structure in shared- memory par...
Task-centric programming models offer a versatile method for exposing parallelism. Such programs are...
The fork-join paradigm of concurrent expression has gained popularity in conjunction with work-steal...
This paper investigates a variant of the work-stealing algorithm that we call the localized work-ste...
Abstract—This paper analyzes the overhead due to false sharing when parallel tasks are scheduled usi...
This paper addresses the problem of efficiently supporting parallelism within a managed runtime. A p...
Work-stealing systems are typically oblivious to the nature of the tasks they are scheduling. They d...
We prove the correctness of the concurrent deque component of a recent implementation of the work-st...
This thesis addresses the problem of scheduling multiple, concurrent, adaptively par-allel jobs on a...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Computational task DAGs are executed on parallel computers by a task scheduling algorithm. Intellige...
In this paper we propose new insights into the problem of concurrently scheduling threads through ma...
This is the full version of the paper, which includes more detailed proofs than the conference versi...
Work-stealing is an efficient method to implement load balancing in fine-grained task parallelism. T...
Load balancing is a technique which allows efficient parallelization of irregular workloads, and a k...
International audienceChase and Lev's concurrent deque is a key data structure in shared- memory par...
Task-centric programming models offer a versatile method for exposing parallelism. Such programs are...
The fork-join paradigm of concurrent expression has gained popularity in conjunction with work-steal...
This paper investigates a variant of the work-stealing algorithm that we call the localized work-ste...
Abstract—This paper analyzes the overhead due to false sharing when parallel tasks are scheduled usi...
This paper addresses the problem of efficiently supporting parallelism within a managed runtime. A p...
Work-stealing systems are typically oblivious to the nature of the tasks they are scheduling. They d...
We prove the correctness of the concurrent deque component of a recent implementation of the work-st...
This thesis addresses the problem of scheduling multiple, concurrent, adaptively par-allel jobs on a...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Computational task DAGs are executed on parallel computers by a task scheduling algorithm. Intellige...
In this paper we propose new insights into the problem of concurrently scheduling threads through ma...