Abstract. We present a work-stealing algorithm for runtime scheduling of data-parallel operations in the context of shared-memory architectures on data sets with highly-irregular workloads that are not known a priori to the scheduler. This scheduler can parallelize loops and operations expressible with a parallel reduce or a parallel scan. The scheduler is based on the work-stealing tree data structure, which allows workers to decide on the work division in a lock-free, workload-driven manner and attempts to minimize the amount of communication between them. A significant effort is given to showing that the algorithm has the least possible amount of overhead. We provide an extensive experimental evaluation, comparing the advantages and shor...
Work-stealing systems are typically oblivious to the nature of the tasks they are scheduling. They d...
In this paper we propose new insights into the problem of concurrently scheduling threads through ma...
Lazy-task creation is an efficient method of overcoming the overhead of the grain-size problem in pa...
Abstract. We present a work-stealing algorithm for runtime scheduling of data-parallel operations in...
The fork-join paradigm of concurrent expression has gained popularity in conjunction with work-steal...
Work-stealing is a promising approach for effectively exploiting software parallelism on parallel ha...
Abstract—Load balancing techniques (e.g. work stealing) are important to obtain the best performance...
Work-stealing systems are typically oblivious to the nature of the tasks they are scheduling. They d...
Load balancing is a technique which allows efficient parallelization of irregular workloads, and a k...
This paper studies the problem of eciently scheduling fully strict (i.e., well-structured) multithre...
Blumofe and Leiserson [6] gave the first provably good work-stealing work scheduler for mul-tithread...
We present an adaptive work-stealing thread scheduler, A-STEAL, for fork-join multithreaded jobs, li...
This paper studies the data locality of the work-stealing scheduling algorithm on hardware-controlle...
Work-stealing systems are typically oblivious to the nature of the tasks theyare scheduling. For ins...
Task-centric programming models offer a versatile method for exposing parallelism. Such programs are...
Work-stealing systems are typically oblivious to the nature of the tasks they are scheduling. They d...
In this paper we propose new insights into the problem of concurrently scheduling threads through ma...
Lazy-task creation is an efficient method of overcoming the overhead of the grain-size problem in pa...
Abstract. We present a work-stealing algorithm for runtime scheduling of data-parallel operations in...
The fork-join paradigm of concurrent expression has gained popularity in conjunction with work-steal...
Work-stealing is a promising approach for effectively exploiting software parallelism on parallel ha...
Abstract—Load balancing techniques (e.g. work stealing) are important to obtain the best performance...
Work-stealing systems are typically oblivious to the nature of the tasks they are scheduling. They d...
Load balancing is a technique which allows efficient parallelization of irregular workloads, and a k...
This paper studies the problem of eciently scheduling fully strict (i.e., well-structured) multithre...
Blumofe and Leiserson [6] gave the first provably good work-stealing work scheduler for mul-tithread...
We present an adaptive work-stealing thread scheduler, A-STEAL, for fork-join multithreaded jobs, li...
This paper studies the data locality of the work-stealing scheduling algorithm on hardware-controlle...
Work-stealing systems are typically oblivious to the nature of the tasks theyare scheduling. For ins...
Task-centric programming models offer a versatile method for exposing parallelism. Such programs are...
Work-stealing systems are typically oblivious to the nature of the tasks they are scheduling. They d...
In this paper we propose new insights into the problem of concurrently scheduling threads through ma...
Lazy-task creation is an efficient method of overcoming the overhead of the grain-size problem in pa...