This paper proposes an efficient run-time system to schedule general nested loops on multiprocessors. The work extends existing one-dimensional loop scheduling strategies such as static scheduling, affinity scheduling, and various dynamic scheduling methods. The extensions are twofold. First, multiple independent loops, as found in different branches of parbegin/parend constructs, can be scheduled simultaneously. Second, multidimensional loops with dependencies and conditions can be aggressively scheduled. Scheduling multidimensional loops with dependencies is made possible by providing a dependence vector as an input to the scheduler. Based on this application-specific input, a continuation-passing run-time system using non-blo...
Chain-based scheduling [1] is an efficient partitioning and scheduling scheme for nested loops on di...
Using runtime information of load distributions and processor affinity, we propose an adapt...
Task dataflow languages simplify the specification of parallel programs by dynamically detecting and...
This article studies the scheduling problem of a set of tasks with time or data constraints...
We consider the problem of scheduling parallel loops that are characterized by highly varying execut...
Three related problems, among others, are faced when trying to execute an algorithm on a parallel ma...
Distributed Computing Systems are a viable and less expensive alternative to parallel com...
In this paper, we present Dynamic Bisectioning or DBS, a simple but powerful comprehensive scheduli...
Fine-grain parallelism available in VLIW and superscalar processors can be mainly exploited in compu...
In this paper, we present two new run-time algorithms for the parallelization of loops that have ind...
Using parallel processing systems to execute scientific applications is one of the most common solut...
It is extremely difficult to parallelize DOACROSS loops with non-uniform loop-carried dependences. I...