When the inter-iteration dependency pattern of the iterations of a loop cannot be determined statically, compile time parallelization of the loop is not possible. In these cases, runtime parallelization [8] is the only alternative. The idea is to transform the loop into two code fragments: the inspector and the executor. When the program is run, the inspector examines the iteration dependencies and constructs a parallel schedule. The executor subsequently uses that schedule to carry out the actual computation in parallel. In this paper, we show how to reduce the overhead of running the inspector through its parallel execution. We describe two related approaches. The first, which emphasizes inspector efficiency, achieves nearly linear speedu...
This work leverages an original dependency analysis to parallelize loops regardless of their form i...
Abstract—Parallelization and locality optimization of affine loop nests has been successfully addres...
This is a post-peer-review, pre-copyedit version of an article published. The final authenticated ve...
[[abstract]]A run-time technique based on the inspector-executor scheme is proposed in this paper to...
In this paper, we present two new run-time algorithms for the parallelization of loops that have ind...
Current parallelizing compilers do a reasonable job of extracting parallelism from programs with reg...
In this paper we give a new run--time technique for finding an optimal parallel execution schedule f...
Parallelizing compilers promise to exploit the parallelism available in a given program, particularl...
The goal of parallelizing, or restructuring, compilers is to detect and exploit parallelism in seque...
While automatic parallelization of loops usually relies on compile-time analysis of data dependences...
This is a post-peer-review, pre-copyedit version of an article published in Lecture Notes in Compute...
The class of problems that can be effectively compiled by parallelizing compilers is discussed. This...
Abstract. A loop with irregular assignment computations contains loopcarried output data dependences...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
This work leverages an original dependency analysis to parallelize loops regardless of their form i...
Abstract—Parallelization and locality optimization of affine loop nests has been successfully addres...
This is a post-peer-review, pre-copyedit version of an article published. The final authenticated ve...
[[abstract]]A run-time technique based on the inspector-executor scheme is proposed in this paper to...
In this paper, we present two new run-time algorithms for the parallelization of loops that have ind...
Current parallelizing compilers do a reasonable job of extracting parallelism from programs with reg...
In this paper we give a new run--time technique for finding an optimal parallel execution schedule f...
Parallelizing compilers promise to exploit the parallelism available in a given program, particularl...
The goal of parallelizing, or restructuring, compilers is to detect and exploit parallelism in seque...
While automatic parallelization of loops usually relies on compile-time analysis of data dependences...
This is a post-peer-review, pre-copyedit version of an article published in Lecture Notes in Compute...
The class of problems that can be effectively compiled by parallelizing compilers is discussed. This...
Abstract. A loop with irregular assignment computations contains loopcarried output data dependences...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
This work leverages an original dependency analysis to parallelize loops regardless of their form i...
Abstract—Parallelization and locality optimization of affine loop nests has been successfully addres...
This is a post-peer-review, pre-copyedit version of an article published. The final authenticated ve...