If parallelism can be successfully exploited in a pro-gram, signicant reductions in execution time can be achieved. However, if sections of the code are domi-nated by parallel overheads, the overall program perfor-mance can degrade. We propose a framework, based on an inspector-executor model, for identifying loops that are dominated by parallel overheads and dynamically serializing these loops. We implement this framework in the Polaris parallelizing compiler and evaluate two portable methods for classifying loops as protable or unpro table. We show that for six benchmark programs from the Perfect Club and SPEC 95 suites, parallel pro-gram execution times can be improved by as much as 85 % on 16 processors of an Origin 2000.
Multiprocessor computers are rapidly becoming the norm. Parallel workstations are widely available t...
[[abstract]]The main function of parallelizing compilers is to analyze sequential programs, in parti...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
It is the goal of the Polaris project to develop a new parallelizing compiler that will overcome li...
Previous research has shown existence of a huge potential of the coarse-grain parallelism in program...
Abstract. Dynamic compilation is becoming a dominant compilation technique. Runtime compilation has ...
Automatic parallelization techniques for finding loop-based parallelism fail to find efficient paral...
It is the goal of the Polaris project to develop a new parallelizing compiler that will overcome lim...
When the inter-iteration dependency pattern of the iterations of a loop cannot be determined statica...
Even fully parallel sharedmemory program sections may perform signicantly be low the ideal speedup o...
Modern computers will increasingly rely on parallelism to achieve high computation rates. Techniques...
Current parallelizing compilers do a reasonable job of extracting parallelism from programs with reg...
International audienceThis paper describes a tool using one or more executions of a sequential progr...
The goal of parallelizing, or restructuring, compilers is to detect and exploit parallelism in seque...
[[abstract]]A run-time technique based on the inspector-executor scheme is proposed in this paper to...
Multiprocessor computers are rapidly becoming the norm. Parallel workstations are widely available t...
[[abstract]]The main function of parallelizing compilers is to analyze sequential programs, in parti...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
It is the goal of the Polaris project to develop a new parallelizing compiler that will overcome li...
Previous research has shown existence of a huge potential of the coarse-grain parallelism in program...
Abstract. Dynamic compilation is becoming a dominant compilation technique. Runtime compilation has ...
Automatic parallelization techniques for finding loop-based parallelism fail to find efficient paral...
It is the goal of the Polaris project to develop a new parallelizing compiler that will overcome lim...
When the inter-iteration dependency pattern of the iterations of a loop cannot be determined statica...
Even fully parallel sharedmemory program sections may perform signicantly be low the ideal speedup o...
Modern computers will increasingly rely on parallelism to achieve high computation rates. Techniques...
Current parallelizing compilers do a reasonable job of extracting parallelism from programs with reg...
International audienceThis paper describes a tool using one or more executions of a sequential progr...
The goal of parallelizing, or restructuring, compilers is to detect and exploit parallelism in seque...
[[abstract]]A run-time technique based on the inspector-executor scheme is proposed in this paper to...
Multiprocessor computers are rapidly becoming the norm. Parallel workstations are widely available t...
[[abstract]]The main function of parallelizing compilers is to analyze sequential programs, in parti...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...