An important part of a parallelizing compiler is the restructuring phase, which extracts parallelism from a sequential program. We consider an important restructuring transformation, called cycle shrinking [1], which partitions the iteration space of a loop so that the iterations within each partition can be executed in parallel. We propose a new cycle shrinking transformation, called extended cycle shrinking, which is an improvement over the existing methods. We present the conditions under which our method can be applied, and give an algorithm which performs this transformation. Further, we present results to show that our method always leads to a minimal number of partitions, whereas the earlier methods do not. Thus our algorithm is, in ...
A new program restructuring algorithm aimed at reducing the working set size of a program executing ...
(eng) In this paper, we survey loop parallelization algorithms, analyzing the dependence representat...
Over the past 20 years, increases in processor speed have dramatically outstripped performance incre...
An important part of a parallelizing compiler is the restructuring phase, which extracts parallelism...
In this tutorial, we address the problem of restructuring a (possibly sequential) program to improve...
Abstract In this paper, an approach to the problem of exploiting parallelism within nested loops is ...
We present a transformational system for extracting parallelism from programs. Our transformations g...
Developing efficient programs for many of the current parallel computers is not easy due to the arch...
Parallelizing compilers promise to exploit the parallelism available in a given program, particularl...
Reduction recognition and optimization are crucial techniques in parallelizing compilers. They are u...
It is well-known that, to optimize a program for speed-up, efforts should be focused on the regions ...
Modern computers will increasingly rely on parallelism to achieve high computation rates. Techniques...
In a sequential program, data are often structured in a way that is optimized for a sequential execu...
The paper extends the framework of linear loop transformations adding a new nonlinear step at the tr...
Reduction in strength is a traditional transformation for speeding up loop execution on sequential p...
A new program restructuring algorithm aimed at reducing the working set size of a program executing ...
(eng) In this paper, we survey loop parallelization algorithms, analyzing the dependence representat...
Over the past 20 years, increases in processor speed have dramatically outstripped performance incre...
An important part of a parallelizing compiler is the restructuring phase, which extracts parallelism...
In this tutorial, we address the problem of restructuring a (possibly sequential) program to improve...
Abstract In this paper, an approach to the problem of exploiting parallelism within nested loops is ...
We present a transformational system for extracting parallelism from programs. Our transformations g...
Developing efficient programs for many of the current parallel computers is not easy due to the arch...
Parallelizing compilers promise to exploit the parallelism available in a given program, particularl...
Reduction recognition and optimization are crucial techniques in parallelizing compilers. They are u...
It is well-known that, to optimize a program for speed-up, efforts should be focused on the regions ...
Modern computers will increasingly rely on parallelism to achieve high computation rates. Techniques...
In a sequential program, data are often structured in a way that is optimized for a sequential execu...
The paper extends the framework of linear loop transformations adding a new nonlinear step at the tr...
Reduction in strength is a traditional transformation for speeding up loop execution on sequential p...
A new program restructuring algorithm aimed at reducing the working set size of a program executing ...
(eng) In this paper, we survey loop parallelization algorithms, analyzing the dependence representat...
Over the past 20 years, increases in processor speed have dramatically outstripped performance incre...