In the data parallel programming style the user usually specifies the data parallelism explicitly so that the compiler can generate efficient code without enhanced analysis techniques. In some situations it is not possible to specify the parallelism explicitly or this might be not very convenient. This is especially true for loop nests with data dependences between the data of distributed dimensions. In the case of uniform loop nests there are scheduling, mapping and partitioning techniques available. Some different strategies have been considered and evaluated with existing High Performance Fortran compilation systems. This paper gives some experimental results about the performance and the benefits of the different techniques and optimiza...
This paper describes a compilation approach for a Fortran 90D/HPF compiler, a source-to-source paral...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
Abstract. Multipartitioning is a skewed-cyclic block distribution that yields better parallel effici...
In the data parallel programming style the user usually specifies the data parallelism explicitly so...
In the data parallel programming style the user usually speci es the data parallelism explicitly so ...
Over the past few decades, scientific research has grown to rely increasingly on simulation and othe...
Automatic parallelization is one of the approaches aimed at a better and easier use of parallel comp...
Loops represent the core of most applications in that they contain the bulk of the computations. An ...
. Data-parallel languages, in particular HPF, provide a highlevel view of operators overs parallel d...
High Performance Fortran (HPF), as well as its predecessor FortranD, has attracted considerable atte...
This paper describes two new ideas by which an HPF compiler can deal with irregular computations eff...
In this paper, we present the first, preliminary results of HPF/Fortran-D language analysis based on...
High Performance Fortran (HPF), as well as its predecessor FortranD,has attracted considerable atten...
High performance Fortran (HPF), as well as its predecessor FortranD, has attracted considerable atte...
High Performance Fortran (HPF) is rapidly gaining acceptance as a language for parallel programming....
This paper describes a compilation approach for a Fortran 90D/HPF compiler, a source-to-source paral...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
Abstract. Multipartitioning is a skewed-cyclic block distribution that yields better parallel effici...
In the data parallel programming style the user usually specifies the data parallelism explicitly so...
In the data parallel programming style the user usually speci es the data parallelism explicitly so ...
Over the past few decades, scientific research has grown to rely increasingly on simulation and othe...
Automatic parallelization is one of the approaches aimed at a better and easier use of parallel comp...
Loops represent the core of most applications in that they contain the bulk of the computations. An ...
. Data-parallel languages, in particular HPF, provide a highlevel view of operators overs parallel d...
High Performance Fortran (HPF), as well as its predecessor FortranD, has attracted considerable atte...
This paper describes two new ideas by which an HPF compiler can deal with irregular computations eff...
In this paper, we present the first, preliminary results of HPF/Fortran-D language analysis based on...
High Performance Fortran (HPF), as well as its predecessor FortranD,has attracted considerable atten...
High performance Fortran (HPF), as well as its predecessor FortranD, has attracted considerable atte...
High Performance Fortran (HPF) is rapidly gaining acceptance as a language for parallel programming....
This paper describes a compilation approach for a Fortran 90D/HPF compiler, a source-to-source paral...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
Abstract. Multipartitioning is a skewed-cyclic block distribution that yields better parallel effici...