Data parallel compilers have long aimed to equal the performance of carefully hand-optimized parallel codes. For tightly-coupled applications based on line sweeps, this goal has been particularly elusive. In the Rice dHPF compiler, we have developed a wide spectrum of optimizations that enable us to closely approach hand-coded performance for tightly-coupled line sweep applications including the NAS SPand BT benchmark codes. From lightly-modified copies of standard serial versions of these benchmarks, dHPF generates MPI-based parallel code that is within 4 % of the performance of the hand-crafted MPI implementations of these codes for a 102 3 problem size (Class B) on 64 processors. We describe and quantitatively evaluate the impact of part...
High Performance Fortran (HPF) does not allow ecient expression of mixed task/data-parallel computat...
This paper describes methods to adapt existing optimizing compilers for sequential languages to prod...
Data-parallel languages, such as H scIGH P scERFORMANCE F scORTRAN or F scORTRAN D, provide a machin...
Multipartitioning is a skewed-cyclic block distribution that yields better parallel efficiency and s...
Abstract. Multipartitioning is a skewed-cyclic block distribution that yields better parallel effici...
Over the past few decades, scientific research has grown to rely increasingly on simulation and othe...
Communication coalescing is a static optimization that can reduce both communication frequency and r...
Reduction recognition and optimization are crucial techniques in parallelizing compilers. They are u...
In this paper, we propose and evaluate practical, automatic techniques that exploit compiler analysi...
In the data parallel programming style the user usually speci es the data parallelism explicitly so ...
In the data parallel programming style the user usually specifies the data parallelism explicitly so...
The data-parallel language High Performance Fortran (HPF) does not allow efficient expression of mix...
Effective memory hierarchy utilization is critical to the performance of modern multiprocessor archi...
(eng) Multipartitioning is a strategy for partitioning multi-dimensional arrays on a collection of p...
A portable program executes on different platforms and yields consistent performance. With the focus...
High Performance Fortran (HPF) does not allow ecient expression of mixed task/data-parallel computat...
This paper describes methods to adapt existing optimizing compilers for sequential languages to prod...
Data-parallel languages, such as H scIGH P scERFORMANCE F scORTRAN or F scORTRAN D, provide a machin...
Multipartitioning is a skewed-cyclic block distribution that yields better parallel efficiency and s...
Abstract. Multipartitioning is a skewed-cyclic block distribution that yields better parallel effici...
Over the past few decades, scientific research has grown to rely increasingly on simulation and othe...
Communication coalescing is a static optimization that can reduce both communication frequency and r...
Reduction recognition and optimization are crucial techniques in parallelizing compilers. They are u...
In this paper, we propose and evaluate practical, automatic techniques that exploit compiler analysi...
In the data parallel programming style the user usually speci es the data parallelism explicitly so ...
In the data parallel programming style the user usually specifies the data parallelism explicitly so...
The data-parallel language High Performance Fortran (HPF) does not allow efficient expression of mix...
Effective memory hierarchy utilization is critical to the performance of modern multiprocessor archi...
(eng) Multipartitioning is a strategy for partitioning multi-dimensional arrays on a collection of p...
A portable program executes on different platforms and yields consistent performance. With the focus...
High Performance Fortran (HPF) does not allow ecient expression of mixed task/data-parallel computat...
This paper describes methods to adapt existing optimizing compilers for sequential languages to prod...
Data-parallel languages, such as H scIGH P scERFORMANCE F scORTRAN or F scORTRAN D, provide a machin...