Abstract: This paper studies the impact of automatic data-layout techniques on the process of coding the well-known multigrid MG NAS parallel benchmark. We describe the sequential problem in detail and discuss the parallel version and its optimizations. We then implement the parallel algorithm using Hitmap, a highly efficient modular library for hierarchical tiling and mapping of arrays. We describe how to use the library's plug-in system to add a new data-layout module that encapsulates a generalization of the data-alignment policy of the MG benchmark. The module system applies this policy to automatically adapt the data distribution and communication code to any grain level. The impact of using these techniques is qualitatively and ...
This thesis presents a generalized framework for the mapping and remapping of large regularly-gridd...
Problem topology is the key to efficient parallelization support for partially regular applications....
The scalable implementation of multigrid methods for machines with several thousands of processors i...
Parallel architectures with physically distributed memory providing computing cycles and large amoun...
An approach to programming distributed memory-parallel machines that has recently become popular is ...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
Layout methods for dense and sparse data are often seen as two separate problems with its own partic...
The goal of languages like Fortran D or High Performance Fortran (HPF) is to provide a simple yet ef...
In this paper we present a decentralized remapping method for data parallel applications on distribu...
High Performance Fortran (HPF) is rapidly gaining acceptance as a language for parallel programming....
In this paper we present a method to obtain a set of candidate distributions for a program fragment....
Abstract Layout methods for dense and sparse data are often seen as two separate problems with their...
160 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1992.Distributed-memory parallel c...
Increased programmability for concurrent applications in distributed systems requires automatic supp...