We present an algorithm for mapping an arbitrary multidimensional array onto an arbitrarily shaped multidimensional nearest-neighbor network of a distributed-memory machine. The individual dimensions of the array are labeled with high-level usage descriptors that can either be provided by the programmer or derived by sophisticated static compiler analysis. The algorithm makes appropriate use of nearest-neighbor communication and allows for efficient address calculations. We describe the integration of this technique into an optimizing compiler for Modula-2* and derive extensions that make efficient translation of nested parallelism possible and provide some support for thread scheduling.
1 Introduction ...
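To make the idea of mapping array dimensions onto a nearest-neighbor network more concrete, the following is a minimal sketch of a plain block distribution with its address calculation. It is not the algorithm of the paper: it ignores the usage descriptors entirely and simply assumes that array dimension k is split into equal blocks over mesh dimension k. The function names (`block_sizes`, `owner_and_offset`, `mesh_distance`) are hypothetical and chosen only for illustration.

```python
def block_sizes(array_shape, mesh_shape):
    """Block length per dimension (ceiling division), assuming array
    dimension k is distributed over mesh dimension k."""
    return tuple(-(-n // p) for n, p in zip(array_shape, mesh_shape))


def owner_and_offset(index, array_shape, mesh_shape):
    """Return (processor coordinates, local linear offset) for a global index.

    The local offset is a row-major address inside the processor's block.
    """
    blocks = block_sizes(array_shape, mesh_shape)
    proc = tuple(i // b for i, b in zip(index, blocks))
    local = tuple(i % b for i, b in zip(index, blocks))
    offset = 0
    for l, b in zip(local, blocks):
        offset = offset * b + l
    return proc, offset


def mesh_distance(p, q):
    """Number of nearest-neighbor hops between two processors in a mesh
    without wrap-around links (Manhattan distance)."""
    return sum(abs(a - b) for a, b in zip(p, q))


if __name__ == "__main__":
    shape, mesh = (8, 8), (2, 4)
    a = owner_and_offset((5, 3), shape, mesh)
    b = owner_and_offset((5, 4), shape, mesh)
    print(a, b, mesh_distance(a[0], b[0]))
```

Under this assumed distribution, elements that are adjacent in the array live either on the same processor or on processors one hop apart, which is the property a nearest-neighbor mapping aims to preserve. Owner and offset are computed with only integer division and remainder, so the address calculation stays cheap.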
In scalable multiprocessor systems, high performance demands that computational load be balanced eve...
Appropriate data distribution has been found to be critical for obtaining good performance on Distri...
We present algorithms for the transportation of data in parallel and distributed systems that would ...
Data distribution is one of the key aspects that a parallelizing compiler for a distributed memory a...
This paper presents a technique for finding good distributions of arrays and suitable loop restructu...
Array redistribution is required often in programs on distributed memory parallel computer...
Data-parallel languages, such as High Performance Fortran, are designed to make programming of distr...
An important problem facing parallelizing compilers for distributed memory MIMD machines is that of ...
In this paper, we develop an automatic compile-time computation and data decomposition technique for...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
An approach to programming distributed memory-parallel machines that has recently become popular is ...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
This paper addresses the problem of partitioning data for distributed memory machines or multicomput...
In distributed memory multicomputers, local memory accesses are much faster than those i...