An important problem facing numerous research projects on parallelizing compilers for distributed memory machines is that of automatically determining a suitable data partitioning scheme for a program. Any strategy for automatic data partitioning needs a mechanism for estimating the performance of a program under a given partitioning scheme, the most crucial part of which involves determining the communication costs incurred by the program. A methodology is described for estimating the communication costs at compile time as functions of the numbers of processors over which various arrays are distributed. A strategy, along with its theoretical basis, is described for making program transformations that expose opportunities for combining of me...
A variety of historically-proven computer languages have recently been extended to support parallel ...
Data distribution functions are introduced. They are matched with scheduling functions. The processor...
In distributed memory multicomputers, local memory accesses are much faster than those i...
Coordinated Science Laboratory was formerly known as Control Systems Laboratory. Office of Naval Resea...
Most of the current compiler projects for distributed memory architectures leave the critical and ti...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
Distributed-memory multicomputers, such as the Intel iPSC/860, the Intel Paragon, the IBM SP-1/SP-2...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
Communication overhead in multiprocessor systems, as exemplified by cache coherency traffic and glob...
An approach to distributed memory parallel programming that has recently become popular is one wher...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
Data-parallel languages allow programmers to use the familiar machine-independent programming style ...
Distributed-memory parallel computers are increasingly being used to provide high levels of performa...
Multicomputers (distributed memory MIMD machines) have emerged as inexpensive, yet powerful parallel...
160 p. Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1992. Distributed-memory parallel c...