We present an intermediate representation of a program called the Alignment-Distribution Graph that exposes the communication requirements of the program. The representation exploits ideas developed in the static single assignment form of programs, but is tailored for communication optimization. It serves as the basis for algorithms that map the array data and program computation to the nodes of a distributed-memory parallel computer so as to minimize completion time. We describe the details of the representation, explain its construction from source text, show its use in modeling communication cost, outline several algorithms for determining mappings that approximately minimize residual communication, and compare it with other relate...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
Implementing a data-parallel language such as Fortran 90 on a distributed-memory parallel computer r...
Axis and stride alignment is an important optimization in compiling data-parallel programs for distr...
We consider distribution at compile time of the array data in a distributed-memory implementation of...
The assignment of processes to processors (the mapping problem) is one of the major factors affectin...
This paper describes an automatic data distribution method which deals with both the alignment and th...
When a data-parallel language like Fortran 90 is compiled for a distributed-memory machine, aggregat...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
Aggregate data objects (such as arrays) are distributed across the processor memories when compiling...
When a data-parallel language like FORTRAN 90 is compiled for a distributed-memory machine, aggregat...