This paper addresses the problems of communication -free partitions of statement-iterations of nested loops and data accessed by these statement-iterations. Communication-free hyperplane partitions of disjoint subsets of data and statement-iterations are considered. This approach is more possible than existing methods in finding the data and program distribution patterns that can cause the processor fully-parallel execution without any interprocessor communication. In addition, this approach can be applied to the more general loop models such as the imperfectly nested loop or multiple imperfectly nested loops, not like most of the existing methods that can be applied only to the perfectly nested loop or multiple perfectly nested loops. Due ...
Data-parallel languages allow programmers to use the familiar machine-independent programming style ...
This paper addresses the problem of partitioning data for distributed memory machines or multicomput...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
[[abstract]]In distributed memory multicomputers, local memory accesses are much faster than those i...
[[abstract]]Intensive scientific algorithms can usually be formulated as nested loops which are the ...
[[abstract]]Minimizing interprocessor communication is the key to a parallelized program on executio...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
Abstract In this paper, an approach to the problem of exploiting parallelism within nested loops is ...
In this paper, we present an efficient framework for intraprocedural performance based program parti...
Automatic Global Data Partitioning for Distributed Memory Machines (DMMs) is a difficult problem. Di...
Communication overhead in multiprocessor systems, as exemplified by cache coherency traffic and glob...
In this paper, we develop an automatic compile-time computation and data decomposition technique for...
This papers presents an approach to statement-level independent partitioning of uniform recurrences,...
Computation partition is one of the most important problems in parallel compilation and optimization...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
Data-parallel languages allow programmers to use the familiar machine-independent programming style ...
This paper addresses the problem of partitioning data for distributed memory machines or multicomput...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
[[abstract]]In distributed memory multicomputers, local memory accesses are much faster than those i...
[[abstract]]Intensive scientific algorithms can usually be formulated as nested loops which are the ...
[[abstract]]Minimizing interprocessor communication is the key to a parallelized program on executio...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
Abstract In this paper, an approach to the problem of exploiting parallelism within nested loops is ...
In this paper, we present an efficient framework for intraprocedural performance based program parti...
Automatic Global Data Partitioning for Distributed Memory Machines (DMMs) is a difficult problem. Di...
Communication overhead in multiprocessor systems, as exemplified by cache coherency traffic and glob...
In this paper, we develop an automatic compile-time computation and data decomposition technique for...
This papers presents an approach to statement-level independent partitioning of uniform recurrences,...
Computation partition is one of the most important problems in parallel compilation and optimization...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
Data-parallel languages allow programmers to use the familiar machine-independent programming style ...
This paper addresses the problem of partitioning data for distributed memory machines or multicomput...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...