To compile programs for message passing architectures and to obtain good performance on NUMA architectures it is necessary to control how computations and data are mapped to processors. Languages such as High-Performance Fortran use data distributions supplied by the programmer and the owner computes rule to specify this. However, the best data and computation decomposition may differ from machine to machine and require substantial expertise to determine. Therefore, automated decomposition is desirable. All existing methods for automated data/computation decomposition share a common failing: they are very sensitive to the original loop structure of the program. While they find a good decomposition for that loop structure, it may...
[[abstract]]Intensive scientific algorithms can usually be formulated as nested loops which are the ...
An approach to distributed memory parallel programming that has recently become popular is one wher...
Loops are the main source of parallelism in scientific programs. Hence, several techniques were dev...
Data distribution functions are introduced. They are matced with scheduling functions. The processor...
The general problem studied is that of segmenting or partitioning programs for distribution across a...
On shared memory parallel computers (SMPCs) it is natural to focus on decomposing the computation (...
Scalable shared-memory multiprocessor systems are typically NUMA (nonuniform memory access) machines...
Communication overhead in multiprocessor systems, as exemplified by cache coherency traffic and glob...
We describe an approach to parallel compilation that seeks to harness the vast amount of fine-grain ...
technical reportThe inherently asynchronous nature of the data flow computation model allows the exp...
Data parallel languages like High Performance Fortran (HPF) are emerging as the architecture indepen...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
In this paper, we develop an automatic compile-time computation and data decomposition technique for...
Data-parallel languages allow programmers to use the familiar machine-independent programming style ...
Researchers explore an approach to using general purpose parallel computers which involves mapping h...
[[abstract]]Intensive scientific algorithms can usually be formulated as nested loops which are the ...
An approach to distributed memory parallel programming that has recently become popular is one wher...
Loops are the main source of parallelism in scientific programs. Hence, several techniques were dev...
Data distribution functions are introduced. They are matced with scheduling functions. The processor...
The general problem studied is that of segmenting or partitioning programs for distribution across a...
On shared memory parallel computers (SMPCs) it is natural to focus on decomposing the computation (...
Scalable shared-memory multiprocessor systems are typically NUMA (nonuniform memory access) machines...
Communication overhead in multiprocessor systems, as exemplified by cache coherency traffic and glob...
We describe an approach to parallel compilation that seeks to harness the vast amount of fine-grain ...
technical reportThe inherently asynchronous nature of the data flow computation model allows the exp...
Data parallel languages like High Performance Fortran (HPF) are emerging as the architecture indepen...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
In this paper, we develop an automatic compile-time computation and data decomposition technique for...
Data-parallel languages allow programmers to use the familiar machine-independent programming style ...
Researchers explore an approach to using general purpose parallel computers which involves mapping h...
[[abstract]]Intensive scientific algorithms can usually be formulated as nested loops which are the ...
An approach to distributed memory parallel programming that has recently become popular is one wher...
Loops are the main source of parallelism in scientific programs. Hence, several techniques were dev...