Porting scientific applications to parallel machines is one of the major challenges facing scientists and computing professionals over the coming years. Existing codes span many thousands of lines and are often converted for parallel execution by someone other than the original author. Good tools are required to assist this process as much as possible. Two key design decisions on distributed-memory computers are how parallel data structures are distributed across the machine and how processes are allocated to processors. This paper describes a tool called DCompose, which allows a user to rapidly determine the effect of different data decomposition and process allocation strategies. Using DCompose, imp...
An approach to programming distributed-memory parallel machines that has recently become popular is ...
Parallel computing can take many forms. From a user's perspective, it is important to consider the a...
The parallelization of real-world compute-intensive Fortran application codes is generally not a tri...
Distributed-memory multiprocessing systems (DMS), such as Intel’s hypercubes, the Paragon, Thinking ...
On shared-memory parallel computers (SMPCs), it is natural to focus on decomposing the computation (...
Shared-memory multiprocessor systems can achieve high performance levels when appropriate work paral...
The performance of a computer system is important. One way of improving performance is to use multip...
Distributed Memory Multicomputers (DMMs) such as the IBM SP-2, the Intel Paragon and the Thinking Ma...
Despite the performance potential of parallel systems, several factors have hindered their widesprea...
This dissertation addresses creating portable and efficient parallel programs for scientific computi...
Introduction: In general, a parallel computer is a computer that has multiple processors connected b...
Reduction recognition and optimization are crucial techniques in parallelizing compilers. They are u...
One method to evaluate a distributed shared memory (DSM) system is to analyze its performance for a v...