We compare the performance of three major programming models— a load-store cache-coherent shared address space (CC-SAS), message passing (MP) and the segmented SHMEM model—on a modern, 64-processor hardware cache-coherent machine, one of the two major types of platforms upon which high-performance computing is converging. We focus on applications that are either regular and predictable or at least do not require fine-grained dynamic replication of irregularly accessed data. Within this class, we use programs with a range of important communication patterns. We examine whether the basic parallel algorithm and communication structuring approaches needed for best performance are similar or different among the models, whether some models have s...
The objective of this work is to compare the performance of three common environments for supporting...
This report compares the performance of different computer systems for basic message-passing. Latenc...
Shared memory is the most popular parallel programming model for multi-core processors, while messag...
Thesis (Ph. D.)--University of Washington, 1997Two recent trends are affecting the design of medium-...
This paper presents the comparison of the COMOPS benchmark performance in MPI and shared memory on t...
There are three major classes of MIMD multiprocessors: cache-coherent machines, NUMA (non-uniform me...
The majority of current HPC applications are composed of complex and irregular data structures that ...
There are two competing models for the on-chip memory in Chip Multiprocessor (CMP) systems: hardware...
There are several benchmark programs available to measure the performance of MPI on parallel comput...
The MPMD approach for parallel computing is attractive for programmers who seek fast development cy...
The range of high-end servers designed and manufactured by Bull includes cache-coherent distributed ...
International audienceShared memory MPI communication is an important part of the overall performanc...
Shared-memory multiprocessors built from commodity microprocessors are being increasingly used to pr...
Two interesting variations of large-scale shared-memory ma-chines that have recently emerged are cac...
All methods of multi-processing need some form of processor to processor communication. In shared me...
The objective of this work is to compare the performance of three common environments for supporting...
This report compares the performance of different computer systems for basic message-passing. Latenc...
Shared memory is the most popular parallel programming model for multi-core processors, while messag...
Thesis (Ph. D.)--University of Washington, 1997Two recent trends are affecting the design of medium-...
This paper presents the comparison of the COMOPS benchmark performance in MPI and shared memory on t...
There are three major classes of MIMD multiprocessors: cache-coherent machines, NUMA (non-uniform me...
The majority of current HPC applications are composed of complex and irregular data structures that ...
There are two competing models for the on-chip memory in Chip Multiprocessor (CMP) systems: hardware...
There are several benchmark programs available to measure the performance of MPI on parallel comput...
The MPMD approach for parallel computing is attractive for programmers who seek fast development cy...
The range of high-end servers designed and manufactured by Bull includes cache-coherent distributed ...
International audienceShared memory MPI communication is an important part of the overall performanc...
Shared-memory multiprocessors built from commodity microprocessors are being increasingly used to pr...
Two interesting variations of large-scale shared-memory ma-chines that have recently emerged are cac...
All methods of multi-processing need some form of processor to processor communication. In shared me...
The objective of this work is to compare the performance of three common environments for supporting...
This report compares the performance of different computer systems for basic message-passing. Latenc...
Shared memory is the most popular parallel programming model for multi-core processors, while messag...