T3E-900, the Cray Origin 2000 and the IBM P2SC on a collection of 13 communication tests. These tests call MPI routines using 2 to 64 processors with messages varying from 8 bytes to 10 MBytes. The relative performance of these machines varied depending on the communication test, but overall the T3E-900 was often 2 to 4 times faster than the Origin 2000 and P2SC. The Origin 2000 and P2SC performed about the same for most of the tests
The primary purpose of this technical report was to evaluate the performance of the MPI-2 one-sided...
This paper presents an automatic counter instrumentation and pro ling module added to the MPI librar...
This report compares the performance of different computer systems for basic message-passing. Latenc...
We have implemented eight of the MPI collective routines using MPI point-to-point communication rou...
We have implemented eight of the MPI collective routines using MPI point-to-point communication rou...
In this paper the parallel benchmark code PSTSWM is used to evaluate the performance of the vendor-s...
There are several benchmark programs available to measure the performance of MPI on parallel comput...
Report 01/93 [Pre93] describes the ndings of a series of communication measurements performed on a M...
This paper presents scalability and communication performance results for a cluster of PCs running ...
AbstractThe HPC Challenge (HPCC) Benchmark suite and the Intel MPI Benchmark (IMB) are used to compa...
IBM SP--2 has become a popular MPP for scientific community. Its programming environment includes se...
This paper reports the measurements of MPI communication benchmarking on Khaldun cluster which ran o...
We evaluate the architectural support of collective communication operations on the IBM SP2, Cray T3...
This report compares the performance of different computer systems message passing. Latency and band...
In this paper we describe the difficulties inherent in making accurate, reproducible measurements of...
The primary purpose of this technical report was to evaluate the performance of the MPI-2 one-sided...
This paper presents an automatic counter instrumentation and pro ling module added to the MPI librar...
This report compares the performance of different computer systems for basic message-passing. Latenc...
We have implemented eight of the MPI collective routines using MPI point-to-point communication rou...
We have implemented eight of the MPI collective routines using MPI point-to-point communication rou...
In this paper the parallel benchmark code PSTSWM is used to evaluate the performance of the vendor-s...
There are several benchmark programs available to measure the performance of MPI on parallel comput...
Report 01/93 [Pre93] describes the ndings of a series of communication measurements performed on a M...
This paper presents scalability and communication performance results for a cluster of PCs running ...
AbstractThe HPC Challenge (HPCC) Benchmark suite and the Intel MPI Benchmark (IMB) are used to compa...
IBM SP--2 has become a popular MPP for scientific community. Its programming environment includes se...
This paper reports the measurements of MPI communication benchmarking on Khaldun cluster which ran o...
We evaluate the architectural support of collective communication operations on the IBM SP2, Cray T3...
This report compares the performance of different computer systems message passing. Latency and band...
In this paper we describe the difficulties inherent in making accurate, reproducible measurements of...
The primary purpose of this technical report was to evaluate the performance of the MPI-2 one-sided...
This paper presents an automatic counter instrumentation and pro ling module added to the MPI librar...
This report compares the performance of different computer systems for basic message-passing. Latenc...