The purpose of this paper is to evaluate how to use nodes in a cluster efficiently by studying the NAS Parallel Benchmarks (NPB) on Intel Xeon and AMD Opteron dual CPU Linux clusters. The performance results of NPB are presented both with one MPI process per node (1 ppn) and with two MPI processes per node (2 ppn). One would like to run all applications on a cluster with two processors per node using 2 ppn instead of 1 ppn in order to utilize the second processor on each node. However, the performance results from running the NPB and from the memory bandwidth benchmarks show that better performance can sometimes be achieved using 1 ppn. Our performance results show that the Opteron/Myrinet cluster is able to achieve significantly better uti...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
This research aims to study the relationship between parallel processing efficiency and several node...
Recently, energy has become an important issue in high-performance computing. For example, supercomp...
This report presents the results from various barrier implementations on the IHPCL clusters - beetl...
Piles of personal computers (PoPCs) have begun to challenge the performance of the traditional Massi...
Traditionally, a cluster is defined as a collection of homogeneous nodes interconnected by a single ...
This paper reports measurements of MPI communication benchmarks on the Khaldun cluster, which ran o...
Cluster computer systems assembled from commodity off-the-shelf components have emerged as a viable ...
We present performance results for version 2.1 of the NAS Parallel Benchmarks (NPB) on the following...
We introduce a methodology for the study of the application-level performance of time-sharing parall...
Currently, most supercomputers are multicore clusters. This type of architecture is said to be hybr...
This paper discusses the benchmarking of three parallelized implementations of the popular LS-Dyna® ...
A multi-core cluster is a cluster composed of a number of nodes where each node has a number of proce...
We describe a methodology for developing high performance programs running on clusters of SMP no...
The main topic of this thesis is the implementation and subsequent optimization of high performance ...