Our study proposes a novel MPI-only parallel programming model with improved performance for SMP clusters. By rescheduling tasks in a typical flat MPI solution, our model forces processors of an SMP node to work in different phases, thereby avoiding unneccessary communication and computation bottlenecks. This study achieves a significant performance improvement with a minimal programming effort. In comparison with a de-facto flat MPI solution, our algorithm can yield a 21 % performance improvement for a 16-node cluster of Xeon dual-processor SMPs while performing a distributed matrix multiplication
MPI / OpenMP mixed mode codes could potentially offer the most effective parallelisation strategy fo...
Hybrid MPI/OpenMP and pure MPI on clusters of multi-core SMP nodes involve several mismatch problems...
We investigate the use of the SMPSs programming model to leverage task parallelism in the execution ...
Nowadays, cluster of SMPs has been becoming the main architecture in the super computing world. Clus...
Clusters of SMPs are ubiquitous. They have been traditionally programmed by using MPI. But, the prod...
Most HPC systems are clusters of shared memory nodes. Parallel programming must combine the distribu...
Most HPC systems are clusters of shared memory nodes. Parallel programming must combine the distribu...
Many parallel applications from scientific computing use MPI collective communication operations to ...
International audienceSince the last decade, most of the supercomputer architectures are based on cl...
The symmetric multiprocessing (SMP) cluster system, which consists of shared memory nodes with sever...
Many parallel applications from scientific computing use MPI collective communication operations to ...
Our study proposes a Reducing-size Task Assignation technique (RTA), which is a novel approach to so...
We describe a methodology for developing high performance programs running on clusters of SMP no...
This paper analyzes the strength and weakness of several parallel programming models on clusters of ...
MPI / OpenMP mixed mode codes could potentially offer the most effective parallelisation strategy fo...
MPI / OpenMP mixed mode codes could potentially offer the most effective parallelisation strategy fo...
Hybrid MPI/OpenMP and pure MPI on clusters of multi-core SMP nodes involve several mismatch problems...
We investigate the use of the SMPSs programming model to leverage task parallelism in the execution ...
Nowadays, cluster of SMPs has been becoming the main architecture in the super computing world. Clus...
Clusters of SMPs are ubiquitous. They have been traditionally programmed by using MPI. But, the prod...
Most HPC systems are clusters of shared memory nodes. Parallel programming must combine the distribu...
Most HPC systems are clusters of shared memory nodes. Parallel programming must combine the distribu...
Many parallel applications from scientific computing use MPI collective communication operations to ...
International audienceSince the last decade, most of the supercomputer architectures are based on cl...
The symmetric multiprocessing (SMP) cluster system, which consists of shared memory nodes with sever...
Many parallel applications from scientific computing use MPI collective communication operations to ...
Our study proposes a Reducing-size Task Assignation technique (RTA), which is a novel approach to so...
We describe a methodology for developing high performance programs running on clusters of SMP no...
This paper analyzes the strength and weakness of several parallel programming models on clusters of ...
MPI / OpenMP mixed mode codes could potentially offer the most effective parallelisation strategy fo...
MPI / OpenMP mixed mode codes could potentially offer the most effective parallelisation strategy fo...
Hybrid MPI/OpenMP and pure MPI on clusters of multi-core SMP nodes involve several mismatch problems...
We investigate the use of the SMPSs programming model to leverage task parallelism in the execution ...