The hybrid MPI+OpenMP programming model is useful for solving load-balancing problems in parallel applications independently of the architecture. Typical approaches to balancing parallel applications that use two levels of parallelism, or only MPI, rely on complex code that dynamically detects which data domains are more computationally intensive and then manually redistributes either the allocated processors or the data. This approach has two drawbacks: it is time-consuming and it requires an expert in application analysis. In this paper we present an automatic and dynamic approach for load balancing MPI+OpenMP applications. The system calculates the percentage of load imbalance and decides a processor distribution for...
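The two steps named in the abstract above (measuring the percentage of load imbalance, then choosing a processor distribution) can be illustrated with a minimal sketch. It is not the paper's algorithm: the metric `(max - mean) / max`, the proportional allocation rule, and the function names `imbalance_percent` and `distribute_cpus` are all assumptions made here for illustration.

```python
def imbalance_percent(times):
    """Assumed metric: share of the longest process's time that the
    average process spends idle, (max - mean) / max * 100."""
    longest = max(times)
    if longest == 0:
        return 0.0
    mean = sum(times) / len(times)
    return 100.0 * (longest - mean) / longest


def distribute_cpus(times, total_cpus):
    """Hypothetical policy: one CPU per MPI process, then the spare CPUs
    are shared in proportion to each process's measured compute time."""
    n = len(times)
    if total_cpus < n:
        raise ValueError("need at least one CPU per process")
    spare = total_cpus - n
    total_time = sum(times)
    if total_time == 0:
        shares = [spare / n] * n
    else:
        shares = [spare * t / total_time for t in times]
    alloc = [1 + int(s) for s in shares]
    # Hand out the CPUs lost to rounding down, largest fractional part first.
    leftover = total_cpus - sum(alloc)
    order = sorted(range(n), key=lambda i: shares[i] - int(shares[i]), reverse=True)
    for i in order[:leftover]:
        alloc[i] += 1
    return alloc


if __name__ == "__main__":
    measured = [4.0, 2.0, 1.0, 1.0]  # made-up per-process compute times, in seconds
    print(imbalance_percent(measured))
    print(distribute_cpus(measured, total_cpus=8))
```

In a real hybrid run, each entry of the returned list would plausibly become the OpenMP thread count of the corresponding MPI process, so that heavily loaded processes get more threads.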
The work in this paper focuses on providing malleability to MPI applications by using a novel perfor...
bulk synchronous parallel (BSP) communication model can hinder performance increases. This is due to...
OpenMP is a very convenient programming model to parallelize critical real-time applications for sev...
Modern computer systems are designed around multiprocessor configurations. Multiple processors...
Load imbalance is a long-standing source of inefficiency in high performance computing. The situati...
Several performance analysis tools support hybrid applications. Most originated as MPI profiling or ...
This paper introduces FLEX-MPI, a novel runtime approach for the dynamic load balancing of MPI-based...
To provide increasing computational power for numerical simulations, supercomputers evolved and aren...
Holistic tuning and optimization of hybrid MPI and OpenMP applications is becoming a focus for paralle...
After a brief introduction on Cross Motif Search and its OpenMP and Hybrid OpenMP-MPI implementatio...
Modern computing servers usually consist of clusters of computers with several multi-core CPUs featu...
The new generation of parallel applications are complex, involve simulation of dynamically varying s...
This paper presents the evolution of the free agent threads for OpenMP to the new role-shifting thre...
Clusters of symmetric multiprocessors (SMPs) are currently the most widely used architecture for large scal...
The main computing phases of numerical methods for solving partial differential equations are the al...