This paper presents the evolution of the free agent threads for OpenMP to the new role-shifting threads model and their integration with the Dynamic Load Balancing (DLB) library. We demonstrate how free agent threads can improve resource utilization in OpenMP applications with load imbalance in their nested parallel regions. We also demonstrate how DLB efficiently manages the malleability exposed by the role-shifting threads to address load imbalance issues. We use three real-world scientific applications, one of them to demonstrate that free agents alone can improve the OpenMP model without external tools, and two other MPI+OpenMP applications, one of them with a coupling case, to illustrate the potential of the free agent threads’ m...
OpenMP is the de-facto standard for shared memory systems in High-Performance Computing (HPC). It in...
OpenMP has been very successful in exploiting structured parallelism in applications. With increasin...
OpenMP, as the de-facto standard programming model in symmetric multiprocessing for HPC, has seen it...
In this paper, we introduce a design and implementation of the free agent threads for OpenMP. These ...
Efficiency is a must in the HPC world. Supercomputers are extensively used in public research instit...
The hybrid programming model MPI+OpenMP are useful to solve the problems of load balancing of parall...
In the design of future HPC systems, research in resource management is showing an increasing intere...
Tasking promises a model to program parallel applications that provides intuitive semantics. In the ...
Loop-based parallelism is a common in scientific codes. OpenMP proposes such work-sharing construct ...
OpenMP is a very convenient programming model to parallelize critical real-time applications for sev...
The most widely used node type in high-performance computing nowadays is a 2-socket server node. The...
High-level parallel programming models (PMs) are becoming crucial in order to extract the computatio...
Holistic tuning and optimization of hybrid MPI and OpenMP applications is becoming focus for paralle...
Load imbalance is a long-standing source of inefficiency in high performance computing. The situati...
OpenMP provides a portable programming interface for shared memory parallel computers (SMPs). Althou...
OpenMP is the de-facto standard for shared memory systems in High-Performance Computing (HPC). It in...
OpenMP has been very successful in exploiting structured parallelism in applications. With increasin...
OpenMP, as the de-facto standard programming model in symmetric multiprocessing for HPC, has seen it...
In this paper, we introduce a design and implementation of the free agent threads for OpenMP. These ...
Efficiency is a must in the HPC world. Supercomputers are extensively used in public research instit...
The hybrid programming model MPI+OpenMP are useful to solve the problems of load balancing of parall...
In the design of future HPC systems, research in resource management is showing an increasing intere...
Tasking promises a model to program parallel applications that provides intuitive semantics. In the ...
Loop-based parallelism is a common in scientific codes. OpenMP proposes such work-sharing construct ...
OpenMP is a very convenient programming model to parallelize critical real-time applications for sev...
The most widely used node type in high-performance computing nowadays is a 2-socket server node. The...
High-level parallel programming models (PMs) are becoming crucial in order to extract the computatio...
Holistic tuning and optimization of hybrid MPI and OpenMP applications is becoming focus for paralle...
Load imbalance is a long-standing source of inefficiency in high performance computing. The situati...
OpenMP provides a portable programming interface for shared memory parallel computers (SMPs). Althou...
OpenMP is the de-facto standard for shared memory systems in High-Performance Computing (HPC). It in...
OpenMP has been very successful in exploiting structured parallelism in applications. With increasin...
OpenMP, as the de-facto standard programming model in symmetric multiprocessing for HPC, has seen it...