Abstract. Thread affinity has appeared as an important technique to improve the overall program performance and for better performance stability. However, if we consider a program with multiple phases, it is unlikely that a single thread affinity produces the best program performance for all these phases. If we consider the case of OpenMP, applications may have multiple parallel regions, each with a dis-tinct inter-thread data sharing pattern. In this paper, we propose an approach that allows to change thread affinity dynamically (thread migrations) between parallel regions at runtime to account for these distinct inter-thread data sharing patterns. We demonstrate that as far as cache sharing is concerned for SPEC OMP01, not all the tested ...
High-level parallel programming models (PMs) are becoming crucial in order to extract the computatio...
Abstract—Multi-threaded applications execute their threads on different cores with their own local c...
Modern high performance computing architectures are based on multi-core and multi-threaded computing...
F. Wolf, B. Mohr, and D. an Ney (Eds.), pages 12, pp. 53-64International audienceThread affinity has...
This paper presents a set of proposals for the OpenMP shared-- memory programming model oriented to...
International audienceWith the introduction of multi-core processors, thread affinity has quickly ap...
This paper presents a set of proposals for the OpenMP shared--memory programming model oriented tow...
At the level of multi-core processors that share the same cache, data sharing among threads which be...
International audienceEfficiently programming shared-memory machines is a difficult challenge becaus...
In this paper we explore the possibility of reusing schedules to improve the scalability of numerica...
Holistic tuning and optimization of hybrid MPI and OpenMP applications is becoming focus for paralle...
We have developed compiler optimization techniques for explicit parallel programs using the OpenMP A...
Current architectures of multicore machines are becoming increasingly complex due to hierarchical de...
International audienceIn [8], we demonstrated that contrary to sequential applications, parallel Ope...
International audienceThe parallelism in shared-memory systems has increased significantly with the ...
High-level parallel programming models (PMs) are becoming crucial in order to extract the computatio...
Abstract—Multi-threaded applications execute their threads on different cores with their own local c...
Modern high performance computing architectures are based on multi-core and multi-threaded computing...
F. Wolf, B. Mohr, and D. an Ney (Eds.), pages 12, pp. 53-64International audienceThread affinity has...
This paper presents a set of proposals for the OpenMP shared-- memory programming model oriented to...
International audienceWith the introduction of multi-core processors, thread affinity has quickly ap...
This paper presents a set of proposals for the OpenMP shared--memory programming model oriented tow...
At the level of multi-core processors that share the same cache, data sharing among threads which be...
International audienceEfficiently programming shared-memory machines is a difficult challenge becaus...
In this paper we explore the possibility of reusing schedules to improve the scalability of numerica...
Holistic tuning and optimization of hybrid MPI and OpenMP applications is becoming focus for paralle...
We have developed compiler optimization techniques for explicit parallel programs using the OpenMP A...
Current architectures of multicore machines are becoming increasingly complex due to hierarchical de...
International audienceIn [8], we demonstrated that contrary to sequential applications, parallel Ope...
International audienceThe parallelism in shared-memory systems has increased significantly with the ...
High-level parallel programming models (PMs) are becoming crucial in order to extract the computatio...
Abstract—Multi-threaded applications execute their threads on different cores with their own local c...
Modern high performance computing architectures are based on multi-core and multi-threaded computing...