International audienceWhile task-based programming, such as OpenMP, is a promising solution to exploit large HPC compute nodes, it has to be mixed with data communications like MPI. However, performance or even more thread progression may depend on the underlying runtime implementations. In this paper, we focus on enhancing the application performance when an OpenMP task blocks inside MPI communications. This technique requires no additional effort on the application developers. It relies on an online task reordering strategy that aims at running first tasks that are sending data to other processes. We evaluate our approach on a Cholesky factorization and show that we gain around 19% of execution time on an Intel Skylake compute nodes machi...
Asynchronous task-based programming models are gaining popularity to address the programmability and...
Tasking promises a model to program parallel applications that provides intuitive semantics. In the ...
International audienceThe architecture of supercomputers is evolving to expose massive parallelism. ...
International audienceWhile task-based programming, such as OpenMP, is a promising solution to explo...
Editors: Michael Klemm; Bronis R. de Supinski et al.International audienceHeterogeneous supercompute...
After a brief introduction on Cross Motif Search and its OpenMP and Hybrid OpenMP-MPI implementatio...
Holistic tuning and optimization of hybrid MPI and OpenMP applications is becoming focus for paralle...
25th International Conference on Parallel and Distributed Computing, Göttingen, Germany, August 26-3...
Most high-performance, scientific libraries have adopted hybrid parallelization schemes - such as t...
Loop-based parallelism is a common in scientific codes. OpenMP proposes such work-sharing construct ...
International audienceWhen aiming for large scale parallel computing, waiting time due to network la...
Asynchronous task-based programming models are gaining popularity to address the programmability and...
Tasking promises a model to program parallel applications that provides intuitive semantics. In the ...
International audienceThe architecture of supercomputers is evolving to expose massive parallelism. ...
International audienceWhile task-based programming, such as OpenMP, is a promising solution to explo...
Editors: Michael Klemm; Bronis R. de Supinski et al.International audienceHeterogeneous supercompute...
After a brief introduction on Cross Motif Search and its OpenMP and Hybrid OpenMP-MPI implementatio...
Holistic tuning and optimization of hybrid MPI and OpenMP applications is becoming focus for paralle...
25th International Conference on Parallel and Distributed Computing, Göttingen, Germany, August 26-3...
Most high-performance, scientific libraries have adopted hybrid parallelization schemes - such as t...
Loop-based parallelism is a common in scientific codes. OpenMP proposes such work-sharing construct ...
International audienceWhen aiming for large scale parallel computing, waiting time due to network la...
Asynchronous task-based programming models are gaining popularity to address the programmability and...
Tasking promises a model to program parallel applications that provides intuitive semantics. In the ...
International audienceThe architecture of supercomputers is evolving to expose massive parallelism. ...