The road towards Exascale Computing requires a holistic effort to address three different challenges simultaneously: high performance, energy efficiency, and programmability. The use of runtime task schedulers to orchestrate parallel executions with minimal developer intervention has been introduced in recent years to tackle the programmability issue while maintaining, or even improving, performance. In this paper, we enhance the SuperMatrix runtime task scheduler integrated in the libflame library in two different directions that address high performance and energy efficiency. First, we extend the runtime by accom- modating hybrid parallel executions and managing task priorities for dense linear algebra operations, with remarkable performa...
peer reviewedWith the fast development of supercomputers, energy consumption by large scale compute...
Parallel applications often rely on work stealing schedulers in combination with fine-grained taskin...
International audienceAlthough the hardware has dramatically changed in the last few years, nodes of...
The road towards Exascale Computing requires a holistic effort to address three different challenges...
[EN] This paper analyzes the impact on power con- sumption of two DVFS-control strategies when appli...
This paper addresses the efficient exploitation of task-level parallelism, present in many dense lin...
This paper addresses the efficient explotation of task-level parallelism, present in many dense lin...
The emergence of new manycore architectures, such as the Intel Xeon Phi, poses new challenges in how...
To help shrink the programmability-performance efficiency gap, we discuss that adaptive runtime syst...
International audienceIn this paper, we analyse performance and energy consumption of five OpenMP ru...
The field of High Performance Computing (HPC) is characterized by the continuous evolution of comput...
Due to massive computation power of accelerators such as GPU, Xeon phi, multicore machines equipped ...
Version longue publiée dans Concurrency and Computation: Practice and Experience.International audie...
The field of High Performance Computing (HPC) is characterized by the contin-uous evolution of compu...
On the road to exascale computing, the gap between hardware peak performance and application perform...
peer reviewedWith the fast development of supercomputers, energy consumption by large scale compute...
Parallel applications often rely on work stealing schedulers in combination with fine-grained taskin...
International audienceAlthough the hardware has dramatically changed in the last few years, nodes of...
The road towards Exascale Computing requires a holistic effort to address three different challenges...
[EN] This paper analyzes the impact on power con- sumption of two DVFS-control strategies when appli...
This paper addresses the efficient exploitation of task-level parallelism, present in many dense lin...
This paper addresses the efficient explotation of task-level parallelism, present in many dense lin...
The emergence of new manycore architectures, such as the Intel Xeon Phi, poses new challenges in how...
To help shrink the programmability-performance efficiency gap, we discuss that adaptive runtime syst...
International audienceIn this paper, we analyse performance and energy consumption of five OpenMP ru...
The field of High Performance Computing (HPC) is characterized by the continuous evolution of comput...
Due to massive computation power of accelerators such as GPU, Xeon phi, multicore machines equipped ...
Version longue publiée dans Concurrency and Computation: Practice and Experience.International audie...
The field of High Performance Computing (HPC) is characterized by the contin-uous evolution of compu...
On the road to exascale computing, the gap between hardware peak performance and application perform...
peer reviewedWith the fast development of supercomputers, energy consumption by large scale compute...
Parallel applications often rely on work stealing schedulers in combination with fine-grained taskin...
International audienceAlthough the hardware has dramatically changed in the last few years, nodes of...