Abstract. Efficient implementations of parallel applications on heterogeneous hybrid architectures require a careful balance between computations and communications with accelerator devices. Even if most of the communication time can be overlapped by computations, it is essential to reduce the total volume of communicated data. The literature therefore abounds with ad hoc methods to reach that balance, but these are architecture- and application-dependent. We propose here a generic mechanism to automatically optimize the scheduling between CPUs and GPUs, and compare two strategies within this mechanism: the classical Heterogeneous Earliest Finish Time (HEFT) algorithm and our new, parametrized, Distributed Affinity Dual Approximation algo...
In High Performance Computing, heterogeneity is now the norm with specialized accelerators like GPUs...
The aim of data and task parallel scheduling for dense linear algebra kernels is to minimize the pro...
Best Paper. More and more computers use hybrid architectures combining multi-co...
Efficient implementations of parallel applications on heterogeneous hybrid ar...
In this paper, we present a comparison of scheduling strategies for heterogene...
With the emergence of General Purpose computation on GPU (GPGPU) and corresponding programming fram...
More and more computers use hybrid architectures combining multi-core processo...
Most recent HPC platforms have heterogeneous nodes composed of multi-core CPUs...
Modern high-performance computers engage a variety of computing devices. Underutilization and oversu...
In this paper, we consider task-based dense linear algebra applications on a single heterogeneous no...
Today's heterogeneous architectures bring together multiple general purpose CPUs, domain specific GP...
To help shrink the programmability-performance efficiency gap, we discuss that adaptive runtime syst...
The race for Exascale computing has naturally led the current technologies to ...
In this paper, we describe a runtime to automatically enhance the performance of applications runnin...
Heterogeneous parallel architectures like those comprised of CPUs and GPUs are a tantalizing compute...