National audienceHigh Performance Computing machines use more and more Graphical Processing Units as they are very efficient for homogeneous computation such as matrix operations. However before using these accelerators, one has to transfer data from the processor to them. Such a transfer can be slow. In this report, our aim is to study the impact of communication times on the makespan of a scheduling. Indeed, with a better anticipation of these communications, we could use the GPUs even more efficiently. More precisely, we will focus on machines with one or more GPUs and on applications with a low ratio of computations over communications. During this study, we have implemented two offline scheduling algorithms within XKAAPI's runtime. The...
International audienceEfficient implementations of parallel applications on hetero-geneous hybrid ar...
The results summarized in this document deal with the scheduling of independent tasks on large scale...
International audienceMulti-core systems are increasingly interesting candidates for executing paral...
National audienceHigh Performance Computing machines use more and more Graphical Processing Units as...
International audienceMore and more computers use hybrid architectures combining multi-core processo...
Institute for Computing Systems ArchitectureProgrammers face daunting problems when attempting to de...
The parallelism within an algorithm at any stage of execution can be defined as the number of indepe...
This thesis focus on the problem of scheduling the tasks of a parallel application taking into accou...
In this thesis we study the behavior of parallel applications represented by a precedence graph. The...
Graphic Processing Units (GPUs) are currently widely used in High Performance Computing (HPC) applic...
International audienceThe race for Exascale computing has naturally led the current technologies to ...
International audienceTo fully tap into the potential of heterogeneous machines composed of multicor...
Parallelization of many irregular applications results in unstructured collective communication. In ...
More and more computers use hybrid architectures combining multi-core processors (CPUs) and hardware...
International audienceIn this paper, we present a comparison of scheduling strategies for heterogene...
International audienceEfficient implementations of parallel applications on hetero-geneous hybrid ar...
The results summarized in this document deal with the scheduling of independent tasks on large scale...
International audienceMulti-core systems are increasingly interesting candidates for executing paral...
National audienceHigh Performance Computing machines use more and more Graphical Processing Units as...
International audienceMore and more computers use hybrid architectures combining multi-core processo...
Institute for Computing Systems ArchitectureProgrammers face daunting problems when attempting to de...
The parallelism within an algorithm at any stage of execution can be defined as the number of indepe...
This thesis focus on the problem of scheduling the tasks of a parallel application taking into accou...
In this thesis we study the behavior of parallel applications represented by a precedence graph. The...
Graphic Processing Units (GPUs) are currently widely used in High Performance Computing (HPC) applic...
International audienceThe race for Exascale computing has naturally led the current technologies to ...
International audienceTo fully tap into the potential of heterogeneous machines composed of multicor...
Parallelization of many irregular applications results in unstructured collective communication. In ...
More and more computers use hybrid architectures combining multi-core processors (CPUs) and hardware...
International audienceIn this paper, we present a comparison of scheduling strategies for heterogene...
International audienceEfficient implementations of parallel applications on hetero-geneous hybrid ar...
The results summarized in this document deal with the scheduling of independent tasks on large scale...
International audienceMulti-core systems are increasingly interesting candidates for executing paral...