National audience. High Performance Computing machines increasingly use Graphics Processing Units (GPUs), which are very efficient for homogeneous computations such as matrix operations. However, before these accelerators can be used, data must be transferred to them from the processor, and such a transfer can be slow. In this report, we study the impact of communication times on the makespan of a schedule. With better anticipation of these communications, the GPUs could be used even more efficiently. More precisely, we focus on machines with one or more GPUs and on applications with a low ratio of computation to communication. During this study, we implemented two offline scheduling algorithms within XKAAPI's runtime. The...
International audience. Efficient implementations of parallel applications on heterogeneous hybrid ar...
Scheduling is an important issue in parallel processing. Most scheduling algorithms makes assigns ta...
International audience. Multi-core systems are increasingly interesting candidates for executing paral...
International audience. More and more computers use hybrid architectures combining multi-core processo...
In this thesis we study the behavior of parallel applications represented by a precedence graph. The...
Institute for Computing Systems Architecture. Programmers face daunting problems when attempting to de...
This thesis focuses on the problem of scheduling the tasks of a parallel application taking into accou...
Graphics Processing Units (GPUs) are currently widely used in High Performance Computing (HPC) applic...
The parallelism within an algorithm at any stage of execution can be defined as the number of indepe...
International audience. To fully tap into the potential of heterogeneous machines composed of multicor...
More and more computers use hybrid architectures combining multi-core processors (CPUs) and hardware...
International audience. The race for Exascale computing has naturally led the current technologies to ...
The results summarized in this document deal with the scheduling of independent tasks on large scale...
Parallelization of many irregular applications results in unstructured collective communication. In ...