In the context of HPC platforms, individual nodes nowadays consist in heterogenous processing resources such as GPU units and multicores. Those resources share communication and storage resources , what induces complex co-scheduling effects, and makes it hard to predict the exact duration of a task or of a communication. To cope with these issues, runtime dynamic schedulers such as StarPU have been developed. These systems base their decisions at runtime on the state of the platform and possibly on static priorities of tasks computed offline. In this paper, our goal is to quantify performance variability in the context of HPC heterogeneous nodes, by focusing on very regular dense linear algebra kernels. Then, we analyze the impact of this v...
International audienceIn the field of HPC, the current hardware trend is to design multiprocessor ar...
International audienceA now-classical way of meeting the increasing demand for computing speed by HP...
Our goal is to provide an analysis and comparison of static and dynamic strategies for task graph sc...
The tremendous increase in the size and heterogeneity of supercomputers makes it very difficult to p...
The tremendous increase in the size and heterogeneity of supercomputers makes it very difficult to p...
In High Performance Computing, heterogeneity is now the norm with specialized accelerators like GPUs...
With High Performance Computing moving towards Exascale, where parallel applications will be require...
Taufer, MichelaHigh performance computing (HPC) is undergoing many changes at both the system and wo...
Load imbalance cause significant performance degradation in High Performance Computing applications....
Du fait des énormes capacités de calculs des accélérateurs tels que les GPUs et les Xeon Phi, l’util...
International audienceSUMMARY Multi-core architectures comprising several GPUs have become mainstrea...
Due to massive computation power of accelerators such as GPU, Xeon phi, multicore machines equipped ...
We consider the problem of allocating and scheduling dense linear application on fully heterogeneous...
Next generation HPC applications will increasingly time-share system resources with emerging workloa...
International audienceIn the field of HPC, the current hardware trend is to design multiprocessor ar...
International audienceIn the field of HPC, the current hardware trend is to design multiprocessor ar...
International audienceA now-classical way of meeting the increasing demand for computing speed by HP...
Our goal is to provide an analysis and comparison of static and dynamic strategies for task graph sc...
The tremendous increase in the size and heterogeneity of supercomputers makes it very difficult to p...
The tremendous increase in the size and heterogeneity of supercomputers makes it very difficult to p...
In High Performance Computing, heterogeneity is now the norm with specialized accelerators like GPUs...
With High Performance Computing moving towards Exascale, where parallel applications will be require...
Taufer, MichelaHigh performance computing (HPC) is undergoing many changes at both the system and wo...
Load imbalance cause significant performance degradation in High Performance Computing applications....
Du fait des énormes capacités de calculs des accélérateurs tels que les GPUs et les Xeon Phi, l’util...
International audienceSUMMARY Multi-core architectures comprising several GPUs have become mainstrea...
Due to massive computation power of accelerators such as GPU, Xeon phi, multicore machines equipped ...
We consider the problem of allocating and scheduling dense linear application on fully heterogeneous...
Next generation HPC applications will increasingly time-share system resources with emerging workloa...
International audienceIn the field of HPC, the current hardware trend is to design multiprocessor ar...
International audienceIn the field of HPC, the current hardware trend is to design multiprocessor ar...
International audienceA now-classical way of meeting the increasing demand for computing speed by HP...
Our goal is to provide an analysis and comparison of static and dynamic strategies for task graph sc...