This thesis deals with two issues for future Exascale platforms, namely resilience and energy, which we address in two main parts. The first part focuses on the importance of reliability for future Exascale platforms, while the second part discusses how to reduce the energy consumption of these platforms. Given the relative slopes of two trends, the reliability of individual components on one side and the number of components on the other, the reliability of the entire platform is expected to decrease, due to probabilistic amplification. The mean time between two failures on Exascale systems is expected to be shorter than the time needed to take a system checkpoint. In the first part of this thes...
We consider divisible load scientific applications executing on large-scale pl...
High performance computing applications must be tolerant to faults, which are common occurrences esp...
To enable future scientific breakthroughs and discoveries, the next generation of scientific applica...
This short paper deals with parallel scientific applications using non-blocking and periodic coordin...
This thesis focuses on a major problem for the HPC community: resilience. Computing platforms are bi...
In this thesis, I considered, from a theoretical point of view, two important problems for futu...