FT-GReLoSSS (FTG) is a C++/MPI framework that eases the development of fault-tolerant parallel applications belonging to an SPMD family termed GReLoSSS. The originality of FTG is that it relies on the principles of the MoLOToF programming model to facilitate the addition of efficient checkpoint-based fault tolerance at the application level. The main features of MoLOToF are structured application development based on fault-tolerant "skeletons" and an emphasis on collaborations between the programmer, the framework, and the underlying runtime middleware/environment. Together with the structured approach, these collaborations contribute to reduced checkpoint sizes as well as reduced checkpoint and recovery overhead at runtime. This paper i...
The ever-increasing performance demands of applications such as cryptography, scientific simulatio...
Industrial simulators are becoming increasingly complex because they must integrate ...
Embedded systems designers are moving to multicores to increase the performance of their application...
High performance computing applications must be resilient to faults, which are common occurrences es...
Facing the limits of traditional tools of resource management within computational grids (related to...
We propose a framework built around a JavaSpace to ease the development of bag-of-tasks applications...
This work studies the reliability of embedded systems with approximate computing on software and har...
The move towards exascale super-computers requires new fault tolerance solutio...
In this thesis, we describe and analyze a fully distributed approach for parallel Branch-and-Bound. ...
This work deals with scheduling and checkpointing strategies to execute scientific workflows on fail...
Current high-performance computing (HPC) systems are comprised of thousands of CPU core...
Exploiting concurrency to achieve greater performance is a difficult and important challenge for cur...
PC clusters are distributed architectures whose adoption spreads as a result of their low cost but a...