This paper deals with the impact of fault prediction techniques on checkpointing strategies. We extend the classical first-order analysis of Young and Daly to the presence of a fault prediction system, characterized by its recall and its precision. In this framework, we provide optimal algorithms to decide whether and when to take predictions into account, and we derive the optimal value of the checkpointing period. These results make it possible to assess analytically the key parameters that impact the performance of fault predictors at very large scale.
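As a rough illustration of the quantities named in the abstract, the sketch below computes Young's classical first-order checkpointing period, sqrt(2 * MTBF * C), and the recall and precision of a predictor from raw counts. The function names and the count-based inputs are assumptions made for this example and do not come from the paper. The last helper is only a heuristic: it assumes that every predicted fault is handled by other means, so that only unpredicted faults, arriving with mean time MTBF / (1 - recall), drive periodic checkpointing. It is not presented as the paper's optimal period.

```python
import math


def young_period(mtbf: float, checkpoint_cost: float) -> float:
    """Young's first-order approximation of the checkpointing period."""
    if mtbf <= 0 or checkpoint_cost <= 0:
        raise ValueError("mtbf and checkpoint_cost must be positive")
    return math.sqrt(2.0 * mtbf * checkpoint_cost)


def predictor_metrics(true_pos: int, false_pos: int, false_neg: int) -> tuple[float, float]:
    """Return (recall, precision) of a fault predictor from raw counts.

    recall    = fraction of actual faults that were predicted
    precision = fraction of predictions that were actual faults
    """
    recall = true_pos / (true_pos + false_neg)
    precision = true_pos / (true_pos + false_pos)
    return recall, precision


def unpredicted_mtbf(mtbf: float, recall: float) -> float:
    """Mean time between faults the predictor misses (heuristic, see lead-in)."""
    if not 0.0 <= recall < 1.0:
        raise ValueError("recall must lie in [0, 1)")
    return mtbf / (1.0 - recall)
```

For instance, `young_period(unpredicted_mtbf(mtbf, recall), checkpoint_cost)` gives a longer period than `young_period(mtbf, checkpoint_cost)` whenever the recall is positive, which matches the intuition that a useful predictor lets the application checkpoint less often.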
We provide a framework to analyze multi-level checkpointing protocols, by form...
The Young/Daly formula provides an approximation of the optimal checkpoint per...
The probability for errors to occur in electronic systems is not known in advance, but depends on ma...
This paper deals with the impact of fault prediction techniques on checkpointing strategies. We exte...
This paper deals with the impact of fault prediction techniques on checkpointi...
Abstract: This paper deals with the impact of fault prediction techniques on checkpointing strategie...
This paper deals with the impact of fault prediction techniques on checkpointi...
This report provides an introduction to resilience methods. The emphasis is on checkpointing, the de...
This report provides an introduction to the design of scheduling algorithms to cope with faults on l...
The application of checkpointing as a fault-tolerance measure for real-time services (i.e., services...
Parallel execution time is expected to decrease as the number of processors in...
The application of checkpointing as a fault-tolerance measure for real-time services (i.e., services...
This work provides an optimal checkpointing strategy to protect iterative appl...
Prediction of individual sequences is investigated for cases in which the decision maker observes a ...
Shrinking hardware structures and decreasing operating voltages lead to an increasing number of tran...