This paper presents a checkpointing-recovery scheme for Time Warp parallel simulation. The scheme relies on a checkpointing protocol, namely mixed state saving, embedding both sparse and incremental state saving modes, and on a state recovery procedure embedding both forward and backward recovery modes. This scheme is a generalization of many previous solutions, which can be obtained as particular instances of it by selecting appropriate values for the checkpointing protocol parameters. We also present two regulating algorithms to adaptively tune the checkpointing protocol parameters, in order to make the protocol reacting to variable rollback behavior. A synthetic benchmark in several different configurations has been used for evaluating a...
IEEE International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication ...
A rollback operation in a speculative parallel discrete event simulator has traditionally targeted t...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
Time warp discrete event simulators take advantage of the parallel processing of simulation events. ...
The Time Warp synchronization protocol for Parallel Discrete Event Simulation (PDES) is universally ...
Abstract Index-based checkpointing allows the use of simple and efficient algorithms for dom-ino-eff...
technical reportThe Time Warp mechanism offers an elegant approach to attacking difficult clock sync...
Checkpointing is widely used in robust fault-tolerant applications. We present an efficient incremen...
Discrete event simulation is an important tool for modeling and analysis. Some of the simulation app...
In this paper we present a software approach, namely Fast-software-Checkpointing (FSC), to reduce th...
Parallel simulation is a well developed technique for executing large and complex simulation models ...
Traditional checkpoint and recovery are based upon two basic assumptions. The first is the need to h...
In traditional distributed simulation schemes, entire simulation needs to be restarted if any of the...
The rollback operation is a fundamental building block to support the correct execution of a specula...
This paper describes a non-blocking checkpointing mode in support of optimistic parallel discrete e...
IEEE International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication ...
A rollback operation in a speculative parallel discrete event simulator has traditionally targeted t...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
Time warp discrete event simulators take advantage of the parallel processing of simulation events. ...
The Time Warp synchronization protocol for Parallel Discrete Event Simulation (PDES) is universally ...
Abstract Index-based checkpointing allows the use of simple and efficient algorithms for dom-ino-eff...
technical reportThe Time Warp mechanism offers an elegant approach to attacking difficult clock sync...
Checkpointing is widely used in robust fault-tolerant applications. We present an efficient incremen...
Discrete event simulation is an important tool for modeling and analysis. Some of the simulation app...
In this paper we present a software approach, namely Fast-software-Checkpointing (FSC), to reduce th...
Parallel simulation is a well developed technique for executing large and complex simulation models ...
Traditional checkpoint and recovery are based upon two basic assumptions. The first is the need to h...
In traditional distributed simulation schemes, entire simulation needs to be restarted if any of the...
The rollback operation is a fundamental building block to support the correct execution of a specula...
This paper describes a non-blocking checkpointing mode in support of optimistic parallel discrete e...
IEEE International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication ...
A rollback operation in a speculative parallel discrete event simulator has traditionally targeted t...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...