This paper describes a non-blocking checkpointing mode in support of optimistic parallel discrete event simulation. This mode allows real concurrency in the execution of state saving and other simulation specific operations (e.g. event list update, event execution), with the aim at removing the cost of recording state information from the completion time of the parallel simulation application. We present an implementation of a C library supporting non-blocking checkpointing on a myrinet based cluster, which demonstrates the practical viability of this checkpointing mode on standard o#-the-shelf hardware. By the results of an empirical study on classical parameterized synthetic benchmarks we show that, except for the case of minimal s...
Nowadays hardware platforms offer a plethora of innovative facities for profiling the execution of p...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
Abstract: To rapidly evaluate performances and power consumption in design space exploration of mode...
CCL (checkpointing and communication library) is a software layer in support of optimistic parallel ...
Checkpointing-and-Communication Library (CCL) is a recently developed software implementing CPU offl...
Great effort has been devoted to the design of optimized checkpointing strategies for optimistic par...
In this paper we present a software approach, namely Fast-software-Checkpointing (FSC), to reduce th...
Recently a Checkpointing and Communication Library (CCL) for optimistic simulation on Myrinet based ...
This paper presents a checkpointing-recovery scheme for Time Warp parallel simulation. The scheme re...
Checkpointing overhead is a major obstacle for the effectiveness of Time Warp parallel discrete even...
AbstractThere are two approaches to reduce the overhead associated with coordinated checkpointing: f...
Discrete event simulation is an important tool for modeling and analysis. Some of the simulation app...
CCL (Checkpointing and Communication Library) is a recently developed software in support of optimis...
This thesis explores methods to decrease overheads in an optimistic parallel discrete event simulati...
In this paper we present a communication layer for Myrinet based clusters, designed to efficiently s...
Nowadays hardware platforms offer a plethora of innovative facities for profiling the execution of p...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
Abstract: To rapidly evaluate performances and power consumption in design space exploration of mode...
CCL (checkpointing and communication library) is a software layer in support of optimistic parallel ...
Checkpointing-and-Communication Library (CCL) is a recently developed software implementing CPU offl...
Great effort has been devoted to the design of optimized checkpointing strategies for optimistic par...
In this paper we present a software approach, namely Fast-software-Checkpointing (FSC), to reduce th...
Recently a Checkpointing and Communication Library (CCL) for optimistic simulation on Myrinet based ...
This paper presents a checkpointing-recovery scheme for Time Warp parallel simulation. The scheme re...
Checkpointing overhead is a major obstacle for the effectiveness of Time Warp parallel discrete even...
AbstractThere are two approaches to reduce the overhead associated with coordinated checkpointing: f...
Discrete event simulation is an important tool for modeling and analysis. Some of the simulation app...
CCL (Checkpointing and Communication Library) is a recently developed software in support of optimis...
This thesis explores methods to decrease overheads in an optimistic parallel discrete event simulati...
In this paper we present a communication layer for Myrinet based clusters, designed to efficiently s...
Nowadays hardware platforms offer a plethora of innovative facities for profiling the execution of p...
Checkpoint is defined as a designated place in a program at which normal processing is interrupted s...
Abstract: To rapidly evaluate performances and power consumption in design space exploration of mode...