High-frequency memory checkpointing is an important technique in several application domains, such as automatic error recovery (where frequent checkpoints allow the system to transparently mask failures) and application debugging (where frequent checkpoints enable fast and accurate time-traveling support). Unfortunately, existing (typically incremental) checkpointing frameworks incur substantial performance overhead in high-frequency memory checkpointing applications, thus discouraging their adoption in practice. This paper presents Speculative Memory Checkpointing (SMC), a new low-overhead technique for high-frequency memory checkpointing. Our motivating analysis identifies key bottlenecks in existing frameworks and demonstrates that the p...
Improving application performance is a major challenge for computer architects. Two important reason...
Speculative software parallelism has gained renewed interest recently as a mechanism to leve...
By leveraging the enormous amount of computational capabilities, scientists today are able to ...
High-frequency memory checkpointing is an important technique in several application domains, such ...
Checkpointing is a pivotal technique in system research, with applications ranging from crash recove...
With processor vendors pursuing multicore products, often at the expense of the complexity and aggre...
Checkpointing support allows program execution to roll back to an earlier program point, discarding ...
To make progress in the face of failures, long-running parallel applications need to save their ...
As modern supercomputing systems reach the peta-flop performance range, they grow in both ...
Checkpoint prediction and intelligent management have recently been proposed for reducing the number...
With increasing scale and complexity of supercomputing and cloud computing arc...
For checkpointing to be practical, it has to introduce low overhead for the targeted application. As...
In this paper, we study real-time in-memory checkpointing as an effective means to improve the relia...
MapReduce has become popular in big data environments due to its efficient parallel processing. H...