International audienceIn this paper, we aim at optimizing fault-tolerance tech- niques based on a checkpointing/restart mechanism, in the context of cloud computing. Our contribution is three-fold. (1) We derive a fresh formula to compute the optimal num- ber of checkpoints for cloud jobs with varied distributions of failure events. Our analysis is not only generic with no assumption on failure probability distribution, but also at- tractively simple to apply in practice. (2) We design an adaptive algorithm to optimize the impact of checkpointing regarding various costs like checkpointing/restart overhead. (3) We evaluate our optimized solution in a real cluster en- vironment with hundreds of virtual machines and Berke- ley Lab Checkpoint/R...
Cloud computing is a promising paradigm that provides users with higher computing benefits in terms ...
This paper was submitted to IEEE Cloud 2010.Recently introduced spot instances in the Amazon Elastic...
In cloud computing, users can rent computing resources from service providers according to their dem...
International audienceIn this paper, we aim at optimizing fault-tolerance tech- niques based on a ch...
In this paper, we aim at optimizing fault-tolerance techniques based on a checkpointing/restart mech...
Fault tolerance in cloud computing is considered as one of the most vital issues to deliver reliable...
Cloud computing is playing a vital role for processing big data. The infrastructure is built on top ...
Cloud fault tolerance is an important issue in cloud computing platforms and applications. In the ev...
Checkpointing has been widely adopted in support of fault-tolerance and job migration essential for ...
International audienceA non-invasive, cloud-agnostic approach is demonstratedfor extending existing ...
The introduction of computers has been a huge plus to human life in its entirety because it provides...
Cloud computing is a web based technology which is the next step in the evolution of distributed com...
Abstract — Main objective of this research work is to improve the checkpoint efficiency for integrat...
Abstract. Cloud computing is a new benchmark towards enterprise application development that can fac...
International audience<span id="ctl00_ctl00_cphMain_cphSection_lblAbstract" class="margin-bottom-10"...
Cloud computing is a promising paradigm that provides users with higher computing benefits in terms ...
This paper was submitted to IEEE Cloud 2010.Recently introduced spot instances in the Amazon Elastic...
In cloud computing, users can rent computing resources from service providers according to their dem...
International audienceIn this paper, we aim at optimizing fault-tolerance tech- niques based on a ch...
In this paper, we aim at optimizing fault-tolerance techniques based on a checkpointing/restart mech...
Fault tolerance in cloud computing is considered as one of the most vital issues to deliver reliable...
Cloud computing is playing a vital role for processing big data. The infrastructure is built on top ...
Cloud fault tolerance is an important issue in cloud computing platforms and applications. In the ev...
Checkpointing has been widely adopted in support of fault-tolerance and job migration essential for ...
International audienceA non-invasive, cloud-agnostic approach is demonstratedfor extending existing ...
The introduction of computers has been a huge plus to human life in its entirety because it provides...
Cloud computing is a web based technology which is the next step in the evolution of distributed com...
Abstract — Main objective of this research work is to improve the checkpoint efficiency for integrat...
Abstract. Cloud computing is a new benchmark towards enterprise application development that can fac...
International audience<span id="ctl00_ctl00_cphMain_cphSection_lblAbstract" class="margin-bottom-10"...
Cloud computing is a promising paradigm that provides users with higher computing benefits in terms ...
This paper was submitted to IEEE Cloud 2010.Recently introduced spot instances in the Amazon Elastic...
In cloud computing, users can rent computing resources from service providers according to their dem...