Analysis of checkpointing schemes for multiprocessor systems

Avi Ziv

Publication date

January 1993

Abstract

Parallel computing systems provide hardware redundancy that helps to achieve low-cost fault tolerance, by duplicating the task into more than a single processor, and comparing the states of the processors at checkpoints. This paper suggests a novel technique, based on a Markov Reward Model (MRM), for analyzing the performance of checkpointing schemes with task duplication. We show how this technique can be used to derive the average execution time of a task and other important parameters related to the performance of checkpointing schemes. Our analytical results match well the values we obtained using a simulation program. We compare the average task execution time and total work of four checkpointing schemes, and show th...
