MapReduce is an efficient framework for the parallel processing of distributed big data in a cluster environment. In such a cluster, task failures can degrade application performance. Although MapReduce automatically reschedules failed tasks, job completion takes a long time because each failed task restarts from scratch. Checkpointing is a valuable technique for avoiding the re-execution of failed tasks in MapReduce. However, an incorrectly defined checkpoint interval can still degrade the performance of MapReduce applications and lengthen job completion time. In this paper, an optimum checkpoint interval is proposed to reduce MapReduce job completion time when failures occur. The proposed system defines a checkpoint interval that is based on five parameters:...
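The five parameters of the proposed interval are truncated above and are not reconstructed here. As background only, a minimal sketch of the classic first-order checkpoint-interval approximation (Young's formula), which the checkpointing strategies surveyed below commonly use as a baseline, is shown; the checkpoint cost and MTBF values are illustrative assumptions, not figures from the paper.

```python
import math

def young_optimal_interval(checkpoint_cost_s: float, mtbf_s: float) -> float:
    """Young's first-order approximation of the optimal checkpoint
    interval: T_opt = sqrt(2 * C * M), where C is the time to write
    one checkpoint and M is the mean time between failures."""
    return math.sqrt(2.0 * checkpoint_cost_s * mtbf_s)

# Illustrative values only (not from the paper): a 30 s checkpoint
# cost and, on average, one failure every 6 hours.
t_opt = young_optimal_interval(checkpoint_cost_s=30.0, mtbf_s=6 * 3600)
print(f"checkpoint every ~{t_opt / 60:.1f} min")  # -> ~19.0 min
```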
The scaling up of new parallel and distributed computing platforms raises numerous...
Researchers have mentioned that the three most difficult and growing problems in the future of high-...
By leveraging enormous computational capabilities, scientists today are able to ...
MapReduce has become popular in big data environments due to its efficient parallel processing. H...
The computing paradigm of MapReduce has gained extreme popularity in the area of large-s...
High-Performance Computing (HPC) has passed the Petascale mark and is moving forward to Exascale. As...
Due to the growing size of compute clusters, large scale parallel applications increasingly have to ...
This work provides an analysis of checkpointing strategies for minimizing expe...
Over the last decade, computing systems have turned to large-scale parallel platforms composed of thousand...
Omission failures represent an important source of problems in data-intensive ...
This report provides an introduction to the design of scheduling algorithms to cope with faults on l...
The HPC community projects that future extreme-scale systems will be much less stable than curr...
Checkpointing is a typical approach to tolerate failures in today’s supercomputing cluste...
Checkpoint and recovery protocols are commonly used in distributed applications for providing fault ...