MapReduce is a widely used parallel computing framework for large-scale data processing. The two major performance metrics in MapReduce are job execution time and cluster throughput. Both can be seriously degraded by straggler machines, i.e., machines on which tasks take an unusually long time to finish. Speculative execution is a common approach to the straggler problem: slow-running tasks are simply backed up on alternative machines. Multiple speculative execution strategies have been proposed, but they have some pitfalls: i) they use the average progress rate to identify slow tasks, while in reality the progress rate can be unstable and misleading; ii) they cannot appropriately handle the situation in which there is data skew among the task...
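To make the progress-rate heuristic mentioned above concrete, the following is a minimal sketch, not taken from any of the cited systems, of how a scheduler might flag candidate tasks for speculative execution by comparing each task's average progress rate against the mean. The names, data structures, and the slow_factor threshold are illustrative assumptions; the comments also note why this heuristic is fragile under unstable rates and data skew.

```python
import time
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class TaskStatus:
    task_id: str
    progress: float     # fraction of work completed, in [0, 1]
    start_time: float   # wall-clock start time in seconds

def progress_rate(task: TaskStatus, now: float) -> float:
    """Average progress per second since the task started."""
    elapsed = max(now - task.start_time, 1e-9)
    return task.progress / elapsed

def speculation_candidates(tasks: List[TaskStatus],
                           now: Optional[float] = None,
                           slow_factor: float = 0.5) -> List[TaskStatus]:
    """Flag running tasks whose average progress rate falls far below the mean.

    This is the 'average progress rate' heuristic criticized above: because it
    averages over the whole task lifetime, a temporary stall looks permanent,
    and a task reading a larger-than-average input split (data skew) looks
    slow even when its machine is perfectly healthy.
    """
    now = time.time() if now is None else now
    running = [t for t in tasks if t.progress < 1.0]
    if not running:
        return []
    rates = {t.task_id: progress_rate(t, now) for t in running}
    mean_rate = sum(rates.values()) / len(rates)
    return [t for t in running if rates[t.task_id] < slow_factor * mean_rate]

# Example: task "m2" has made little progress relative to its peers,
# so it would be backed up on another machine.
if __name__ == "__main__":
    t0 = time.time() - 100
    tasks = [TaskStatus("m0", 0.8, t0), TaskStatus("m1", 0.7, t0),
             TaskStatus("m2", 0.2, t0)]
    print([t.task_id for t in speculation_candidates(tasks)])
```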
Hadoop is a well-known parallel computing framework used to process large-scale data, but the...
Apache Spark is an open-source in-memory cluster-computing framework. Spark decomposes an applicatio...
In cloud computing, jobs consisting of many tasks run in parall...
Recently, virtualization has become increasingly important in cloud computing to support effici...
MapReduce is currently a parallel computing framework for distributed processing of large-scale data i...
Task stragglers dramatically impede parallel job execution of data-intensive computing in Cloud Data...
Big Data systems (e.g., Google MapReduce, Apache Hadoop, Apache Spark) rely in...
Big data is one of the fastest growing technologies and can handle huge amounts of data fr...
MapReduce is a popular programming model for processing large data sets. Speculative...
Hadoop emerged as an important system for large-scale data analysis. Speculat...
Energy consumption is an important concern for large-scale data centers, which...
MapReduce has become popular in big data environments due to its efficient parallel processing. H...
MapReduce (MR) has been widely used to process large distributed data sets. Meanwhile, speculative e...
MapReduce (MRV1), a popular programming model proposed by Google, has been widely used to process lar...
Big Data, such as terabyte- and petabyte-scale datasets, is rapidly becoming the new norm for various organi...