MapReduce has become a popular data processing framework in the past few years. The scheduling algorithm is crucial to the performance of a MapReduce cluster, especially when the cluster is concurrently executing a batch of MapReduce jobs. However, the scheduling problem in MapReduce differs from the traditional job scheduling problem because the reduce phase usually starts before the map phase is finished, in order to shuffle the intermediate data. This paper develops a new strategy, named OMO, which particularly aims to optimize the overlap between the map and reduce phases. Our solution includes two new techniques, lazy start of reduce tasks and batch finish of map tasks, which capture the characteristics of the overlap in a MapReduce process and achiev...
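The "lazy start of reduce tasks" idea above can be illustrated with a minimal sketch. This is not the paper's OMO implementation; it only shows the general mechanism, analogous to Hadoop's real `mapreduce.job.reduce.slowstart.completedmaps` setting: reduce tasks are held back until a configurable fraction of map tasks has completed, so reduce slots do not sit idle waiting for shuffle data. The function name and threshold are illustrative assumptions.

```python
# Hedged sketch of lazy reduce start (not the OMO algorithm itself).
# Analogous to Hadoop's mapreduce.job.reduce.slowstart.completedmaps:
# reducers launch only once enough map output exists to shuffle.

def should_start_reduce(completed_maps: int, total_maps: int,
                        slowstart_fraction: float = 0.8) -> bool:
    """Return True once the completed-map fraction reaches the
    slowstart threshold, so reduce slots are not wasted early."""
    if total_maps == 0:
        return False  # no map work yet, nothing to shuffle
    return completed_maps / total_maps >= slowstart_fraction

# With the 0.8 threshold:
# should_start_reduce(7, 10)  -> False (70% of maps done)
# should_start_reduce(8, 10)  -> True  (80% of maps done)
```

A later start wastes less reduce-slot time but delays the shuffle; tuning this trade-off is precisely the overlap the abstract describes.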
Hadoop’s implementation of the MapReduce programming model pipelines the data processing and provid...
The MapReduce programming model is widely acclaimed as a key solution to desig...
Within this paper, we target one subset of production MapReduce workloads that contain some indep...
MapReduce is a popular parallel computing paradigm for large-scale data processing in clusters and d...
Over the last ten years MapReduce has emerged as one of the staples of distributed computing both in...
MapReduce is a scalable parallel computing framework for big data processing. It exhibits multiple ...
In this paper, we propose a novel algorithm to solve the starving problem of the small jo...
Over the past few decades, there has been a multifold increase in the amount of digital data that is being...
Although MapReduce has been praised for its high scalability and fault toleran...
The specific choice of workload task schedulers for Hadoop MapReduce applications can hav...
MapReduce has emerged as a leading programming model for data-intensive computing. Many re...
Hadoop is a framework for storing and processing huge volumes of data on clusters. It uses Hadoop Di...
MapReduce has become a popular high performance computing paradigm for large-scale data processing. ...