Abstract — The specific choice of workload task scheduler for Hadoop MapReduce applications can have a dramatic effect on job workload latency. The Hadoop Fair Scheduler (FairS) assigns resources to jobs such that all jobs get, on average, an equal share of resources over time. Thus, it addresses the problem with a FIFO scheduler, where short jobs have to wait for long-running jobs to complete. We show that even under FairS, jobs are still forced to wait significantly when the MapReduce system assigns equal shares of resources, due to dependencies between the Map, Shuffle, Sort, and Reduce phases. We propose a Hybrid Scheduler (HybS) algorithm based on dynamic priority in order to reduce the latency for variable length concurrent jobs, while maint...
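To make the dynamic-priority idea concrete, below is a minimal Java sketch (not the paper's implementation) of how a scheduler might order waiting jobs when priority grows with time spent waiting and shrinks with estimated job length. The Job class, the priority formula, and all names are illustrative assumptions, not HybS internals.

// Minimal sketch of dynamic-priority job ordering in the spirit of the abstract above.
// Assumption (not from the paper): priority = waitingTime / estimatedRuntime, so short
// jobs overtake long-running ones without starving them, since long jobs still age upward.
import java.util.Comparator;
import java.util.PriorityQueue;

public class DynamicPrioritySketch {

    static class Job {
        final String name;
        final long submitTimeMs;        // when the job entered the queue
        final long estimatedRuntimeMs;  // rough estimate of total work (hypothetical field)

        Job(String name, long submitTimeMs, long estimatedRuntimeMs) {
            this.name = name;
            this.submitTimeMs = submitTimeMs;
            this.estimatedRuntimeMs = estimatedRuntimeMs;
        }

        // Dynamic priority: grows with waiting time, shrinks with estimated job length.
        double priority(long nowMs) {
            double waitedMs = Math.max(1, nowMs - submitTimeMs);
            return waitedMs / (double) estimatedRuntimeMs;
        }
    }

    public static void main(String[] args) {
        long now = 100_000;
        // Highest dynamic priority is dispatched first.
        PriorityQueue<Job> queue = new PriorityQueue<>(
                Comparator.comparingDouble((Job j) -> -j.priority(now)));

        queue.add(new Job("long-etl",    now - 60_000, 3_600_000)); // long job, waited 60 s
        queue.add(new Job("short-query", now - 20_000,    60_000)); // short job, waited 20 s

        while (!queue.isEmpty()) {
            Job next = queue.poll();
            System.out.printf("dispatch %s (priority %.3f)%n", next.name, next.priority(now));
        }
    }
}

Under this ordering the short query is dispatched first even though the long job has waited three times longer, which illustrates the latency gap the abstract attributes to plain equal sharing of slots across phases.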
Hadoop offers a platform to process big data. Hadoop Distributed File System (HDFS) and MapReduce ar...
Applications in many areas are increasingly developed and ported using the MapReduce framework (more...
Abstract — Inspired by the success of Apache's Hadoop, this paper suggests a new reduce task scheduler. ...
MapReduce is a popular parallel computing paradigm for large-scale data processing in clusters and d...
Abstract — In this paper, we propose a novel algorithm to solve the starvation problem of the small jo...
Management of Big Data is a challenging issue. The MapReduce environment is the widely used key solu...
MapReduce has become a popular high performance computing paradigm for large-scale data processing. ...
For large-scale parallel applications, MapReduce is a widely used programming model. MapReduce is an ...
Data-intensive computing holds the promise of major scientific breakthroughs and discoveries from th...
Hadoop is a framework for storing and processing huge volumes of data on clusters. It uses Hadoop Di...
Data generated in the past few years cannot be efficiently manipulated with the traditional way of s...
MapReduce is an emerging paradigm for data intensive processing with support of cloud computing tech...
Hadoop’s implementation of the MapReduce programming model pipelines the data processing and provid...
Nowadays, data-intensive problems are so prevalent that numerous organizations in various industries...
Over the last ten years MapReduce has emerged as one of the staples of distributed computing both in...