This research proposes a novel runtime system, Habanero Hadoop, to tackle the inefficient utilization of multi-core machines' memory in the existing Hadoop MapReduce runtime system. Insufficient memory for each map task leads to the inability to tackle large-scale problems such as genome sequencing and data clustering. The Habanero Hadoop system integrates a shared memory model into the fully distributed memory model of the Hadoop MapReduce system. The improvements eliminate duplication of in-memory data structures used in the map phase, making more memory available to each map task. Previous works optimizing multi-core performance for MapReduce runtime focused on maximizing CPU utilization rather than memory efficiency. My work provided mu...
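The core idea above, sharing one copy of a read-only in-memory structure among map tasks instead of duplicating it per task, can be sketched as follows. This is a hypothetical illustration, not the Habanero Hadoop API: it models map tasks as threads in one JVM that all consult a single shared lookup table, where stock Hadoop would give each map task process its own copy.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: map tasks as threads sharing one read-only table.
// In stock Hadoop each map task JVM would hold its own copy of LOOKUP;
// running tasks as threads lets all of them reference the same instance,
// leaving more heap available for task-local state.
public class SharedLookupDemo {
    // One table, loaded once, shared by every "map task" thread in the JVM.
    static final Map<String, Integer> LOOKUP = new HashMap<>();
    static {
        LOOKUP.put("a", 1);
        LOOKUP.put("b", 2);
    }

    public static void main(String[] args) throws InterruptedException {
        int numTasks = 4;
        int[] results = new int[numTasks];
        List<Thread> mappers = new ArrayList<>();
        for (int i = 0; i < numTasks; i++) {
            final int id = i;
            Thread t = new Thread(() -> {
                // Each map task only reads the shared table; no per-task copy.
                results[id] = LOOKUP.get("a") + LOOKUP.get("b");
            });
            mappers.add(t);
            t.start();
        }
        for (Thread t : mappers) {
            t.join();
        }
        int total = 0;
        for (int r : results) {
            total += r;
        }
        System.out.println(total); // 4 tasks * (1 + 2) = 12
    }
}
```

Because the table is immutable after initialization, the threads need no synchronization to read it; mutable shared state would require locking or concurrent collections.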
In the last decade, data analysis has become one of the most popular tasks due to the enormous growth in data...
To distribute large datasets over multiple commodity servers and to perform a parallel computation a...
The interest in analyzing the growing amounts of data has encouraged the deployment of large scale p...
The underlying assumption behind Hadoop and, more generally, the need for distributed processing is ...
As the data growth rate outpaces the processing capabilities of CPUs, reaching petascale, tec...
Large quantities of data have been generated from multiple sources at exponential rates in the last ...
This study proposes an improvement and implementation of an enhanced Hadoop MapReduce workflow that deve...
Abstract—The MapReduce platform has been widely used for large-scale data processing and analysis re...
In an attempt to increase the performance/cost ratio, large compute clusters are becoming heterogene...
Over the last ten years MapReduce has emerged as one of the staples of distributed computing both in...
Abstract—In an attempt to increase the performance/cost ratio, large compute clusters are becoming h...
Part 2: Parallel and Multi-Core Technologies. As a widely used programming model...
MapReduce, the popular programming paradigm for large-scale data processing, has traditionally been ...
Abstract—As a core component of Hadoop, an open cloud platform, MapReduce is a distributed an...
With the fast development of networks, organizations these days have been overflowing with the collection o...