MAPSkew: Metaheuristic Approaches for Partitioning Skew in MapReduce

Matheus H. M. Pericini
Lucas G. M. Leite
Francisco H. de Carvalho-Junior
Javam C. Machado
Cenez A. Rezende

Open link

Publication date

December 2018

DOI

10.3390/a12010005

Publisher

MDPI AG

ISSN

1999-4893

Journal

Algorithms

Abstract

MapReduce is a parallel computing model in which a large dataset is split into smaller parts and executed on multiple machines. Due to its simplicity, MapReduce has been widely used in various applications domains. MapReduce can significantly reduce the processing time of a large amount of data by dividing the dataset into smaller parts and processing them in parallel in multiple machines. However, when data are not uniformly distributed, we have the so called partitioning skew, where the allocation of tasks to machines becomes unbalanced, either by the distribution function splitting the dataset unevenly or because a part of the data is more complex and requires greater computational effort. To solve this problem, we propose an approach ba...

Extracted data

We use cookies to provide a better user experience.

Data Protection

MAPSkew: Metaheuristic Approaches for Partitioning Skew in MapReduce

Abstract

Extracted data

MAPSkew: Metaheuristic Approaches for Partitioning Skew in MapReduce

Abstract

Extracted data

Related items

Related items