During the shuffle stage of the MapReduce framework, a large volume of data may be relocated to the same destination at the same time, which in turn can lead to the network hotspot problem. On the other hand, it is generally more effective to achieve better data locality by moving the computation closer to the data than the other way around. Doing so, however, may result in the partitioning skew problem, characterized by unbalanced computational loads among the destinations. Consequently, shuffling algorithms should consider all three criteria: data locality, partitioning skew, and network hotspots. To do so, we introduce MCSA, a Multi-Criteria Shuffling Algorithm for the MapReduce scheduling stage that rests o...
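The abstract above is truncated, so the internals of MCSA are not available here. The sketch below is only a minimal illustration, under assumed definitions, of how the three criteria could be folded into a single placement score: a weighted sum that rewards data locality and penalizes load imbalance (partitioning skew) and congested links (network hotspots). The class name, fields, and weights are hypothetical and are not taken from the paper.

```java
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

/**
 * Illustrative sketch only: a weighted multi-criteria score for picking a
 * reduce destination, balancing data locality, load (partitioning skew),
 * and link congestion (network hotspot). The weights, node fields, and the
 * linear scoring form are assumptions for illustration, not MCSA itself.
 */
public class MultiCriteriaPlacementSketch {

    /** A candidate reduce destination with normalized criteria in [0, 1]. */
    static final class Candidate {
        final String node;
        final double localData;   // fraction of the partition's input already on this node
        final double load;        // current load relative to the most loaded node
        final double congestion;  // inbound link utilization toward this node

        Candidate(String node, double localData, double load, double congestion) {
            this.node = node;
            this.localData = localData;
            this.load = load;
            this.congestion = congestion;
        }
    }

    // Hypothetical weights; a real scheduler would tune or learn these.
    static final double W_LOCALITY = 0.5;
    static final double W_LOAD = 0.3;
    static final double W_CONGESTION = 0.2;

    /** Higher is better: reward locality, penalize load imbalance and hot links. */
    static double score(Candidate c) {
        return W_LOCALITY * c.localData
             - W_LOAD * c.load
             - W_CONGESTION * c.congestion;
    }

    /** Pick the candidate with the best combined score. */
    static Candidate choose(List<Candidate> candidates) {
        return candidates.stream()
                .max(Comparator.comparingDouble(MultiCriteriaPlacementSketch::score))
                .orElseThrow(() -> new IllegalArgumentException("no candidates"));
    }

    public static void main(String[] args) {
        List<Candidate> candidates = Arrays.asList(
                new Candidate("node-1", 0.80, 0.90, 0.70),  // most local, but loaded and hot
                new Candidate("node-2", 0.40, 0.30, 0.20),  // less local, lightly loaded
                new Candidate("node-3", 0.10, 0.10, 0.10)); // remote but idle

        System.out.println("chosen destination: " + choose(candidates).node);
    }
}
```

A linear weighted sum keeps the trade-off between the three criteria explicit; an actual shuffling algorithm might instead apply per-criterion thresholds or adapt the weights at runtime.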
MapReduce is emerging as a prominent tool for big data processing. Data locali...
MapReduce has emerged as a popular programming model in the field of data-inte...
Over the past few decades, there has been a multifold increase in the amount of digital data that is being...
In the context of Hadoop, recent studies show that the shuffle operation accounts for as much as a t...
Whether it is for e-science or business, the amount of data produced every yea...
MapReduce is a parallel computing model in which a large dataset is split into smaller parts and exe...
Hadoop is a standard implementation of the MapReduce framework for running data-intensive applications o...
Algorithms for mitigating imbalance of the MapReduce computations are considered in this paper. Map...
MapReduce is a scalable parallel computing framework for big data processing. It exhibits m...
YARN is a popular cluster resource management platform. It does no...
This paper proposes and examines three in-memory shuffling methods designed to address problems ...
Reducing data transfer in MapReduce's shuffle phase is very important because ...
The healthcare industry has generated large amounts of data, and analyzing these has emerged as an i...
We consider algorithms for sorting and skew equi-join operations for computer clusters. The propose...
MapReduce has emerged as a leading programming model for data-intensive comput...