Apache Spark is an open-source distributed platform that uses distributed in-memory computation to process big data. Spark exposes more than 180 configuration parameters. These settings directly control how efficiently Spark processes big data, yet obtaining the best outcome is challenging precisely because there are so many of them. Currently, the predominant parameters are tuned manually by trial and error. To overcome this manual tuning problem, this paper proposes and develops a self-tuning approach based on machine learning, which can adjust parameter values whenever required. The approach was implemented on a Dell server, and experiments were carried out on five datasets of different sizes and para...
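To make the notion of "configuration parameters" concrete, the minimal sketch below shows how a few commonly tuned Spark settings can be supplied to a job programmatically. The parameter names are standard Spark options, but the specific values, the local-mode master, and the idea of feeding the values from an external tuner are illustrative assumptions, not the method proposed in this paper.

# Minimal PySpark sketch (assumes pyspark is installed; values are placeholders, not tuned recommendations).
from pyspark import SparkConf
from pyspark.sql import SparkSession

# Values that a tuner (manual or ML-based) might choose for a given workload.
candidate_config = {
    "spark.executor.memory": "4g",          # memory per executor
    "spark.executor.cores": "2",            # cores per executor
    "spark.sql.shuffle.partitions": "200",  # shuffle parallelism
    "spark.serializer": "org.apache.spark.serializer.KryoSerializer",
}

# Local mode keeps the sketch self-contained; a real deployment would point at YARN or a standalone cluster.
conf = SparkConf().setAppName("config-tuning-sketch").setMaster("local[*]")
for key, value in candidate_config.items():
    conf.set(key, value)

spark = SparkSession.builder.config(conf=conf).getOrCreate()
# ... run the workload and measure its runtime here ...
spark.stop()

A self-tuning approach like the one described in the abstract would presumably replace the hand-written dictionary with parameter values predicted by a trained model for the current workload and dataset size.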
Big data processing systems (e.g., Hadoop, Spark, Storm) contain a vast number of configuration para...
Big data Hadoop and Spark applications are deployed on infrastructure managed by resource managers s...
Apache Spark, famously known for its big data handling ability, is a distributed open-source framework t...
In the era of Big Data, machine learning has taken on a whole new role. With the amount of data pres...
The Apache Hadoop framework is an open source implementation of MapReduce for processing and storing...
Apache Spark is a popular open-source distributed processing framework that enables efficient proces...
Apache Spark is a popular open-source distributed data processing framework that can efficiently pro...
As the era of “big data” has arrived, more and more companies start using distributed file systems t...
Hadoop provides a scalable solution on traditional cluster-based Big Data platforms but imposes per...
One of the most widely used frameworks for programming MapReduce-based applications is Apac...
The distributed data analytic system - Spark is a common choice for processing massive volumes of he...