The class imbalance problem, one of the common data irregularities, causes the development of under-represented models. To resolve this issue, the present study proposes a new cluster-based MapReduce design, entitled Distributed Cluster-based Resampling for Imbalanced Big Data (DIBID). The design aims at modifying the existing dataset to increase the classification success. Within the study, DIBID has been implemented on public datasets under two strategies. The first strategy has been designed to present the success of the model on data sets with different imbalanced ratios. The second strategy has been designed to compare the success of the model with other imbalanced big data solutions in the literature. According to the results, DIBID o...
Imbalanced class problem (machine learning) is a problem that arises because of the significant diff...
The volume of data in today’s applications has meant a change in the way Machine Learning issues are...
The problem of classification of imbalanced datasets is a critical one. With an increase in the numb...
Big Data applications are emerging during the last years, and researchers from many disciplines are ...
Abstract—The “big data ” term has caught the attention of experts in the context of learning from da...
Classification of imbalanced data has been reported to require modification of standard classificati...
Classification techniques in the big data scenario are in high demand in a wide variety of applicati...
Abstract — Classification techniques in the big data scenario are in high demand in a wide variety o...
Abstract In a majority–minority classification problem, class imbalance in the dataset(s) can dramat...
Addressing the huge amount of data continuously generated is an important challenge in the Machine L...
Most existing classification approaches assume the underlying training set is evenly distributed. In...
Imbalance datasets exist in many real-world domains. It is straightforward to apply classification a...
In many application domains such as medicine, information retrieval, cybersecurity, social media, et...
Learning from imbalanced data is among the most challenging areas in contemporary machine learning. ...
In many applications of data mining, class imbalance is noticed when examples in one class are overr...
Imbalanced class problem (machine learning) is a problem that arises because of the significant diff...
The volume of data in today’s applications has meant a change in the way Machine Learning issues are...
The problem of classification of imbalanced datasets is a critical one. With an increase in the numb...
Big Data applications are emerging during the last years, and researchers from many disciplines are ...
Abstract—The “big data ” term has caught the attention of experts in the context of learning from da...
Classification of imbalanced data has been reported to require modification of standard classificati...
Classification techniques in the big data scenario are in high demand in a wide variety of applicati...
Abstract — Classification techniques in the big data scenario are in high demand in a wide variety o...
Abstract In a majority–minority classification problem, class imbalance in the dataset(s) can dramat...
Addressing the huge amount of data continuously generated is an important challenge in the Machine L...
Most existing classification approaches assume the underlying training set is evenly distributed. In...
Imbalance datasets exist in many real-world domains. It is straightforward to apply classification a...
In many application domains such as medicine, information retrieval, cybersecurity, social media, et...
Learning from imbalanced data is among the most challenging areas in contemporary machine learning. ...
In many applications of data mining, class imbalance is noticed when examples in one class are overr...
Imbalanced class problem (machine learning) is a problem that arises because of the significant diff...
The volume of data in today’s applications has meant a change in the way Machine Learning issues are...
The problem of classification of imbalanced datasets is a critical one. With an increase in the numb...