With the rapid development of internet technology, the amount of collected or generated data has increased exponentially. The sheer volume, complexity, and unbalanced nature of this data pose a challenge to the scientific community to extract meaningful information from this data within a reasonable time. In this paper, we implemented a scalable design of an artificial bee colony for big data classification using Apache Spark. In addition, a new fitness function is proposed to handle unbalanced data. Two experiments were performed using the real unbalanced datasets to assess the performance and scalability of our proposed algorithm. The performance results reveal that our proposed fitness function can efficiently deal with unbalanced datase...
© 2015 IEEE. Class imbalanced data is a common problem for predictive modelling in domains such as b...
Artificial Bee Colony (ABC) algorithm is a relatively new swarm intelligence technique for clusterin...
For improving the classification accuracy of the classifier, a novel classification methodology base...
The digital age has added a new term to the literature of information and computer sciences called a...
The classification of datasets with a skewed class distribution is an important problem in data mini...
In recent years, the researchers have witnessed the changes or transformations driven by the existen...
The big data term and its formal definition have changed the properties of some of the computational...
Artificial Bee Colony (ABC) algorithm which is one of the most recently introduced optimization algo...
Artificial Bee Colony (ABC) algorithm is considered new and widely used in searching for optimum sol...
Artificial Bee Colony (ABC) algorithm is considered new and widely used in searching for optimum sol...
Many system of interest in sciences can be represented as network (social network, biological networ...
Apache spark, famously known for big data handling ability, is a distributed open-source framework t...
Part 3: MHDWInternational audienceOne of the main characteristics of our time is the growth of the d...
The design of efficient big data learning models has become a common need in a great number of appli...
Nowadays, the big data marketplace is rising rapidly. The big challenge is finding a system that can...
© 2015 IEEE. Class imbalanced data is a common problem for predictive modelling in domains such as b...
Artificial Bee Colony (ABC) algorithm is a relatively new swarm intelligence technique for clusterin...
For improving the classification accuracy of the classifier, a novel classification methodology base...
The digital age has added a new term to the literature of information and computer sciences called a...
The classification of datasets with a skewed class distribution is an important problem in data mini...
In recent years, the researchers have witnessed the changes or transformations driven by the existen...
The big data term and its formal definition have changed the properties of some of the computational...
Artificial Bee Colony (ABC) algorithm which is one of the most recently introduced optimization algo...
Artificial Bee Colony (ABC) algorithm is considered new and widely used in searching for optimum sol...
Artificial Bee Colony (ABC) algorithm is considered new and widely used in searching for optimum sol...
Many system of interest in sciences can be represented as network (social network, biological networ...
Apache spark, famously known for big data handling ability, is a distributed open-source framework t...
Part 3: MHDWInternational audienceOne of the main characteristics of our time is the growth of the d...
The design of efficient big data learning models has become a common need in a great number of appli...
Nowadays, the big data marketplace is rising rapidly. The big challenge is finding a system that can...
© 2015 IEEE. Class imbalanced data is a common problem for predictive modelling in domains such as b...
Artificial Bee Colony (ABC) algorithm is a relatively new swarm intelligence technique for clusterin...
For improving the classification accuracy of the classifier, a novel classification methodology base...