International audienceRecent research advances on Tangled Program Graphs (TPGs) have demonstrated that Genetic Programming (GP) can be used to build accurate classifiers. However, this performance has been tested on balanced classification problems while most of the real world classification problems are imbalanced, with both over-represented classes and rare classes. This paper explores the effect of imbalanced data on the performance of a TPG classifier, and proposes mitigation methods for imbalance-caused classifier performance degradation using adapted GP selection phases. The GP selection phase is characterized by a fitness function, and by a comparison operator. We show that adapting the TPG to imbalanced data significantly improves t...
One of the major challenges in automatic classification is to deal with highly dimensional data. Sev...
Classification is one of the most researchable ideas in machine learning and data mining. A wide ran...
The study of problem difficulty is an open issue in Genetic Programming (GP). Thegoal of this work i...
International audienceRecent research advances on Tangled Program Graphs (TPGs) have demonstrated th...
In classification,machine learning algorithms can suffer a performance bias when data sets are unbal...
In classification,machine learning algorithms can suffer a performance bias when data sets are unbal...
Class imbalance and high dimensionality have been acknowledged as two tough issues in classification...
This work investigates the use of sampling methods in Genetic Programming (GP) to improve the classi...
In supervised learning, class imbalanced data set is a state where the class distribution is not un...
Abstract — Classification in imbalanced domains is an important problem in Data Mining. We refer to ...
Classification of imbalanced datasets is a critical problem in numerous contexts. In these applicati...
peer-reviewedIn Machine Learning classification tasks, the class imbalance problem is an important o...
In some practical classification problems in which the number of instances of a particular class is ...
The term “data imbalance ” in classification is a well established phenomenon in which data set cont...
The estimation of problem difficulty is an open issue in genetic programming (GP). The goal of this ...
One of the major challenges in automatic classification is to deal with highly dimensional data. Sev...
Classification is one of the most researchable ideas in machine learning and data mining. A wide ran...
The study of problem difficulty is an open issue in Genetic Programming (GP). Thegoal of this work i...
International audienceRecent research advances on Tangled Program Graphs (TPGs) have demonstrated th...
In classification,machine learning algorithms can suffer a performance bias when data sets are unbal...
In classification,machine learning algorithms can suffer a performance bias when data sets are unbal...
Class imbalance and high dimensionality have been acknowledged as two tough issues in classification...
This work investigates the use of sampling methods in Genetic Programming (GP) to improve the classi...
In supervised learning, class imbalanced data set is a state where the class distribution is not un...
Abstract — Classification in imbalanced domains is an important problem in Data Mining. We refer to ...
Classification of imbalanced datasets is a critical problem in numerous contexts. In these applicati...
peer-reviewedIn Machine Learning classification tasks, the class imbalance problem is an important o...
In some practical classification problems in which the number of instances of a particular class is ...
The term “data imbalance ” in classification is a well established phenomenon in which data set cont...
The estimation of problem difficulty is an open issue in genetic programming (GP). The goal of this ...
One of the major challenges in automatic classification is to deal with highly dimensional data. Sev...
Classification is one of the most researchable ideas in machine learning and data mining. A wide ran...
The study of problem difficulty is an open issue in Genetic Programming (GP). Thegoal of this work i...