Imbalanced data constitutes a challenge for knowledge management. This problem is even more complex in the presence of hybrid (numeric and categorical data) having missing values and multiple decision classes. Unfortunately, health-related information is often multiclass, hybrid, and imbalanced. This paper introduces a novel undersampling procedure that deals with multiclass hybrid data. We explore its impact on the performance of the recently proposed customized naïve associative classifier (CNAC). The experiments made, and the statistical analysis, show that the proposed method surpasses existing classifiers, with the advantage of being able to deal with multiclass, hybrid, and incomplete data with a low computational cost. In addition, o...
This paper presents a data pre-processing algorithm to tackle class imbalance in classification prob...
In many application domains such as medicine, information retrieval, cybersecurity, social media, et...
Dealing with imbalanced datasets is a recurrent issue in health-care data processing. Most literatur...
This paper proposes a novel undersampling method, for dealing with imbalanced datasets. The proposal...
Learning from imbalanced data poses significant challenges for the classifier. This becomes even mor...
Undersampling has been widely used in the class-imbalance learning area. The main deficiency of most...
Multi-class imbalanced data classification in supervised learning is one of the most challenging res...
Class imbalance is a crucial problem in machine learning and occurs in many domains. Specifically, t...
Many machine learning classification algorithms assume that the target classes share similar prior p...
Class imbalanced datasets are common across different domains including health, security, banking an...
There is an unprecedented amount of data available. This has caused knowledge discovery to garner at...
In this thesis, a comparison of three different pre-processing methods for imbalanced classification...
In this paper, a novel inverse random undersampling (IRUS) method is proposed for the class imbalanc...
A dataset is said to be imbalanced when its classes are disproportionately represented in terms of t...
In the medical field, many outcome variables are dichotomized, and the two possible values of a dich...
This paper presents a data pre-processing algorithm to tackle class imbalance in classification prob...
In many application domains such as medicine, information retrieval, cybersecurity, social media, et...
Dealing with imbalanced datasets is a recurrent issue in health-care data processing. Most literatur...
This paper proposes a novel undersampling method, for dealing with imbalanced datasets. The proposal...
Learning from imbalanced data poses significant challenges for the classifier. This becomes even mor...
Undersampling has been widely used in the class-imbalance learning area. The main deficiency of most...
Multi-class imbalanced data classification in supervised learning is one of the most challenging res...
Class imbalance is a crucial problem in machine learning and occurs in many domains. Specifically, t...
Many machine learning classification algorithms assume that the target classes share similar prior p...
Class imbalanced datasets are common across different domains including health, security, banking an...
There is an unprecedented amount of data available. This has caused knowledge discovery to garner at...
In this thesis, a comparison of three different pre-processing methods for imbalanced classification...
In this paper, a novel inverse random undersampling (IRUS) method is proposed for the class imbalanc...
A dataset is said to be imbalanced when its classes are disproportionately represented in terms of t...
In the medical field, many outcome variables are dichotomized, and the two possible values of a dich...
This paper presents a data pre-processing algorithm to tackle class imbalance in classification prob...
In many application domains such as medicine, information retrieval, cybersecurity, social media, et...
Dealing with imbalanced datasets is a recurrent issue in health-care data processing. Most literatur...