a b s t r a c t The purpose of this paper is to analyze the imbalanced learning task in the multilabel scenario, aiming to accomplish two different goals. The first one is to present specialized measures directed to assess the imbalance level in multilabel datasets (MLDs). Using these measures we will be able to conclude which MLDs are imbalanced, and therefore would need an appropriate treatment. The second objective is to propose several algorithms designed to reduce the imbalance in MLDs in a classifier-independent way, by means of resampling techniques. Two different approaches to divide the instances in minority and majority groups are studied. One of them considers each label combination as class identifier, whereas the other one perf...
Most existing classification approaches assume the underlying training set is evenly distributed. In...
Learning from imbalanced data poses significant challenges for the classifier. This becomes even mor...
Multi-class imbalanced data classification in supervised learning is one of the most challenging res...
Label imbalance is one of the characteristics of multilabel data, and imbalanced data seriously affe...
Simultaneous multiple labelling of documents, also known as multilabel text classification, will not...
Simultaneous multiple labelling of documents, also known as multilabel text classification, will not...
Multilabel classification is a challenging research problem in which each instance is assigned to a ...
In this paper, a novel inverse random undersampling (IRUS) method is proposed for the class imbalanc...
In many application domains such as medicine, information retrieval, cybersecurity, social media, et...
Imbalanced datasets are a well-known problem in data mining, where the datasets are composed of two ...
Abstract. Learning classifiers from imbalanced or skewed datasets is an important topic, arising ver...
Since many important real-world classification problems involve learning from unbalanced data, the c...
Imbalanced class problem (machine learning) is a problem that arises because of the significant diff...
Many machine learning classification algorithms assume that the target classes share similar prior p...
The relations between multiple imbalanced classes can be handled with a specialized approach which e...
Most existing classification approaches assume the underlying training set is evenly distributed. In...
Learning from imbalanced data poses significant challenges for the classifier. This becomes even mor...
Multi-class imbalanced data classification in supervised learning is one of the most challenging res...
Label imbalance is one of the characteristics of multilabel data, and imbalanced data seriously affe...
Simultaneous multiple labelling of documents, also known as multilabel text classification, will not...
Simultaneous multiple labelling of documents, also known as multilabel text classification, will not...
Multilabel classification is a challenging research problem in which each instance is assigned to a ...
In this paper, a novel inverse random undersampling (IRUS) method is proposed for the class imbalanc...
In many application domains such as medicine, information retrieval, cybersecurity, social media, et...
Imbalanced datasets are a well-known problem in data mining, where the datasets are composed of two ...
Abstract. Learning classifiers from imbalanced or skewed datasets is an important topic, arising ver...
Since many important real-world classification problems involve learning from unbalanced data, the c...
Imbalanced class problem (machine learning) is a problem that arises because of the significant diff...
Many machine learning classification algorithms assume that the target classes share similar prior p...
The relations between multiple imbalanced classes can be handled with a specialized approach which e...
Most existing classification approaches assume the underlying training set is evenly distributed. In...
Learning from imbalanced data poses significant challenges for the classifier. This becomes even mor...
Multi-class imbalanced data classification in supervised learning is one of the most challenging res...