Practitioners of data mining and machine learning have long observed that the imbalance of classes in a data set negatively impacts the quality of classifiers trained on that data. Numerous techniques for coping with such imbalances have been proposed, but nearly all lack any theoretical grounding. By contrast, the standard theoretical analysis of machine learning admits no dependence on the imbalance of classes at all. The basic theorems of statistical learning establish the number of examples needed to estimate the accuracy of a classifier as a function of its complexity (VC-dimension) and the confidence desired; the class imbalance does not enter these formulas anywhere. In this work, we consider the measures of classifier performance in...
The most widely spread measure of performance, accuracy, suffers from a paradox: predictive models w...
In this contribution, the question of reporting performance of binary classifiers is opened in cont...
Abstract In machine learning problems, dierences in prior class probabilities|or class imbalances|ha...
Abstract. Many real world datasets exhibit skewed class distributions in which almost all instances ...
Background: There has been much discussion amongst automated software defect prediction researchers ...
Abstract—In the last decade, class imbalance has attracted a huge amount of attention from researche...
Class-wise characteristics of training examples affect the performance of deep classifiers. A well-s...
There are several aspects that might influence the performance achieved by existing learning systems...
We present a comprehensive suite of experimentation on the subject of learning from imbalanced data....
From a machine learning perspective, information retrieval may be viewed as a problem of classifying...
The present paper studies the influence of two distinct factors on the performance of some resamplin...
Invited paper for the AAAI'2000 Workshop on Imbalanced Data Sets.For research to progress most effec...
Many real world applications involve highly imbalanced class distribution. Research into learning fr...
The field of machine learning has made a lot of progress in the recent years. As it is used more fre...
Abstract—Evaluating the performance of a classification algorithm critically requires a measure of t...
The most widely spread measure of performance, accuracy, suffers from a paradox: predictive models w...
In this contribution, the question of reporting performance of binary classifiers is opened in cont...
Abstract In machine learning problems, dierences in prior class probabilities|or class imbalances|ha...
Abstract. Many real world datasets exhibit skewed class distributions in which almost all instances ...
Background: There has been much discussion amongst automated software defect prediction researchers ...
Abstract—In the last decade, class imbalance has attracted a huge amount of attention from researche...
Class-wise characteristics of training examples affect the performance of deep classifiers. A well-s...
There are several aspects that might influence the performance achieved by existing learning systems...
We present a comprehensive suite of experimentation on the subject of learning from imbalanced data....
From a machine learning perspective, information retrieval may be viewed as a problem of classifying...
The present paper studies the influence of two distinct factors on the performance of some resamplin...
Invited paper for the AAAI'2000 Workshop on Imbalanced Data Sets.For research to progress most effec...
Many real world applications involve highly imbalanced class distribution. Research into learning fr...
The field of machine learning has made a lot of progress in the recent years. As it is used more fre...
Abstract—Evaluating the performance of a classification algorithm critically requires a measure of t...
The most widely spread measure of performance, accuracy, suffers from a paradox: predictive models w...
In this contribution, the question of reporting performance of binary classifiers is opened in cont...
Abstract In machine learning problems, dierences in prior class probabilities|or class imbalances|ha...