A major issue in the classification of class imbalanced datasets involves the determination of the most suitable performance metrics to be used. In previous work using several examples, it has been shown that imbalance can exert a major impact on the value and meaning of accuracy and on certain other well-known performance metrics. In this paper, our approach goes beyond simply studying case studies and develops a systematic analysis of this impact by simulating the results obtained using binary classifiers. A set of functions and numerical indicators are attained which enables the comparison of the behaviour of several performance metrics based on the binary confusion matrix when they are faced with imbalanced datasets. Throughout the pape...
AbstractPerformance measures are used in various stages of the process aimed at solving a classifica...
This paper introduces a framework that allows to mitigate the impact of class imbalance on most scal...
The present paper studies the influence of two distinct factors on the performance of some resamplin...
In binary classification, when the distribution of numbers in the class is imbalanced, we are aimed ...
In this contribution, the question of reporting performance of binary classifiers is opened in cont...
The vast majority of statistical theory on binary classification characterizes performance in terms ...
This paper introduces a new metric, named Index of Balanced Accuracy, for evaluating learning proces...
This research tested the following well known strategies to deal with binary imbalanced data on 82 d...
In imbalanced multi-class classification problems, the misclassification rate as an error measure ma...
In this contribution, the question of reporting performance of binary classifiers is opened in conte...
Machine learning classifiers are designed with the underlying assumption of a roughly balanced numbe...
Machine learning classifiers are designed with the underlying assumption of a roughly balanced numbe...
Machine learning classifiers are designed with the underlying assumption of a roughly balanced numbe...
Machine learning models may not be able to effectively learn and predict from imbalanced data in the...
The class imbalance problem is prevalent in many domains including medical, natural language process...
AbstractPerformance measures are used in various stages of the process aimed at solving a classifica...
This paper introduces a framework that allows to mitigate the impact of class imbalance on most scal...
The present paper studies the influence of two distinct factors on the performance of some resamplin...
In binary classification, when the distribution of numbers in the class is imbalanced, we are aimed ...
In this contribution, the question of reporting performance of binary classifiers is opened in cont...
The vast majority of statistical theory on binary classification characterizes performance in terms ...
This paper introduces a new metric, named Index of Balanced Accuracy, for evaluating learning proces...
This research tested the following well known strategies to deal with binary imbalanced data on 82 d...
In imbalanced multi-class classification problems, the misclassification rate as an error measure ma...
In this contribution, the question of reporting performance of binary classifiers is opened in conte...
Machine learning classifiers are designed with the underlying assumption of a roughly balanced numbe...
Machine learning classifiers are designed with the underlying assumption of a roughly balanced numbe...
Machine learning classifiers are designed with the underlying assumption of a roughly balanced numbe...
Machine learning models may not be able to effectively learn and predict from imbalanced data in the...
The class imbalance problem is prevalent in many domains including medical, natural language process...
AbstractPerformance measures are used in various stages of the process aimed at solving a classifica...
This paper introduces a framework that allows to mitigate the impact of class imbalance on most scal...
The present paper studies the influence of two distinct factors on the performance of some resamplin...