Despite the tremendous advances in machine learning (ML), training with imbalanced data still poses challenges in many real-world applications. Among a series of diverse techniques to solve this problem, sampling algorithms are regarded as an efficient solution. However, the problem is more fundamental, with many works emphasizing the importance of instance hardness. This issue refers to the significance of managing unsafe or potentially noisy instances that are more likely to be misclassified and serve as the root cause of poor classification performance. This paper introduces HardVis, a visual analytics system designed to handle instance hardness mainly in imbalanced classification scenarios. Our proposed system assists users in visually ...
Abstract In the classification framework there are prob-lems in which the number of examples per cla...
There are several aspects that might influence the performance achieved by existing learning systems...
Underrepresentation and misrepresentation of protected groups in the training data is a significant ...
Despite the tremendous advances in machine learning (ML), training with imbalanced data still poses ...
Despite the tremendous advances in machine learning (ML), training with imbalanced data still poses ...
The field of machine learning has made a lot of progress in the recent years. As it is used more fre...
Most performance metrics for learning algorithms do not provide information about the misclassified ...
We introduce a method to deal with the problem of learning from imbalanced data sets, where examples...
Undersampling has been widely used in the class-imbalance learning area. The main deficiency of most...
Data plays a key role in the design of expert and intelligent systems and therefore, data preprocess...
Dissertation presented as the partial requirement for obtaining a Master's degree in Statistics and...
Machine learning applications are plagued by the imbalance observed among the class sizes in many re...
This paper applies various statistical techniques with the goal of maximizing model performance for ...
Deep metric learning aims to learn a deep embedding that can capture the semantic similarity of data...
Abstract—Under-sampling is a popular method in deal-ing with class-imbalance problems, which uses on...
Abstract In the classification framework there are prob-lems in which the number of examples per cla...
There are several aspects that might influence the performance achieved by existing learning systems...
Underrepresentation and misrepresentation of protected groups in the training data is a significant ...
Despite the tremendous advances in machine learning (ML), training with imbalanced data still poses ...
Despite the tremendous advances in machine learning (ML), training with imbalanced data still poses ...
The field of machine learning has made a lot of progress in the recent years. As it is used more fre...
Most performance metrics for learning algorithms do not provide information about the misclassified ...
We introduce a method to deal with the problem of learning from imbalanced data sets, where examples...
Undersampling has been widely used in the class-imbalance learning area. The main deficiency of most...
Data plays a key role in the design of expert and intelligent systems and therefore, data preprocess...
Dissertation presented as the partial requirement for obtaining a Master's degree in Statistics and...
Machine learning applications are plagued by the imbalance observed among the class sizes in many re...
This paper applies various statistical techniques with the goal of maximizing model performance for ...
Deep metric learning aims to learn a deep embedding that can capture the semantic similarity of data...
Abstract—Under-sampling is a popular method in deal-ing with class-imbalance problems, which uses on...
Abstract In the classification framework there are prob-lems in which the number of examples per cla...
There are several aspects that might influence the performance achieved by existing learning systems...
Underrepresentation and misrepresentation of protected groups in the training data is a significant ...