The long-tailed data distribution in real-world greatly increases the difficulty of training deep neural networks. Oversampling minority classes is one of the commonly used techniques to tackle this problem. In this paper, we first analyze that the commonly used oversampling technique tends to distort the representation learning and harm the network’s generalizability. Then we propose two novel methods to increase the minority feature’s diversity to alleviate such issue. Specifically, from the data perspective, we propose a mixup-based Synthetic Minority Over-sampling TEchnique called mixSMOTE, where tail class samples are synthesized from head classes so that a balanced training distribution can be obtained. Then from the model perspective...
Abstract. Imbalanced class distribution is a challenging problem in many real-life classification pr...
Real-world data often have a long-tailed distribution, where the number of samples per class is not ...
Deep learning algorithms face great challenges with long-tailed data distribution which, however, is...
The long-tailed data distribution in real-world greatly increases the difficulty of training deep ne...
The visual world naturally exhibits an imbalance in the number of object or scene instances resultin...
In the real world, large-scale datasets for visual recognition typically exhibit a long-tailed distr...
Real-world visual data often exhibits a long-tailed distribution, where some “head” classes have a l...
Thesis (Master's)--University of Washington, 2023In many real-world applications, the frequency dist...
International audienceLearning from imbalanced datasets remains a significant challenge for real-wor...
Many real-world recognition problems are characterized by long-tailed label distributions. These dis...
A dramatic increase in real-world video volume with extremely diverse and emerging topics naturally ...
The heavy reliance on data is one of the major reasons that currently limit the development of deep ...
Deep neural networks perform poorly on heavily class-imbalanced datasets. Given the promising perfor...
Long-tailed image recognition presents massive challenges to deep learning systems since the imbalan...
Thesis (Ph.D.)--University of Washington, 2022The predefined artificially-balanced training classes ...
Abstract. Imbalanced class distribution is a challenging problem in many real-life classification pr...
Real-world data often have a long-tailed distribution, where the number of samples per class is not ...
Deep learning algorithms face great challenges with long-tailed data distribution which, however, is...
The long-tailed data distribution in real-world greatly increases the difficulty of training deep ne...
The visual world naturally exhibits an imbalance in the number of object or scene instances resultin...
In the real world, large-scale datasets for visual recognition typically exhibit a long-tailed distr...
Real-world visual data often exhibits a long-tailed distribution, where some “head” classes have a l...
Thesis (Master's)--University of Washington, 2023In many real-world applications, the frequency dist...
International audienceLearning from imbalanced datasets remains a significant challenge for real-wor...
Many real-world recognition problems are characterized by long-tailed label distributions. These dis...
A dramatic increase in real-world video volume with extremely diverse and emerging topics naturally ...
The heavy reliance on data is one of the major reasons that currently limit the development of deep ...
Deep neural networks perform poorly on heavily class-imbalanced datasets. Given the promising perfor...
Long-tailed image recognition presents massive challenges to deep learning systems since the imbalan...
Thesis (Ph.D.)--University of Washington, 2022The predefined artificially-balanced training classes ...
Abstract. Imbalanced class distribution is a challenging problem in many real-life classification pr...
Real-world data often have a long-tailed distribution, where the number of samples per class is not ...
Deep learning algorithms face great challenges with long-tailed data distribution which, however, is...