Adversarial training (AT) methods are effective against adversarial attacks, yet they introduce severe disparity of accuracy and robustness between different classes, known as the robust fairness problem. Previously proposed Fair Robust Learning (FRL) adaptively reweights different classes to improve fairness. However, the performance of the better-performed classes decreases, leading to a strong performance drop. In this paper, we observed two unfair phenomena during adversarial training: different difficulties in generating adversarial examples from each class (source-class fairness) and disparate target class tendencies when generating adversarial examples (target-class fairness). From the observations, we propose Balance Adversarial Tra...
The remarkable performance of deep learning models and their applications in consequential domains (...
Recent advances in Machine Learning (ML) and Deep Learning (DL) have resulted in the widespread adop...
Current SOTA adversarially robust models are mostly based on adversarial training (AT) and differ on...
Adversarial training (AT) methods are effective against adversarial attacks, yet they introduce seve...
Adversarial training is an effective learning technique to improve the robustness of deep neural net...
Adversarial Training is proved to be an efficient method to defend against adversarial examples, bei...
Deep Neural Networks (DNN) have been shown to be vulnerable to adversarial examples. Adversarial tra...
Adversarial training is a common approach for bias mitigation in natural language processing. Althou...
Adversarial training is an approach of increasing the robustness of models to adversarial attacks by...
International audienceUnwanted bias is a major concern in machine learning, raising in particular si...
Adversarial robustness has become a central goal in deep learning, both in theory and in practice. H...
Extended version of paper published in ACM AISec 2019; first two authors contributed equallyInternat...
International audienceIn recent years, a growing body of work has emerged on how to learn machine le...
Adversarial robustness has become a central goal in deep learning, both in the theory and the practi...
Adversarial examples are inputs for machine learning models that have been designed by attackers to ...
The remarkable performance of deep learning models and their applications in consequential domains (...
Recent advances in Machine Learning (ML) and Deep Learning (DL) have resulted in the widespread adop...
Current SOTA adversarially robust models are mostly based on adversarial training (AT) and differ on...
Adversarial training (AT) methods are effective against adversarial attacks, yet they introduce seve...
Adversarial training is an effective learning technique to improve the robustness of deep neural net...
Adversarial Training is proved to be an efficient method to defend against adversarial examples, bei...
Deep Neural Networks (DNN) have been shown to be vulnerable to adversarial examples. Adversarial tra...
Adversarial training is a common approach for bias mitigation in natural language processing. Althou...
Adversarial training is an approach of increasing the robustness of models to adversarial attacks by...
International audienceUnwanted bias is a major concern in machine learning, raising in particular si...
Adversarial robustness has become a central goal in deep learning, both in theory and in practice. H...
Extended version of paper published in ACM AISec 2019; first two authors contributed equallyInternat...
International audienceIn recent years, a growing body of work has emerged on how to learn machine le...
Adversarial robustness has become a central goal in deep learning, both in the theory and the practi...
Adversarial examples are inputs for machine learning models that have been designed by attackers to ...
The remarkable performance of deep learning models and their applications in consequential domains (...
Recent advances in Machine Learning (ML) and Deep Learning (DL) have resulted in the widespread adop...
Current SOTA adversarially robust models are mostly based on adversarial training (AT) and differ on...