Vision-language models can encode societal biases and stereotypes, but there are challenges to measuring and mitigating these harms. Prior proposed bias measurements lack robustness and feature degradation occurs when mitigating bias without access to pretraining data. We address both of these challenges in this paper: First, we evaluate different bias measures and propose the use of retrieval metrics to image-text representations via a bias measuring framework. Second, we investigate debiasing methods and show that optimizing for adversarial loss via learnable token embeddings minimizes various bias measures without substantially degrading feature representations.Comment: 24 pages, 10 figures. For code and trained token embeddings, see h...
Data distortion is commonly applied in vision models during both training (e.g methods like MixUp an...
Distributional word vectors have recently been shown to encode many of the human biases, most notabl...
Masked Language Models (MLMs) have been successful in many natural language processing tasks. Howeve...
Bias in classifiers is a severe issue of modern deep learning methods, especially for their applicat...
In image classification, debiasing aims to train a classifier to be less susceptible to dataset bias...
We generalize the notion of measuring social biases in word embeddings to visually grounded word emb...
International audienceMany datasets are biased, namely they contain easy-to-learn features that are ...
Adversarial training is a common approach for bias mitigation in natural language processing. Althou...
Language Representation Models (LRMs) trained with real-world data may capture and exacerbate undesi...
The presence of a bias in each image data collection has recently attracted a lot of attention in th...
Deep learning models have achieved excellent recognition results on large-scale video benchmarks. Ho...
Distributional word vectors have recently been shown to encode many of the human biases, most notabl...
Generating images from textual descriptions has gained a lot of attention. Recently, DALL-E, a multi...
Large pre-trained language models are successfully being used in a variety of tasks, across many lan...
<p>The presence of bias in existing object recognition datasets is now well-known in the computer vi...
Data distortion is commonly applied in vision models during both training (e.g methods like MixUp an...
Distributional word vectors have recently been shown to encode many of the human biases, most notabl...
Masked Language Models (MLMs) have been successful in many natural language processing tasks. Howeve...
Bias in classifiers is a severe issue of modern deep learning methods, especially for their applicat...
In image classification, debiasing aims to train a classifier to be less susceptible to dataset bias...
We generalize the notion of measuring social biases in word embeddings to visually grounded word emb...
International audienceMany datasets are biased, namely they contain easy-to-learn features that are ...
Adversarial training is a common approach for bias mitigation in natural language processing. Althou...
Language Representation Models (LRMs) trained with real-world data may capture and exacerbate undesi...
The presence of a bias in each image data collection has recently attracted a lot of attention in th...
Deep learning models have achieved excellent recognition results on large-scale video benchmarks. Ho...
Distributional word vectors have recently been shown to encode many of the human biases, most notabl...
Generating images from textual descriptions has gained a lot of attention. Recently, DALL-E, a multi...
Large pre-trained language models are successfully being used in a variety of tasks, across many lan...
<p>The presence of bias in existing object recognition datasets is now well-known in the computer vi...
Data distortion is commonly applied in vision models during both training (e.g methods like MixUp an...
Distributional word vectors have recently been shown to encode many of the human biases, most notabl...
Masked Language Models (MLMs) have been successful in many natural language processing tasks. Howeve...