Added toxicity in translation means producing a translation that contains more toxicity than the input. In this paper, we present MinTox, a novel pipeline that identifies added toxicity and mitigates it at inference time. MinTox uses a multimodal (speech and text) toxicity detection classifier that works across languages at scale. The mitigation method likewise applies to languages at scale and operates directly on text outputs. MinTox is applied to SEAMLESSM4T, the latest multimodal and massively multilingual machine translation system. For this system, MinTox achieves significant added toxicity mitigation across domains, modalities and language directions. MinTox manages to...
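The detect-then-mitigate idea can be illustrated with a minimal sketch. This is not the MinTox implementation: it assumes a toy word-list detector in place of the multimodal classifier, and a hypothetical `translate(source, banned)` callable that can be asked to avoid a set of words. The names `TOXIC_TERMS`, `toxic_tokens`, `has_added_toxicity` and `translate_with_mitigation` are all illustrative.

```python
import re
from typing import Callable, FrozenSet, List

# Assumption: a tiny illustrative word list per language. A real system
# would rely on a curated resource or a trained classifier instead.
TOXIC_TERMS = {
    "en": {"idiot", "stupid"},
    "es": {"idiota", "estúpido"},
}


def toxic_tokens(text: str, lang: str) -> List[str]:
    """Return every token of `text` found in the word list for `lang`."""
    terms = TOXIC_TERMS.get(lang, set())
    return [tok for tok in re.findall(r"\w+", text.lower()) if tok in terms]


def has_added_toxicity(source: str, src_lang: str,
                       output: str, tgt_lang: str) -> bool:
    """Added toxicity: the output holds more toxic tokens than the input."""
    return len(toxic_tokens(output, tgt_lang)) > len(toxic_tokens(source, src_lang))


def translate_with_mitigation(
    source: str,
    src_lang: str,
    tgt_lang: str,
    translate: Callable[[str, FrozenSet[str]], str],
) -> str:
    """Translate once; if toxicity was added, retry with those words banned.

    `translate` is a hypothetical hook: it receives the source text and a
    set of target-language words it should avoid producing.
    """
    output = translate(source, frozenset())
    if has_added_toxicity(source, src_lang, output, tgt_lang):
        banned = frozenset(toxic_tokens(output, tgt_lang))
        output = translate(source, banned)
    return output
```

Because the second pass runs only when added toxicity is detected, translations that are already faithful (including faithful translations of toxic input) are left untouched.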
ABSTRACT: The impact of dictionaries, translation memories and monolingual corpora on linguistic err...
Language models (LMs) can reproduce (or amplify) toxic language seen during training, which poses a ...
In recent years, the widespread use of social media has led to an increase in the generation of toxi...
In this paper we present the DETOXIS task, DEtection of TOxicity in comments In Spanish, which took ...
Detection of some types of toxic language is hampered by extreme scarcity of labeled training data. ...
[EN] This paper describes our participation in the DEtection of TOXicity in comments In Spanish (DET...
Transformer-based Language Models (LMs) have achieved impressive results on natural language underst...
Toxic language detection systems often falsely flag text that contains minority group mentions as to...
Thesis (Master's)--University of Washington, 2021. Biased associations have been a challenge in the de...
Social media provides a public and convenient platform for people to communicate. However, it is als...
Transformer-based language models are able to generate fluent text and be efficiently adapted across...
Pre-trained language models (LMs) are shown to easily generate toxic language. In this work, we syst...
Warning: this paper contains model outputs exhibiting offensiveness and biases. Recently pre-trained...
Speech-to-speech translation is a challenging task mixing two of the most ambitious Natural Language...