Neural networks have seen a spike in popularity in natural language processing in re- cent years. They consistently outperform the traditional methods and require less human labor to perfect as they are trained unsupervised on large text corpora. However, these corpora may contain unwanted elements such as biases. We inspect multiple language models, primarily focusing on a Czech monolingual model - RobeCzech. In the first part of this work, we present a dynamic benchmarking tool for identifying gender stereotypes in a language model. We present the tool to a group of annotators to create a dataset of biased sentences. In the second part, we introduce a method of measuring the model's perceived political values of men and women and compare ...
In recent years, the use of deep neural networks and dense vector embeddings for text representation...
Multilingual Neural Machine Translation architectures mainly differ in the amount of sharing modules...
In recent years, the use of deep neural networks and dense vector embeddings for text representation...
The ever-increasing number of systems based on semantic text analysis is making natural language und...
The ever-increasing number of applications based on semantic text analysis is making natural languag...
Language models are used for a variety of downstream applications, such as improving web search resu...
While the prevalence of large pre-trained language models has led to significant improvements in the...
International audiencePretraining language models led to significant improvements for NLP tasks. How...
Gender bias has been identified in many models for Natural Language Processing, stemming from implic...
Neural Machine Translation has the power of learning from a large collection of data, which allows i...
Gender stereotypes are perceptions about the typical physical, emotional, and social characteristics...
In a more connected world, communication between different native speakers has became more necessary...
Masked Language Models (MLMs) pre-trained by predicting masked tokens on large corpora have been use...
Sociolinguistic studies suggest that a relationship exists between the gender of a speaker and the ...
Artificial intelligence systems copy and amplify existing societal biases, a problem that by now is ...
In recent years, the use of deep neural networks and dense vector embeddings for text representation...
Multilingual Neural Machine Translation architectures mainly differ in the amount of sharing modules...
In recent years, the use of deep neural networks and dense vector embeddings for text representation...
The ever-increasing number of systems based on semantic text analysis is making natural language und...
The ever-increasing number of applications based on semantic text analysis is making natural languag...
Language models are used for a variety of downstream applications, such as improving web search resu...
While the prevalence of large pre-trained language models has led to significant improvements in the...
International audiencePretraining language models led to significant improvements for NLP tasks. How...
Gender bias has been identified in many models for Natural Language Processing, stemming from implic...
Neural Machine Translation has the power of learning from a large collection of data, which allows i...
Gender stereotypes are perceptions about the typical physical, emotional, and social characteristics...
In a more connected world, communication between different native speakers has became more necessary...
Masked Language Models (MLMs) pre-trained by predicting masked tokens on large corpora have been use...
Sociolinguistic studies suggest that a relationship exists between the gender of a speaker and the ...
Artificial intelligence systems copy and amplify existing societal biases, a problem that by now is ...
In recent years, the use of deep neural networks and dense vector embeddings for text representation...
Multilingual Neural Machine Translation architectures mainly differ in the amount of sharing modules...
In recent years, the use of deep neural networks and dense vector embeddings for text representation...