The blind application of machine learning runs the risk of amplifying biases present in data. Such a danger is facing us with word embedding, a popular framework to represent text data as vectors which has been used in many machine learning and natural language processing tasks. We show that even word embeddings trained on Google News articles exhibit female/male gender stereotypes to a disturbing extent. This raises concerns because their widespread use, as we describe, often tends to amplify these biases. Geometrically, gender bias is first shown to be captured by a direction in the word embedding. Second, gender neutral words are shown to be linearly separable from gender definition words in the word embedding. Using these properties, we...
Word embedding captures the semantic and syntactic meaning of words into dense vectors. It contains ...
The ever-increasing number of applications based on semantic text analysis is making natural languag...
As machine learning becomes more influential in everyday life, we must begin addressing potential sh...
With the constant advancement of the way that we use technology, there is often a blind application ...
Machine learning algorithms are optimized to model statistical properties of the training data. If...
From Curriculum Vitae parsing to web search and recommendation systems, Word2Vec and other word embe...
From Curriculum Vitae parsing to web search and recommendation systems, Word2Vec and other word embe...
With widening deployments of natural language processing (NLP) in daily life, inherited social biase...
Words embeddings are the fundamental input to a wide and varied range of NLP applications. It has be...
Word embedding has become essential for natural language processing as it boosts empirical performan...
International audienceRecent advances in Representation Learning have discovered a strong inclinatio...
The ever-increasing number of systems based on semantic text analysis is making natural language und...
Word embeddings are useful for various applications, such as sentiment classification (Tang et al., ...
It has been shown that word embeddings can exhibit gender bias, and various methods have been propos...
Research on language and gender has a long tradition, and large electronic text corpora and novel co...
Word embedding captures the semantic and syntactic meaning of words into dense vectors. It contains ...
The ever-increasing number of applications based on semantic text analysis is making natural languag...
As machine learning becomes more influential in everyday life, we must begin addressing potential sh...
With the constant advancement of the way that we use technology, there is often a blind application ...
Machine learning algorithms are optimized to model statistical properties of the training data. If...
From Curriculum Vitae parsing to web search and recommendation systems, Word2Vec and other word embe...
From Curriculum Vitae parsing to web search and recommendation systems, Word2Vec and other word embe...
With widening deployments of natural language processing (NLP) in daily life, inherited social biase...
Words embeddings are the fundamental input to a wide and varied range of NLP applications. It has be...
Word embedding has become essential for natural language processing as it boosts empirical performan...
International audienceRecent advances in Representation Learning have discovered a strong inclinatio...
The ever-increasing number of systems based on semantic text analysis is making natural language und...
Word embeddings are useful for various applications, such as sentiment classification (Tang et al., ...
It has been shown that word embeddings can exhibit gender bias, and various methods have been propos...
Research on language and gender has a long tradition, and large electronic text corpora and novel co...
Word embedding captures the semantic and syntactic meaning of words into dense vectors. It contains ...
The ever-increasing number of applications based on semantic text analysis is making natural languag...
As machine learning becomes more influential in everyday life, we must begin addressing potential sh...