We introduce Probabilistic FastText, a new model for word embeddings that can capture multiple word senses, sub-word structure, and uncertainty information. In particular, we represent each word with a Gaussian mixture density, where the mean of a mixture component is given by the sum of n-grams. This representation allows the model to share statistical strength across sub-word structures (e.g. Latin roots), producing accurate representations of rare, misspelt, or even unseen words. Moreover, each component of the mixture can capture a different word sense. Probabilistic FastText outperforms both FastText, which has no probabilistic model, and dictionary-level probabilistic embeddings, which do not incorporate subword structures, on several...
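To make the subword construction above concrete, the following minimal sketch builds the mean of one mixture component as the sum of the word's character n-gram vectors, in the spirit of FastText subword embeddings. It is an illustration under assumptions, not the authors' implementation: the helper `extract_ngrams` and the randomly initialized `ngram_table` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 50
ngram_table = {}  # hypothetical lookup: character n-gram -> vector

def extract_ngrams(word, n_min=3, n_max=6):
    """Character n-grams of the word, padded with boundary markers."""
    padded = f"<{word}>"
    return [padded[i:i + n]
            for n in range(n_min, n_max + 1)
            for i in range(len(padded) - n + 1)]

def component_mean(word):
    """Mean of one Gaussian mixture component: the sum of its n-gram vectors."""
    total = np.zeros(dim)
    for gram in extract_ngrams(word):
        # Shared n-grams let rare or misspelt words reuse statistical strength.
        vec = ngram_table.setdefault(gram, rng.normal(scale=0.1, size=dim))
        total += vec
    return total

print(component_mean("misspelt").shape)  # (50,)
```

Because every word decomposes into n-grams, even an unseen or misspelt word receives a usable component mean through the n-grams it shares with known words.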
We describe NRC's submission to the Anomaly Detection/Text Mining competition organised at the Text ...
Word embedding is a feature learning technique which aims at mapping words from a vocabulary into ve...
Continuous word representations that can capture the semantic information in the corpus are the buil...
Distributed word representations have been widely used and proven to be useful in quite a few natura...
The digital era floods us with an excessive amount of text data. To make sense of such data automati...
We propose a new word embedding model, inspired by GloVe, which is formulated as a feasible least sq...
There are two main types of word representations: low-dimensional embeddings and high-dimensional d...
We demonstrate the benefits of probabilistic representations due to their expressiveness which allow...
Several recent studies have shown the benefits of combining language and perce...
Pretraining deep language models has led to large performance gains in NLP. Despite this success, Sc...
Traditional natural language processing has been shown to have excessive reliance on human-annotated...
Distributed word embeddings have yielded state-of-the-art performance in many NLP tasks, mainly due ...
The GloVe word embedding model relies on solving a global optimization problem, which can be reformu...
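For concreteness, the snippet below evaluates the standard GloVe weighted least-squares objective, $J = \sum_{ij} f(X_{ij})\,(w_i^\top \tilde{w}_j + b_i + \tilde{b}_j - \log X_{ij})^2$, which is the global optimization problem the abstract refers to. The toy co-occurrence matrix and variable names are illustrative only, not drawn from the cited work.

```python
import numpy as np

def glove_loss(W, W_ctx, b, b_ctx, X, x_max=100.0, alpha=0.75):
    """GloVe objective over the nonzero entries of the co-occurrence matrix X."""
    loss = 0.0
    rows, cols = np.nonzero(X)
    for i, j in zip(rows, cols):
        # Standard GloVe weighting: f(x) = (x / x_max)^alpha, capped at 1.
        weight = min((X[i, j] / x_max) ** alpha, 1.0)
        residual = W[i] @ W_ctx[j] + b[i] + b_ctx[j] - np.log(X[i, j])
        loss += weight * residual ** 2
    return loss

# Toy usage: a random 5-word vocabulary with 10-dimensional vectors.
rng = np.random.default_rng(0)
V, d = 5, 10
X = rng.integers(0, 20, size=(V, V)).astype(float)
print(glove_loss(rng.normal(size=(V, d)), rng.normal(size=(V, d)),
                 rng.normal(size=V), rng.normal(size=V), X))
```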
In this paper, word sense disambiguation (WSD) accuracy achievable by a probabilistic classifier, usi...
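The abstract above is cut off, but one common probabilistic classifier for WSD is a Naive Bayes model over context words. The minimal sketch below, with made-up training contexts for the ambiguous word "bank", shows that idea; it is not taken from the cited paper.

```python
from collections import Counter, defaultdict
import math

# Hypothetical sense-labelled contexts for the ambiguous word "bank".
train = [
    (["river", "water", "fishing"], "bank_river"),
    (["money", "loan", "deposit"], "bank_finance"),
    (["deposit", "account", "interest"], "bank_finance"),
    (["shore", "river", "mud"], "bank_river"),
]

sense_counts = Counter(sense for _, sense in train)
word_counts = defaultdict(Counter)
for context, sense in train:
    word_counts[sense].update(context)

def disambiguate(context, alpha=1.0):
    """Pick the sense maximizing log P(sense) + sum of log P(word | sense), Laplace-smoothed."""
    vocab = {w for counts in word_counts.values() for w in counts}
    best_sense, best_score = None, float("-inf")
    for sense, prior in sense_counts.items():
        total = sum(word_counts[sense].values())
        score = math.log(prior / len(train))
        for w in context:
            score += math.log((word_counts[sense][w] + alpha) /
                              (total + alpha * len(vocab)))
        if score > best_score:
            best_sense, best_score = sense, score
    return best_sense

print(disambiguate(["loan", "interest"]))  # -> bank_finance
```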