We introduce Probabilistic FastText, a new model for word embeddings that can capture multiple word senses, sub-word structure, and uncertainty information. In particular, we represent each word with a Gaussian mixture density, where the mean of a mixture component is given by the sum of its n-gram vectors. This representation allows the model to share statistical strength across sub-word structures (e.g., Latin roots), producing accurate representations of rare, misspelt, or even unseen words. Moreover, each component of the mixture can capture a different word sense. Probabilistic FastText outperforms both FastText, which has no probabilistic model, and dictionary-level probabilistic embeddings, which do not incorporate subword structures, on several...
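To make the representation concrete, here is a minimal sketch (Python/NumPy) of the idea the abstract describes: each word is a small Gaussian mixture whose subword component mean is the sum of character n-gram vectors, and two words are compared with a closed-form similarity between mixtures (the expected likelihood kernel, a common choice for Gaussian embeddings). All helper names, hyperparameters, and the random "embeddings" below are illustrative assumptions rather than the paper's implementation; in the actual model the n-gram and sense vectors are learned from a corpus.

```python
# Sketch only: toy Probabilistic-FastText-style word representation.
# Each word -> K-component spherical Gaussian mixture; the first
# component's mean is the sum of its character n-gram vectors, so
# rare, misspelt, or unseen words still receive an embedding.
import numpy as np

DIM, K = 50, 2                          # embedding dim, mixture components (assumed)
rng = np.random.default_rng(0)

def extract_ngrams(word, n_min=3, n_max=6):
    """Character n-grams of the boundary-padded word, as in FastText."""
    padded = f"<{word}>"
    return [padded[i:i + n]
            for n in range(n_min, n_max + 1)
            for i in range(len(padded) - n + 1)]

ngram_table = {}                        # stand-in for a learned n-gram embedding table
def ngram_vector(gram):
    if gram not in ngram_table:
        ngram_table[gram] = rng.normal(scale=0.1, size=DIM)
    return ngram_table[gram]

def word_mixture(word):
    """Return (means, variances, weights) of the word's Gaussian mixture."""
    subword_mean = sum(ngram_vector(g) for g in extract_ngrams(word))
    sense_means = rng.normal(scale=0.1, size=(K - 1, DIM))   # learned in practice
    means = np.vstack([subword_mean, sense_means])
    variances = np.full(K, 1.0)         # one spherical variance per component
    weights = np.full(K, 1.0 / K)
    return means, variances, weights

def expected_likelihood_kernel(f, g):
    """Closed-form E[<f, g>] between two spherical Gaussian mixtures."""
    (mf, vf, wf), (mg, vg, wg) = f, g
    score = 0.0
    for i in range(len(wf)):
        for j in range(len(wg)):
            v = vf[i] + vg[j]
            d2 = np.sum((mf[i] - mg[j]) ** 2)
            score += wf[i] * wg[j] * np.exp(-0.5 * d2 / v) / (2 * np.pi * v) ** (DIM / 2)
    return score

# Unseen or misspelt words still get representations via shared n-grams.
print(expected_likelihood_kernel(word_mixture("rock"), word_mixture("rokc")))
```

Because every word draws on the same n-gram table, related surface forms (e.g. "rock" and the misspelling "rokc") share n-grams such as "<ro" and therefore receive nearby subword means, which is the mechanism behind the rare- and unseen-word claim above.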
Word embedding is a feature learning technique which aims at mapping words from a vocabulary into ve...
In this paper, word sense disambiguation (WSD) accuracy achievable by a probabilistic classifier, usi...
Rare word representation has recently enjoyed a surge of interest, owing to the crucial role that ef...
Distributed word representations have been widely used and proven to be useful in quite a few natura...
The digital era floods us with an excessive amount of text data. To make sense of such data automati...
There are two main types of word representations: low-dimensional embeddings and high-dimensional d...
Pretraining deep language models has led to large performance gains in NLP. Despite this success, Sc...
We propose a new word embedding model, inspired by GloVe, which is formulated as a feasible least sq...
Several recent studies have shown the benefits of combining language and perce...
Distributed word embeddings have yielded state-of-the-art performance in many NLP tasks, mainly due ...
Traditional natural language processing has been shown to have excessive reliance on human-annotated...
We demonstrate the benefits of probabilistic representations due to their expressiveness which allow...
The GloVe word embedding model relies on solving a global optimization problem, which can be reformu...
Pretraining deep neural network architectures with a language modeling objective has brought large i...
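Two of the snippets above describe least-squares reformulations of GloVe. For reference, the sketch below shows the weighted least-squares objective they start from, J = sum_{ij} f(X_ij) (w_i·w̃_j + b_i + b̃_j − log X_ij)^2 with f(x) = min((x/x_max)^α, 1). The toy co-occurrence matrix, array names, and sizes are assumptions made for illustration; only the functional form follows GloVe (Pennington et al., 2014).

```python
# Sketch only: GloVe's weighted least-squares objective over a toy
# co-occurrence matrix X (shapes and data are illustrative assumptions).
import numpy as np

rng = np.random.default_rng(0)
V, D = 100, 25                           # vocabulary size, embedding dimension
X = rng.poisson(2.0, size=(V, V))        # toy co-occurrence counts

W  = rng.normal(scale=0.1, size=(V, D))  # word vectors w_i
Wc = rng.normal(scale=0.1, size=(V, D))  # context vectors w~_j
b  = np.zeros(V)                         # word biases b_i
bc = np.zeros(V)                         # context biases b~_j

def weight(x, x_max=100.0, alpha=0.75):
    """GloVe weighting f(x) = min((x / x_max)^alpha, 1)."""
    return np.minimum((x / x_max) ** alpha, 1.0)

def glove_loss(X, W, Wc, b, bc):
    """J = sum over observed (i, j) of f(X_ij) * (w_i.w~_j + b_i + b~_j - log X_ij)^2."""
    i, j = np.nonzero(X)                 # GloVe sums over nonzero co-occurrences only
    residual = (np.einsum("nd,nd->n", W[i], Wc[j])
                + b[i] + bc[j] - np.log(X[i, j]))
    return np.sum(weight(X[i, j]) * residual ** 2)

print(glove_loss(X, W, Wc, b, bc))       # value to be minimized w.r.t. W, Wc, b, bc
```

The two snippets reformulate or build on an objective of this form; the sketch is only meant to fix notation, not to reproduce either paper's method.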