Word2Vec recently popularized dense vector word representations as fixed-length features for machine learning algorithms and is in widespread use today. In this paper we investigate one of its core components, Negative Sampling, and propose efficient distributed algorithms that allow us to scale to vocabulary sizes of more than 1 billion unique words and corpus sizes of more than 1 trillion words.
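As a hedged, minimal illustration of the negative-sampling objective this entry refers to (not the paper's distributed implementation), the Python sketch below computes the standard skip-gram negative-sampling loss for a single (center, context) pair against k sampled noise words; the vector arguments are illustrative assumptions.

import numpy as np

def sgns_loss(v_center, v_context, v_negatives):
    # Negative log-likelihood of one true pair plus k sampled negative pairs.
    sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))
    loss = -np.log(sigmoid(v_center @ v_context))        # pull the true pair together
    for v_neg in v_negatives:                             # push sampled noise words apart
        loss -= np.log(sigmoid(-(v_center @ v_neg)))
    return loss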
Paper presented at: 2017 Conference on Empirical Methods in Natural Language Processing, held...
The Global Vectors for word representation (GloVe), introduced by Jeffrey Pennington et al. [3], is ...
This thesis is an exploration and exposition of a highly efficient shallow neural network algorithm c...
The demand for Natural Language Processing has been growing rapidly due to the various emerging Int...
One of the best-known authors of the method is Tomas Mikolov. His software and method of theoretica...
In this paper, we propose LexVec, a new method for generating distributed word representations that ...
Learning word embeddings on a large unlabeled corpus has been shown to be succe...
Recently significant advances have been witnessed in the area of distributed word representations ba...
Although the word-popularity-based negative sampler has shown superb performance in the skip-gram mo...
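For context on the sampler mentioned above: word2vec-style training commonly draws negatives from the unigram distribution raised to the 0.75 power. The snippet below is a minimal sketch of that smoothed distribution; the toy counts are made-up values.

import numpy as np

counts = np.array([1000, 200, 50, 5], dtype=np.float64)  # toy word frequencies
probs = counts ** 0.75                                    # smooth the unigram distribution
probs /= probs.sum()

rng = np.random.default_rng(0)
negatives = rng.choice(len(counts), size=5, p=probs)      # draw 5 negative word ids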
The recently introduced continuous Skip-gram model is an efficient method for learning high-quality ...
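As a small, hedged sketch of how the continuous Skip-gram model forms its training examples (each word predicts the words inside a symmetric context window), assuming a toy tokenized sentence and window size:

def skipgram_pairs(tokens, window=2):
    # Enumerate (center, context) pairs within the given window.
    pairs = []
    for i, center in enumerate(tokens):
        for j in range(max(0, i - window), min(len(tokens), i + window + 1)):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs

print(skipgram_pairs(["the", "quick", "brown", "fox"], window=1))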
Distributional models of semantics learn word meanings from contextual co-occurrence patterns across...
In this work, we investigate word embedding algorithms in the context of natural language processing...
The digital era floods us with an excessive amount of text data. To make sense of such data automati...
word2vec model trained on the concatenation of all the individual universities' corpora. To generate ...
Recently, several works in the domain of natural language processing presented successful methods fo...