We introduce a two-layer undirected graphical model, called a “Replicated Softmax”, that can be used to model and automatically extract low-dimensional latent semantic representations from a large unstructured collection of documents. We present efficient learning and inference algorithms for this model, and show how a Monte-Carlo based method, Annealed Importance Sampling, can be used to produce an accurate estimate of the log-probability the model assigns to test data. This allows us to demonstrate that the proposed model generalizes much better than Latent Dirichlet Allocation in terms of both the log-probability of held-out documents and retrieval accuracy.
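The abstract above describes an undirected two-layer model over word counts with efficient learning. As a rough illustration only (not the authors' reference implementation), the sketch below shows one contrastive-divergence (CD-1) update for a Replicated Softmax RBM in numpy: binary hidden units, a softmax visible layer replicated D times (D = document length), and hidden biases scaled by D. All names (`cd1_step`, `W`, `a`, `b`) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_step(v_counts, W, a, b, lr=0.01):
    """One illustrative CD-1 update for a Replicated Softmax RBM.

    v_counts: word-count vector of length K (vocabulary size);
    W: (K, F) weights; a: (F,) hidden biases; b: (K,) visible biases.
    Hidden biases are scaled by D = total words in the document,
    reflecting the D-fold replication of the softmax visible units.
    """
    D = v_counts.sum()
    # positive phase: hidden probabilities given observed counts
    h_prob = sigmoid(v_counts @ W + D * a)
    h_sample = (rng.random(h_prob.shape) < h_prob).astype(float)
    # negative phase: resample D words from the softmax visible layer
    logits = h_sample @ W.T + b
    p = np.exp(logits - logits.max())
    p /= p.sum()
    v_recon = rng.multinomial(int(D), p).astype(float)
    h_recon = sigmoid(v_recon @ W + D * a)
    # CD-1 approximation to the log-likelihood gradient
    W += lr * (np.outer(v_counts, h_prob) - np.outer(v_recon, h_recon))
    a += lr * D * (h_prob - h_recon)
    b += lr * (v_counts - v_recon)
    return W, a, b
```

Annealed Importance Sampling (for estimating held-out log-probability, as the abstract mentions) would be layered on top of a trained model and is not sketched here.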
Based on vector-based representation, topic models, like latent Dirichlet allocation (LDA), are cons...
With the development of computer technology and the internet, increasingly large amounts of textual ...
Topic modeling is an unsupervised learning task that discovers the hidden topics in a ...
We describe a new model for learning meaningful representations of text documents from an unlabeled...
We propose a new type of undirected graphical model suitable for topic modeling and dimensionality r...
In this paper, we describe the infinite replicated Softmax model (iRSM) as an adaptive topic model, ...
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of d...
Statistical topic models such as latent Dirichlet allocation have become enormously popular in the...
Thesis (Master's), University of Washington, 2014. In their 2001 work Latent Dirichlet Allocation, Ble...
Latent Dirichlet Allocation (LDA) is a popular machine-learning technique that identifies latent str...
Probabilistic topic models are machine learning tools for processing and understanding large text d...
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of ...
Most nonparametric topic models such as Hierarchical Dirichlet Processes, when viewed as ...
Latent Dirichlet analysis, or topic modeling, is a flexible latent variable framework for modeling h...
Much of human knowledge sits in large databases of unstructured text. Leveraging this knowledge requ...