With the increasing availability of large datasets, machine learning techniques are becoming an increasingly attractive alternative to expert-designed approaches to solving complex problems in domains where data is abundant. In this thesis we introduce several models for large sparse discrete datasets. Our approach, which is based on probabilistic models that use distributed representations to alleviate the effects of data sparsity, is applied to statistical language modelling and collaborative filtering. We introduce three probabilistic language models that represent words using learned real-valued vectors. Two of the models are based on the Restricted Boltzmann Machine (RBM) architecture, while the third one is a simple deterministic model...
Grammar-based natural language processing has reached a level where it can `understand' language to ...
Multi-task learning seeks to improve the generalization performance by sharing common information am...
This paper reports on the benefits of large-scale statistical language modeling in machine translatio...
Statistical language models estimate the probability of a word occurring in a given context. The mos...
The restricted Boltzmann machine (RBM) is a flexible tool for modeling complex data; however, there h...
Finding the right representations for words is critical for building accurate NLP systems when domai...
Language models are probability distributions over a set of unilingual natural language text used in...
Combined with neural language models, distributed word representations achieve significant advantage...
In the field of Natural Language Processing, supervised machine learning is commonly used to solve c...
Vector-space distributed representations of words can capture syntactic and semantic regularities in...
In this paper, we consider learning dictionary models over a network of agents, where each agent is ...
This thesis, which is organized in two independent parts, presents work on distributional semantics ...
In spite of their superior performance, neural probabilistic language models (NPLMs) remain far less...