With the increasing availability of large datasets machine learning techniques are becoming an increasingly attractive alternative to expert-designed approaches to solving complex problems in domains where data is abundant. In this thesis we introduce several models for large sparse discrete datasets. Our approach, which is based on probabilistic models that use distributed representations to alleviate the effects of data sparsity, is applied to statistical language modelling and collaborative filtering. We introduce three probabilistic language models that represent words using learned real-valued vectors. Two of the models are based on the Restricted Boltzmann Machine (RBM) architecture while the third one is a simple deterministi...
Matrix-parametrized models, including multiclass logistic regression and sparse coding, are used in ...
Finding the right representations for words is critical for building accurate NLP systems when domai...
Learning preference models from human generated data is an important task in modern information proc...
With the increasing availability of large datasets machine learning techniques are becoming an incr...
With the increasing availability of large datasets machine learning techniques are be-coming an incr...
Statistical language models estimate the probability of a word occurring in a given context. The mos...
The restricted Boltzmann machine (RBM) is a flexible tool for modeling complex data, however there h...
Most of the existing approaches to collaborative filtering cannot handle very large data sets. In th...
Collaborative filtering has emerged as a popular way of making user recommendations, but with the in...
Significant recent advances in many areas of data collection and processing have introduced many cha...
We are interested in exploring the possibility and benefits of structure learning for deep models. A...
In spite of their superior performance, neural probabilistic language models (NPLMs) remain far less...
Multi-task learning seeks to improve the generalization performance by sharing common information am...
Many existing approaches to collaborative filtering can neither handle very large datasets nor easil...
This thesis is a compilation of five research contributions whose goal is to do unsupervised and tra...
Matrix-parametrized models, including multiclass logistic regression and sparse coding, are used in ...
Finding the right representations for words is critical for building accurate NLP systems when domai...
Learning preference models from human generated data is an important task in modern information proc...
With the increasing availability of large datasets machine learning techniques are becoming an incr...
With the increasing availability of large datasets machine learning techniques are be-coming an incr...
Statistical language models estimate the probability of a word occurring in a given context. The mos...
The restricted Boltzmann machine (RBM) is a flexible tool for modeling complex data, however there h...
Most of the existing approaches to collaborative filtering cannot handle very large data sets. In th...
Collaborative filtering has emerged as a popular way of making user recommendations, but with the in...
Significant recent advances in many areas of data collection and processing have introduced many cha...
We are interested in exploring the possibility and benefits of structure learning for deep models. A...
In spite of their superior performance, neural probabilistic language models (NPLMs) remain far less...
Multi-task learning seeks to improve the generalization performance by sharing common information am...
Many existing approaches to collaborative filtering can neither handle very large datasets nor easil...
This thesis is a compilation of five research contributions whose goal is to do unsupervised and tra...
Matrix-parametrized models, including multiclass logistic regression and sparse coding, are used in ...
Finding the right representations for words is critical for building accurate NLP systems when domai...
Learning preference models from human generated data is an important task in modern information proc...