Auto-regressive sequence models can estimate the distribution of any type of sequential data. To study sequence models, we consider the problem of language modeling, which entails predicting probability distributions over sequences of text. This thesis improves on previous language modeling approaches by giving models additional flexibility to adapt to their inputs. In particular, we focus on multiplicative LSTM (mLSTM), which has added flexibility to change its recurrent transition function depending on its input as compared with traditional LSTM, and dynamic evaluation, which helps LSTM (or other sequence models) adapt to the recent sequence history to exploit re-occurring patterns within a sequence. We find that using these adaptive appr...
Traditional machine learning sequence models, such as RNN and LSTM, can solve sequential data proble...
This paper proposes a general method for improving the structure and quality of sequences generated ...
In this paper we present a survey on the application of recurrent neural networks to the task of sta...
Huge neural autoregressive sequence models have achieved impressive performance across different app...
In this thesis, we study novel neural network structures to better model long term dependency in seq...
© 2018 Dr Florin SchimbinschiAt a high level, sequence modelling problems are of the form where the ...
Neural network sequence models have become a fundamental building block for natural language process...
Language acquisition is one of the core problems in artificial intelligence (AI) and it is generally...
In recent years, the field of language modelling has witnessed exciting developments. Especially, th...
This thesis studies the introduction of a priori structure into the design of learning systems based...
© 2019 Dr. Cong Duy Vu HoangNeural sequence models have recently achieved great success across vario...
Recurrent Neural Networks (RNNs) are theoretically Turing-complete and established themselves as a d...
Sequence processing involves several tasks such as clustering, classification, prediction, and trans...
The analysis of sequences is important for extracting information from music owing to its fundamenta...
With the advent of deep learning, research in many areas of machine learning is converging towards t...
Traditional machine learning sequence models, such as RNN and LSTM, can solve sequential data proble...
This paper proposes a general method for improving the structure and quality of sequences generated ...
In this paper we present a survey on the application of recurrent neural networks to the task of sta...
Huge neural autoregressive sequence models have achieved impressive performance across different app...
In this thesis, we study novel neural network structures to better model long term dependency in seq...
© 2018 Dr Florin SchimbinschiAt a high level, sequence modelling problems are of the form where the ...
Neural network sequence models have become a fundamental building block for natural language process...
Language acquisition is one of the core problems in artificial intelligence (AI) and it is generally...
In recent years, the field of language modelling has witnessed exciting developments. Especially, th...
This thesis studies the introduction of a priori structure into the design of learning systems based...
© 2019 Dr. Cong Duy Vu HoangNeural sequence models have recently achieved great success across vario...
Recurrent Neural Networks (RNNs) are theoretically Turing-complete and established themselves as a d...
Sequence processing involves several tasks such as clustering, classification, prediction, and trans...
The analysis of sequences is important for extracting information from music owing to its fundamenta...
With the advent of deep learning, research in many areas of machine learning is converging towards t...
Traditional machine learning sequence models, such as RNN and LSTM, can solve sequential data proble...
This paper proposes a general method for improving the structure and quality of sequences generated ...
In this paper we present a survey on the application of recurrent neural networks to the task of sta...