Probabilistic models of sequences play a central role in most machine translation, automated speech recognition, lossless compression, spell-checking, and gene identification applications to name but a few. Unfortunately, realworld sequence data often exhibit long range dependencies which can only be captured by computationally challenging, complex models. Sequence data arising from natural processes also often exhibits power-law properties, yet common sequence models do not capture such properties. The sequence memoizer is a new hierarchical Bayesian model for discrete sequence data that captures long range dependencies and power-law characteristics, while remaining computationally attractive. Its utility as a language model and general pu...
We consider problems of sequence processing and propose a solution based on a discrete state model i...
Huge neural autoregressive sequence models have achieved impressive performance across different app...
236 pagesSequence data, which consists of values organized in a certain order, is one of the most co...
Probabilistic models of sequences play a central role in most machine translation, automated speech ...
The sequence memoizer is a model for sequence data with state-of-the-art performance on language mod...
We propose an unbounded-depth, hierarchical, Bayesian nonparametric model for discrete sequence data...
We show that finite-order Markov models fail to capture long range dependencies that exist in human ...
This article presents the PST R package for categorical sequence analysis with probabilistic suffix ...
(HSM) for statistical modeling of sequence data. The HSM generalizes our previous proposal on struct...
With the advent of deep learning, research in many areas of machine learning is converging towards t...
This paper is concerned with algorithms for prediction of discrete sequences over a finite alphabet,...
This article presents the PST R package for categorical sequence analysis with probabilistic suffix ...
Article dans revue scientifique avec comité de lecture.In natural language and especially in spontan...
We propose a new discriminative framework, namely Hidden Dynamic Conditional Random Fields (HD-CRFs)...
Most language models used for natural lan-guage processing are continuous. However, the assumption o...
We consider problems of sequence processing and propose a solution based on a discrete state model i...
Huge neural autoregressive sequence models have achieved impressive performance across different app...
236 pagesSequence data, which consists of values organized in a certain order, is one of the most co...
Probabilistic models of sequences play a central role in most machine translation, automated speech ...
The sequence memoizer is a model for sequence data with state-of-the-art performance on language mod...
We propose an unbounded-depth, hierarchical, Bayesian nonparametric model for discrete sequence data...
We show that finite-order Markov models fail to capture long range dependencies that exist in human ...
This article presents the PST R package for categorical sequence analysis with probabilistic suffix ...
(HSM) for statistical modeling of sequence data. The HSM generalizes our previous proposal on struct...
With the advent of deep learning, research in many areas of machine learning is converging towards t...
This paper is concerned with algorithms for prediction of discrete sequences over a finite alphabet,...
This article presents the PST R package for categorical sequence analysis with probabilistic suffix ...
Article dans revue scientifique avec comité de lecture.In natural language and especially in spontan...
We propose a new discriminative framework, namely Hidden Dynamic Conditional Random Fields (HD-CRFs)...
Most language models used for natural lan-guage processing are continuous. However, the assumption o...
We consider problems of sequence processing and propose a solution based on a discrete state model i...
Huge neural autoregressive sequence models have achieved impressive performance across different app...
236 pagesSequence data, which consists of values organized in a certain order, is one of the most co...