This article presents the PST R package for categorical sequence analysis with probabilistic suffix trees (PSTs), i.e., structures that store variable-length Markov chains (VLMCs). VLMCs allow to model high-order dependencies in categorical sequences with parsimonious models based on simple estimation procedures. The package is specifically adapted to the field of social sciences, as it allows for VLMC models to be learned from sets of individual sequences possibly containing missing values; in addition, the package is extended to account for case weights. This article describes how a VLMC model is learned from one or more categorical sequences and stored in a PST. The PST can then be used for sequence prediction, i.e., to assign a probabil...
International audienceThe problem of mining for outliers in sequential datasets is crucial to forwar...
We present a tutorial and new publicly available computational tools for variable length Markov chai...
Abstract: The problem of discovering temporal and attribute dependencies from multi-sets of events d...
This article presents the PST R package for categorical sequence analysis with probabilistic suffix ...
This article presents the PST R package for categorical sequence analysis with probabilistic suffix ...
Motivation: A central problem in genomics is to determine the function of a protein using the inform...
Motivation: Markov models are very popular for analyzing complex sequences such as protein sequences...
Motivation: Markov models are very popular for analyzing complex sequences such as protein sequences...
This dissertation presents an in-depth analysis of the Predictive State Representation (PSR), a new ...
This paper is concerned with algorithms for prediction of discrete sequences over a finite alphabet,...
In this paper, we compare Probabilistic Suffix Trees (PST), recently proposed, to a specific smoot...
Variable order markov chains have been applied to a variety of problems in computational biology, fr...
protein analysis This paper is concerned with algorithms for prediction of discrete sequences over a...
In this paper we study higher-order Markov chain models for analyzing categorical data sequences. We...
This article describes the many capabilities offered by the TraMineR toolbox for categorical sequenc...
International audienceThe problem of mining for outliers in sequential datasets is crucial to forwar...
We present a tutorial and new publicly available computational tools for variable length Markov chai...
Abstract: The problem of discovering temporal and attribute dependencies from multi-sets of events d...
This article presents the PST R package for categorical sequence analysis with probabilistic suffix ...
This article presents the PST R package for categorical sequence analysis with probabilistic suffix ...
Motivation: A central problem in genomics is to determine the function of a protein using the inform...
Motivation: Markov models are very popular for analyzing complex sequences such as protein sequences...
Motivation: Markov models are very popular for analyzing complex sequences such as protein sequences...
This dissertation presents an in-depth analysis of the Predictive State Representation (PSR), a new ...
This paper is concerned with algorithms for prediction of discrete sequences over a finite alphabet,...
In this paper, we compare Probabilistic Suffix Trees (PST), recently proposed, to a specific smoot...
Variable order markov chains have been applied to a variety of problems in computational biology, fr...
protein analysis This paper is concerned with algorithms for prediction of discrete sequences over a...
In this paper we study higher-order Markov chain models for analyzing categorical data sequences. We...
This article describes the many capabilities offered by the TraMineR toolbox for categorical sequenc...
International audienceThe problem of mining for outliers in sequential datasets is crucial to forwar...
We present a tutorial and new publicly available computational tools for variable length Markov chai...
Abstract: The problem of discovering temporal and attribute dependencies from multi-sets of events d...