In front of modern databases, noise tolerance has become today one of the most studied topics in machine learning. Many algorithms have been suggested for dealing with noisy data in the case of numerical instances, ei-ther by filtering them during a preprocess, or by treating them during the induction. How-ever, this research subject remains widely open when one learns from unbounded sym-bolic sequences, which is the aim in gram-matical inference. In this paper, we propose a statistical approach for dealing with noisy data during the inference of automata, by the state merging algorithm RPNI. Our ap-proach is based on a proportion comparison test, which relaxes the merging rule of RPNI without endangering the generalization error. Beyond th...
Abstract. In probabilistic grammatical inference, a usual goal is to infer a good approximation of a...
We consider probabilistic automata on a general state space and study their computational power. The...
This paper describes an efficient algorithm for learn-ing a timed model from observations. The algor...
We present a new statistical framework for stochastic grammatical inference algorithms based on a ...
International audienceIn this paper, we aim at correcting distributions of noisy samples in order to...
Efficient learnability using the state merging algorithm is known for a subclass of probabilistic au...
International audienceApplications of probabilistic grammatical inference are limited due to time an...
Grammatical inference is a branch of computational learning theory that attacks the problem of learn...
The use of computers and algorithms to deal with human language, in both spoken and written form, is...
International audienceIt is now well-known that the feasibility of induc-tive learning is ruled by s...
Identication of deterministic nite automata (DFAs) has an extensive history, both in passive learnin...
The present thesis addresses several machine learning problems on generative and predictive models o...
International audienceA problem usually encountered in probabilistic automata learning is the diffic...
We study the inference of models of the analysis by reduction that forms an important tool for parsi...
AbstractWe consider probabilistic automata on a general state space and study their computational po...
Abstract. In probabilistic grammatical inference, a usual goal is to infer a good approximation of a...
We consider probabilistic automata on a general state space and study their computational power. The...
This paper describes an efficient algorithm for learn-ing a timed model from observations. The algor...
We present a new statistical framework for stochastic grammatical inference algorithms based on a ...
International audienceIn this paper, we aim at correcting distributions of noisy samples in order to...
Efficient learnability using the state merging algorithm is known for a subclass of probabilistic au...
International audienceApplications of probabilistic grammatical inference are limited due to time an...
Grammatical inference is a branch of computational learning theory that attacks the problem of learn...
The use of computers and algorithms to deal with human language, in both spoken and written form, is...
International audienceIt is now well-known that the feasibility of induc-tive learning is ruled by s...
Identication of deterministic nite automata (DFAs) has an extensive history, both in passive learnin...
The present thesis addresses several machine learning problems on generative and predictive models o...
International audienceA problem usually encountered in probabilistic automata learning is the diffic...
We study the inference of models of the analysis by reduction that forms an important tool for parsi...
AbstractWe consider probabilistic automata on a general state space and study their computational po...
Abstract. In probabilistic grammatical inference, a usual goal is to infer a good approximation of a...
We consider probabilistic automata on a general state space and study their computational power. The...
This paper describes an efficient algorithm for learn-ing a timed model from observations. The algor...