National audience. A new statistical method for language modeling and spoken document classification is proposed. It is based on a mixture of topic-dependent probabilities. Each topic-dependent probability is in turn a mixture of n-gram probabilities and the probability of Kullback-Leibler (KL) distances between keyword unigrams and the distribution obtained from the content of a cache memory. Experimental results on topic classification, using a corpus of 60 Mwords from the French newspaper Le Monde, show the excellent performance of the cache memory and its complementary role in providing different statistics for the decision process.
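The cache-based part of the method can be illustrated with a small sketch. The abstract does not give the exact formulation, so everything below is an assumption made for illustration: the function names, the add-alpha smoothing, the use of the directed KL divergence, and the choice of picking the topic with the smallest distance. The combination with n-gram probabilities in a mixture is not shown.

```python
import math
from collections import Counter


def unigram_distribution(words, vocabulary, alpha=1.0):
    """Add-alpha smoothed unigram distribution restricted to the keyword vocabulary."""
    counts = Counter(w for w in words if w in vocabulary)
    total = sum(counts.values()) + alpha * len(vocabulary)
    return {w: (counts[w] + alpha) / total for w in vocabulary}


def kl_divergence(p, q):
    """Directed KL divergence D(p || q) for two distributions over the same keys."""
    return sum(p[w] * math.log(p[w] / q[w]) for w in p)


def closest_topic(cache_words, topic_unigrams, vocabulary):
    """Return the topic whose keyword unigram is closest (in KL) to the cache content."""
    cache_dist = unigram_distribution(cache_words, vocabulary)
    distances = {
        topic: kl_divergence(cache_dist, dist)
        for topic, dist in topic_unigrams.items()
    }
    best = min(distances, key=distances.get)
    return best, distances


# Toy data (hypothetical): two topics described by a four-keyword vocabulary.
vocabulary = ["match", "goal", "vote", "election"]
topic_unigrams = {
    "sport": unigram_distribution(["match", "goal", "goal", "match"], vocabulary),
    "politics": unigram_distribution(["vote", "election", "vote"], vocabulary),
}
# The cache holds the most recent words of the document being classified.
best_topic, distances = closest_topic(["goal", "match", "goal"], topic_unigrams, vocabulary)
```

Smoothing keeps every keyword probability strictly positive, so the logarithm in the divergence is always defined. In a fuller system the distances would feed a probability that is mixed with the topic-dependent n-gram scores, as the abstract describes, rather than being used alone.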
It is shown that the enormous improvement in the size of disk storage space in recent years can be u...
Statistical language models are widely used in automatic speech recognition in order to constrain th...
This paper describes an approach for constructing a mixture of language models based on simple stati...
International audience. A new statistical method for Language Modeling and spoken document classificat...
International audience. The use of cache memories and symmetric Kullback-Leibler distances is proposed...
The role of a stochastic language model is to give the best estimation possible of the probability o...
International audience. This paper describes the application of an information-theoretic approach to d...
International peer-reviewed conference with proceedings. International audience. This paper presents s...
In state-of-the-art large vocabulary automatic recognition systems, a large statistical language mod...
In statistical language modelling research, there is a lack of huge text corpora, especially for s...
Statistical language modelling may not only be used to uncover the patterns which underlie the compo...