In a previous paper we proposed Web-based language models relying on the possibility theory. These models explicitly repre-sent the possibility of word sequences. In this paper we propose to find the best way of combining this kind of model with clas-sical probabilistic models, in the context of automatic speech recognition. We propose several combination approaches, de-pending on the nature of the combined models. With respect to the baseline, the best combination provides an absolute word error rate reduction of about 1 % on broadcast news transcrip-tion, and of 3.5 % on domain-specific multimedia document transcription. Index Terms: language models, world wide web, possibility measure, automatic speech recognitio
The use of language is one of the defining features of human cognition. Focusing here on two key fea...
This paper investigates the integration of a statistical language model into an on-line recognition ...
In this thesis, the use of word posterior probabilities for large vocabulary continuous speech recog...
Usually, language models are built either from a closed corpus, or by using World Wide Web retrieved...
The three pillars of an automatic speech recognition system are the lexicon, the languagemodel and t...
Building models of language is a central task in natural language processing. Traditionally, languag...
Automatic Speech Recognition (ASR) systems utilize statistical acoustic and language models to find ...
Les trois piliers d’un système de reconnaissance automatique de la parole sont le lexique,le modèle ...
This thesis contributes to the research domain of statistical language modeling. In this domain, the...
International audienceSpoken language speech recognition systems need better understanding of natura...
Standard hidden Markov model (HMM) based automatic speech recogni-tion (ASR) systems use phonemes as...
In state-of-the-art large vocabulary automatic recognition systems, a large statistical language mod...
Here is presented a phonetic source model whose parameters, estimated from phonetically transcribed ...
We propose a probabilistic language model that is intended to overcome some of the limitations of th...
Traditional approaches to language modelling have relied on a fixed corpus of text to inform the par...
The use of language is one of the defining features of human cognition. Focusing here on two key fea...
This paper investigates the integration of a statistical language model into an on-line recognition ...
In this thesis, the use of word posterior probabilities for large vocabulary continuous speech recog...
Usually, language models are built either from a closed corpus, or by using World Wide Web retrieved...
The three pillars of an automatic speech recognition system are the lexicon, the languagemodel and t...
Building models of language is a central task in natural language processing. Traditionally, languag...
Automatic Speech Recognition (ASR) systems utilize statistical acoustic and language models to find ...
Les trois piliers d’un système de reconnaissance automatique de la parole sont le lexique,le modèle ...
This thesis contributes to the research domain of statistical language modeling. In this domain, the...
International audienceSpoken language speech recognition systems need better understanding of natura...
Standard hidden Markov model (HMM) based automatic speech recogni-tion (ASR) systems use phonemes as...
In state-of-the-art large vocabulary automatic recognition systems, a large statistical language mod...
Here is presented a phonetic source model whose parameters, estimated from phonetically transcribed ...
We propose a probabilistic language model that is intended to overcome some of the limitations of th...
Traditional approaches to language modelling have relied on a fixed corpus of text to inform the par...
The use of language is one of the defining features of human cognition. Focusing here on two key fea...
This paper investigates the integration of a statistical language model into an on-line recognition ...
In this thesis, the use of word posterior probabilities for large vocabulary continuous speech recog...