The three pillars of an automatic speech recognition system are the lexicon, the language model and the acoustic model. The lexicon provides all the words that can be transcribed, associated with their pronunciation. The acoustic model provides an indication of how the phone units are pronounced, and the language model brings the knowledge of how words are linked. In modern automatic speech recognition systems, the acoustic and language models are statistical. Their estimation requires large volumes of data selected, standardized and annotated. At present, the Web is by far the largest textual corpus available for English and French languages. The data it holds can potentially be used to build the vocabulary and the estimation and adaptation of lang...
A way to improve outputs produced by automatic speech recognition (ASR) systems is to integrate addi...
The construction of a speech recognition system requires a recorded set of phr...
This article describes a methodology for collecting text from the Web to match a target sublanguage ...
The three pillars of an automatic speech recognition system are the lexicon, the model ...
In a previous paper we proposed Web-based language models relying on the possibility theory. These m...
Usually, language models are built either from a closed corpus, or by using World Wide Web retrieved...
In statistical language modelling research, there is a lack of huge text corpora, especially for s...
Automatic speech recognition currently arouses great interest: it can be considered as a significa...
In this paper, a new methodology for speech corpora definition from internet documents is described,...
Standard hidden Markov model (HMM) based automatic speech recognition (ASR) systems use phonemes as...
Current automatic speech recognition (ASR) systems are based on language models (LM) which gather wo...
The framework of this th...
Spoken language speech recognition systems need better understanding of natura...
Standard hidden Markov model (HMM) based automatic speech recognition (ASR) systems use phonemes as ...