International audienceThe construction of a speech recognition system requires a recorded set of phrases to compute the pertinent acoustic models. This set of phrases must be phonetically rich and balanced in order to obtain a robust recognizer. By tradition, this set is defined manually implicating a great human effort. In this paper we propose an automated method for assembling a phonetically balanced corpus (set of phrases) from the Web. The proposed method was used to construct a phonetically balanced corpus for the Mexican Spanish language
[[abstract]]Here, we describe an efficient algorithm to select phonetically balanced scripts for col...
International audienceLanguage registers are a strongly perceptible characteristic of texts and spee...
Articulatory data offers promising developments in our understanding of speech production and advanc...
International audienceThe construction of a speech recognition system requires a recorded set of phr...
We present a method for designing a phonetically balanced speech corpus. In this method, we used a p...
The three pillars of an automatic speech recognition system are the lexicon, the languagemodel and t...
In this paper, a new methodology for speech corpora definition from internet documents is described,...
International audienceIn this paper, a new methodology for speech corpora definition from internet d...
A method is proposed for compiling a corpus of phonetically-rich triphone sentences; i.e., sentences...
In this paper, we present the design of a corpus for speech recognition to be used for the recordin...
Objective: The current study describes the collection of a new phonemically-balanced Spanish sentenc...
This paper describes the phonetic content of Albayzin, a spoken database for Spanish designed for sp...
Les trois piliers d’un système de reconnaissance automatique de la parole sont le lexique,le modèle ...
In statistical language modelling researches, there is a lack of huge text corpora, especially for s...
Este artículo presenta el proceso de definición de un corpus de texto equilibrado en términos de atr...
[[abstract]]Here, we describe an efficient algorithm to select phonetically balanced scripts for col...
International audienceLanguage registers are a strongly perceptible characteristic of texts and spee...
Articulatory data offers promising developments in our understanding of speech production and advanc...
International audienceThe construction of a speech recognition system requires a recorded set of phr...
We present a method for designing a phonetically balanced speech corpus. In this method, we used a p...
The three pillars of an automatic speech recognition system are the lexicon, the languagemodel and t...
In this paper, a new methodology for speech corpora definition from internet documents is described,...
International audienceIn this paper, a new methodology for speech corpora definition from internet d...
A method is proposed for compiling a corpus of phonetically-rich triphone sentences; i.e., sentences...
In this paper, we present the design of a corpus for speech recognition to be used for the recordin...
Objective: The current study describes the collection of a new phonemically-balanced Spanish sentenc...
This paper describes the phonetic content of Albayzin, a spoken database for Spanish designed for sp...
Les trois piliers d’un système de reconnaissance automatique de la parole sont le lexique,le modèle ...
In statistical language modelling researches, there is a lack of huge text corpora, especially for s...
Este artículo presenta el proceso de definición de un corpus de texto equilibrado en términos de atr...
[[abstract]]Here, we describe an efficient algorithm to select phonetically balanced scripts for col...
International audienceLanguage registers are a strongly perceptible characteristic of texts and spee...
Articulatory data offers promising developments in our understanding of speech production and advanc...