In this paper, we present the design of a corpus for speech recognition to be used for the recording of a speech database in Catalan. A previous database in Spanish was the reference in setting the specifications about the characteristics of the sentences and in the minimum number of units required. An analysis of unit frequencies were carried out in order to know which units were relevant for training and to compare the results with the figures from the designed corpus. Three different sub-corpora were generated, one for training, ...Peer ReviewedPostprint (published version
AbstractThis work addresses one of the common issues arising when building a speech recognition syst...
Dentro del reconocimiento automático del habla, los modelos de lenguaje estadísticos basados en la p...
Context: Automatic speech recognition requires the development of language and acoustic models for d...
In this paper, we present the design of a corpus for speech recognition to be used for the recordin...
Knowledge of phonetic unit frequency is very necessary for developing databases in both concatenativ...
This paper describes the phonetic content of Albayzin, a spoken database for Spanish designed for sp...
Under the SpeechDat specifications, the Spanish member of SpeechDat consortium has recorded a Catala...
Under the SpeechDat specifications, the Spanish member of SpeechDat consortium has recorded a Catala...
The voice corpus of language is the essential part of the linguistic resources, and it contains the ...
Different databases of phonetic units are required in multilingual Text-to-Speech systems based on c...
In this paper we present the evaluation of a spoken phonetic corpus designed to train acoustic model...
Data driven methods in speech and linguistic research, and system develoment require appropriate spe...
This paper describes NaniBD, a set of tools designed for transcribing and validating speech database...
International audienceThe construction of a speech recognition system requires a recorded set of phr...
[EN] Foreign language acquisition must inevitably start with phonetics, an aspect of language whose...
AbstractThis work addresses one of the common issues arising when building a speech recognition syst...
Dentro del reconocimiento automático del habla, los modelos de lenguaje estadísticos basados en la p...
Context: Automatic speech recognition requires the development of language and acoustic models for d...
In this paper, we present the design of a corpus for speech recognition to be used for the recordin...
Knowledge of phonetic unit frequency is very necessary for developing databases in both concatenativ...
This paper describes the phonetic content of Albayzin, a spoken database for Spanish designed for sp...
Under the SpeechDat specifications, the Spanish member of SpeechDat consortium has recorded a Catala...
Under the SpeechDat specifications, the Spanish member of SpeechDat consortium has recorded a Catala...
The voice corpus of language is the essential part of the linguistic resources, and it contains the ...
Different databases of phonetic units are required in multilingual Text-to-Speech systems based on c...
In this paper we present the evaluation of a spoken phonetic corpus designed to train acoustic model...
Data driven methods in speech and linguistic research, and system develoment require appropriate spe...
This paper describes NaniBD, a set of tools designed for transcribing and validating speech database...
International audienceThe construction of a speech recognition system requires a recorded set of phr...
[EN] Foreign language acquisition must inevitably start with phonetics, an aspect of language whose...
AbstractThis work addresses one of the common issues arising when building a speech recognition syst...
Dentro del reconocimiento automático del habla, los modelos de lenguaje estadísticos basados en la p...
Context: Automatic speech recognition requires the development of language and acoustic models for d...