In this paper an effective technique to train an acoustic model from large and unsynchronized audio and text chunks is presented. Given such a speech corpus, an algorithm to automatically segment each chunk into smaller fragments and to synchronize those to the corresponding text is defined. These smaller fragments are more suitable to be used in standard model training algorithms for usage in automatic speech recognition systems. The proposed approach is particularly suitable to bootstrap language models without relying neither on specialized training material nor borrowing from models trained for other similar languages. Extensive experimentation using the CMU Sphinx 4 recognizer and the SphinxTrain model generator in a setting designed f...
Summarization: This work presents techniques for improved cross-language transfer of speech...
Texte intégral accessible uniquement aux membres de l'Université de LorraineThe framework of this th...
We present a proposal of a kernel-based model for large vocabulary continuous speech recognizer. The...
In this paper an effective technique to train an acoustic model from large and unsynchronized audio ...
This paper describes our work on applying ensembles of acoustic models to the problem of large voca...
In this paper, three different techniques for building semicontinuousHMMbased speech recognisers are...
INTERSPEECH2006: the 9th International Conference on Spoken Language Processing (ICSLP), September 1...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
This paper describes experiments in using speech data, collected by means of commercial services, in...
To obtain a robust acoustic model for a certain speech recognition task, a large amount of speech da...
Automatic speech transcription systems are developed for various languages, domains,and applications...
SRIV 2006: ITRW on Speech Recognition and Intrinsic Variatioon, May 20, 2006, Toulouse, France.The...
Unsupervised acoustic modeling can offer a cost and time effective way of creating a solid acoustic ...
This paper describes a new method of word model gener-ation based on acoustically derived segment un...
The thesis deals with different aspects of automatic speech recognition. After an introduction, whic...
Summarization: This work presents techniques for improved cross-language transfer of speech...
Texte intégral accessible uniquement aux membres de l'Université de LorraineThe framework of this th...
We present a proposal of a kernel-based model for large vocabulary continuous speech recognizer. The...
In this paper an effective technique to train an acoustic model from large and unsynchronized audio ...
This paper describes our work on applying ensembles of acoustic models to the problem of large voca...
In this paper, three different techniques for building semicontinuousHMMbased speech recognisers are...
INTERSPEECH2006: the 9th International Conference on Spoken Language Processing (ICSLP), September 1...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
This paper describes experiments in using speech data, collected by means of commercial services, in...
To obtain a robust acoustic model for a certain speech recognition task, a large amount of speech da...
Automatic speech transcription systems are developed for various languages, domains,and applications...
SRIV 2006: ITRW on Speech Recognition and Intrinsic Variatioon, May 20, 2006, Toulouse, France.The...
Unsupervised acoustic modeling can offer a cost and time effective way of creating a solid acoustic ...
This paper describes a new method of word model gener-ation based on acoustically derived segment un...
The thesis deals with different aspects of automatic speech recognition. After an introduction, whic...
Summarization: This work presents techniques for improved cross-language transfer of speech...
Texte intégral accessible uniquement aux membres de l'Université de LorraineThe framework of this th...
We present a proposal of a kernel-based model for large vocabulary continuous speech recognizer. The...