This paper describes methods that exploit stenographic transcripts of the German parliament to improve the acoustic models of a speech recognition system for this domain. Both the stenographic transcripts and the speech data are available on the Internet, which makes it possible to avoid the costly collection and annotation of a large amount of data. The automatic data acquisition technique uses the stenographic transcripts and acoustic data from the German parliamentary speeches, together with general acoustic models trained on different data. The idea of this technique is to generate special finite state automata from the stenographic transcripts. These finite state automata simulate potential possible corres...
The paper describes recent progress in the development of the Slovak language models for transcription ...
Language models used in current automatic speech recognition systems are trained on general-purpose ...
In this paper, a new methodology for speech corpora definition from internet d...
Automatic speech recognition (ASR) systems require large amounts of transcribed speech data, for tra...
The corpus consists of recordings from the Chamber of Deputies of the Parliament of the Czech Republ...
This paper describes different approaches to improve the transcription and indexing quality of the F...
In this paper we apply speech recognition for automatic transcript generation for spoken document r...
The thesis deals with different aspects of automatic speech recognition. After an introduction, whic...
In this paper, a new methodology for speech corpora definition from internet documents is described,...
The framework of this th...
Automatic speech transcription systems are developed for various languages, domains, and applications...
This paper reports on the setup and evaluation of robust speech recognition system parts, geared tow...
Iterative Improving of Transcribed Speech Recordings Exploiting Listener's Feedback. Abstract: This Ph...
The goal of this thesis is to develop a complete pipeline of automatic speech recognition for the Cz...
This thesis introduces a general method for using information at the utterance level and across utte...