This paper compares schemes for the selection of multi-genre broadcast data and corresponding transcriptions for speech recognition model training. Selections of the same amount of data (700 hours) from lightly supervised alignments based on the same original subtitle transcripts are compared. Data segments were selected according to a maximum phone matched error rate between the lightly supervised decoding and the original transcript. The data selected with an improved lightly supervised system yields lower word error rates (WERs). Detailed comparisons of the data selected on carefully transcribed development data show how the selected portions match the true phone error rate for each genre. From a broader perspective, it is shown that for...
LREC2006: the 5th international conference on Language Resources and Evaluation, May 2006.This paper...
International audienceThis paper reports on a speech-to-text (STT) transcription system for Hungaria...
The paper addresses a scheme of lightly supervised training of an acoustic model, which exploits a l...
International audienceTraining a speech recognition system needs audio data and their corresponding ...
This paper investigates improving lightly supervised acoustic model training for an archive of broad...
This paper investigates improving lightly supervised acous-tic model training for an archive of broa...
Obtaining sufficient labelled training data is a persistent dif-ficulty for speech recognition resea...
A significant cost in obtaining acoustic training data is the generation of accurate transcriptions....
We describe the University of Sheffield system for participation in the 2015 Multi-Genre Broadcast (...
This paper describes the specification, design and development phases of two widely used telepho...
This paper describes some recent results of our collaborative work on developing a speech recognitio...
Automatic speech recognition (ASR) in the educational environment could be a solution to address the...
This paper investigates the potential of improving a hybrid automatic speech recognition model train...
This paper presents an extended study in the topic of optimal selection of speech data from a databa...
Development of an ASR application such as a speech-oriented guidance system for a real environment i...
LREC2006: the 5th international conference on Language Resources and Evaluation, May 2006.This paper...
International audienceThis paper reports on a speech-to-text (STT) transcription system for Hungaria...
The paper addresses a scheme of lightly supervised training of an acoustic model, which exploits a l...
International audienceTraining a speech recognition system needs audio data and their corresponding ...
This paper investigates improving lightly supervised acoustic model training for an archive of broad...
This paper investigates improving lightly supervised acous-tic model training for an archive of broa...
Obtaining sufficient labelled training data is a persistent dif-ficulty for speech recognition resea...
A significant cost in obtaining acoustic training data is the generation of accurate transcriptions....
We describe the University of Sheffield system for participation in the 2015 Multi-Genre Broadcast (...
This paper describes the specification, design and development phases of two widely used telepho...
This paper describes some recent results of our collaborative work on developing a speech recognitio...
Automatic speech recognition (ASR) in the educational environment could be a solution to address the...
This paper investigates the potential of improving a hybrid automatic speech recognition model train...
This paper presents an extended study in the topic of optimal selection of speech data from a databa...
Development of an ASR application such as a speech-oriented guidance system for a real environment i...
LREC2006: the 5th international conference on Language Resources and Evaluation, May 2006.This paper...
International audienceThis paper reports on a speech-to-text (STT) transcription system for Hungaria...
The paper addresses a scheme of lightly supervised training of an acoustic model, which exploits a l...