This paper presents a method for reducing the effort of transcribing user utterances to develop language models for conversational speech recognition when a small number of transcribed and a large number of untranscribed utterances are available. The recognition hypotheses for untranscribed utterances are classified according to their confidence scores such that hypotheses with high confidence are used to enhance language model training. The utterances that receive low confidence can be scheduled to be manually transcribed first to improve the language model. The results of experiments using automatic transcription of the untranscribed user utterances show the proposed methods are effective in achieving improvements in recognition accuracy...
LREC2006: the 5th international conference on Language Resources and Evaluation, May 2006.This paper...
This paper investigates improving lightly supervised acous-tic model training for an archive of broa...
User interaction with voice-powered agents generates large amounts of unlabeled utterances. In this ...
In limited data domains, many effective language modeling techniques construct models with parameter...
We attemped to improve recognition accuracy by reduc-ing the inadequacies of the lexicon and languag...
Computer-Assisted Language Learning (CALL) applications for improving the oral skills of low-profi...
AbstractThis work addresses one of the common issues arising when building a speech recognition syst...
(Now with TEMIC SDS GmbH, Ulm, Germany). It has been demonstrated repeatedly that the acoustic model...
The use of the PC and Internet for placing telephone calls will present new opportunities to capture...
Automatic speech transcription systems are developed for various languages, domains,and applications...
INTERSPEECH2006: the 9th International Conference on Spoken Language Processing (ICSLP), September 1...
This disclosure describes techniques that, with user permission, use text data entered by a user to ...
This paper proposes an unsupervised, batch-type, class-based language model adaptation method for s...
AbstractThis work addresses one of the common issues arising when building a speech recognition syst...
This paper proposes an unsupervised, batch-type, class-based language model adaptation method for s...
LREC2006: the 5th international conference on Language Resources and Evaluation, May 2006.This paper...
This paper investigates improving lightly supervised acous-tic model training for an archive of broa...
User interaction with voice-powered agents generates large amounts of unlabeled utterances. In this ...
In limited data domains, many effective language modeling techniques construct models with parameter...
We attemped to improve recognition accuracy by reduc-ing the inadequacies of the lexicon and languag...
Computer-Assisted Language Learning (CALL) applications for improving the oral skills of low-profi...
AbstractThis work addresses one of the common issues arising when building a speech recognition syst...
(Now with TEMIC SDS GmbH, Ulm, Germany). It has been demonstrated repeatedly that the acoustic model...
The use of the PC and Internet for placing telephone calls will present new opportunities to capture...
Automatic speech transcription systems are developed for various languages, domains,and applications...
INTERSPEECH2006: the 9th International Conference on Spoken Language Processing (ICSLP), September 1...
This disclosure describes techniques that, with user permission, use text data entered by a user to ...
This paper proposes an unsupervised, batch-type, class-based language model adaptation method for s...
AbstractThis work addresses one of the common issues arising when building a speech recognition syst...
This paper proposes an unsupervised, batch-type, class-based language model adaptation method for s...
LREC2006: the 5th international conference on Language Resources and Evaluation, May 2006.This paper...
This paper investigates improving lightly supervised acous-tic model training for an archive of broa...
User interaction with voice-powered agents generates large amounts of unlabeled utterances. In this ...