Collecting sufficient language model training data for good speech recognition performance in a new domain is often difficult. How-ever, there may be other sources of data that are matched in terms of topic or style, if not both. This paper looks at the use of text normalization tools to make these data more suitable for language model training, in conjunction with mixture models to combine data from different sources. We specifically address the task of recognizing meeting speech, showing a small reduction in word error rate over a baseline language model trained from conversa-tional speech data. 1
Introduction At the current state of the art, high-accuracy speech recognition with moderate to lar...
Statistical language models (SLMs) for speech recognition have the advantage of robustness, and gram...
Adaptive training aims at reducing the influence of speaker, channel and environment variability on...
The move towards larger vocabulary Automatic Speech Recognition (ASR) systems places greater demands...
We attemped to improve recognition accuracy by reduc-ing the inadequacies of the lexicon and languag...
This paper examines techniques for speaker normalisation and adaptation that are applied in training...
This article describes a methodology for collecting text from the Web to match a target sublanguage ...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
Speech recognition performance is severely aected when the lexical, syntactic, or semantic character...
Speech recognition performance is severely affected when the lexical, syntactic, or semantic charact...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
Training language model made from conversational speech is difficult due to large variation of the w...
Text-to-Speech (TTS) normalization is an essential component of natural language processing (NLP) th...
Language modeling is critical and indispensable for many natural language ap-plications such as auto...
This paper presents a method for reducing the effort of transcribing user utterances to develop lang...
Introduction At the current state of the art, high-accuracy speech recognition with moderate to lar...
Statistical language models (SLMs) for speech recognition have the advantage of robustness, and gram...
Adaptive training aims at reducing the influence of speaker, channel and environment variability on...
The move towards larger vocabulary Automatic Speech Recognition (ASR) systems places greater demands...
We attemped to improve recognition accuracy by reduc-ing the inadequacies of the lexicon and languag...
This paper examines techniques for speaker normalisation and adaptation that are applied in training...
This article describes a methodology for collecting text from the Web to match a target sublanguage ...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
Speech recognition performance is severely aected when the lexical, syntactic, or semantic character...
Speech recognition performance is severely affected when the lexical, syntactic, or semantic charact...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
Training language model made from conversational speech is difficult due to large variation of the w...
Text-to-Speech (TTS) normalization is an essential component of natural language processing (NLP) th...
Language modeling is critical and indispensable for many natural language ap-plications such as auto...
This paper presents a method for reducing the effort of transcribing user utterances to develop lang...
Introduction At the current state of the art, high-accuracy speech recognition with moderate to lar...
Statistical language models (SLMs) for speech recognition have the advantage of robustness, and gram...
Adaptive training aims at reducing the influence of speaker, channel and environment variability on...