Introduction This study investigates issues that arise when dealing with sparse data problems in developing language models. The practical problem under consideration is Spanish conversational speech recognition over the telephone. The primary data source, the Call Home Spanish corpus, is small in comparison with the English Switchboard corpus. With this problem in mind we explore alternative smoothing techniques. As decribed below, one of these techniques relies on a word-to-class mapping and an associated class bigram model [3]. Future extensions of this approach may allow for learning of more complex languages models, e.g. general stochastic regular grammars, at the class level or serve as constraints for language model adaptation within...
Continuous Speech Recognition systems require a Language Model (LM) to represent the syntactic const...
The article presents the method of building compact language model for speech recognition in devices...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
This paper compares different ways of estimating bigram language models and of representing them in ...
This paper compares different ways of estimating bigram language models and of representing them in ...
Stochastic n-gram language models have been successfully applied in continuous speech recognition fo...
Stochastic n-gram language models have been successfully applied in continuous speech recognition fo...
The move towards larger vocabulary Automatic Speech Recognition (ASR) systems places greater demands...
Recent work in phonetic speaker recognition has shown that modeling phone sequences using n-grams is...
Though the statistical language modeling plays an important role in speech recognition, there are st...
The move towards larger vocabulary Automatic Speech Recognition (ASR) systems places greater demands...
In this paper we present two new techniques for language modeling in speech recognition. The rst tec...
AbstractThis work addresses one of the common issues arising when building a speech recognition syst...
Abstract. In Continuous Speech Recognition (CSR) systems a Language Model (LM) is required to repres...
A COMPARISON OF TWO SMOOTHING METHODS FOR WORD BIGRAM MODELS Linda Bauman Peto Department of Compute...
Continuous Speech Recognition systems require a Language Model (LM) to represent the syntactic const...
The article presents the method of building compact language model for speech recognition in devices...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
This paper compares different ways of estimating bigram language models and of representing them in ...
This paper compares different ways of estimating bigram language models and of representing them in ...
Stochastic n-gram language models have been successfully applied in continuous speech recognition fo...
Stochastic n-gram language models have been successfully applied in continuous speech recognition fo...
The move towards larger vocabulary Automatic Speech Recognition (ASR) systems places greater demands...
Recent work in phonetic speaker recognition has shown that modeling phone sequences using n-grams is...
Though the statistical language modeling plays an important role in speech recognition, there are st...
The move towards larger vocabulary Automatic Speech Recognition (ASR) systems places greater demands...
In this paper we present two new techniques for language modeling in speech recognition. The rst tec...
AbstractThis work addresses one of the common issues arising when building a speech recognition syst...
Abstract. In Continuous Speech Recognition (CSR) systems a Language Model (LM) is required to repres...
A COMPARISON OF TWO SMOOTHING METHODS FOR WORD BIGRAM MODELS Linda Bauman Peto Department of Compute...
Continuous Speech Recognition systems require a Language Model (LM) to represent the syntactic const...
The article presents the method of building compact language model for speech recognition in devices...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...