In the speech recognition of highly inflecting or compounding languages, the traditional word-based language modeling is problematic. As the number of distinct word forms can grow very large, it becomes difficult to train language models that are both effective and cover the words of the language well. In the literature, several methods have been proposed for basing the language modeling on sub-word units instead of whole words. However, to our knowledge, considerable improvements in speech recognition performance have not been reported. In this article, we present a language-independent algorithm for discovering word fragments in an unsupervised manner from text. The algorithm uses the Minimum Description Length principle to find an invent...
In the automatic speech recognition of agglutinative and morphologically rich languages, the recogni...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
This paper presents an algorithm for the unsuper-vised learning of a simple morphology of a nat-ural...
We study continuous speech recognition based on sub-word units found in an unsupervised fashion. For...
We study class-based n-gram and neural network language models for very large vocabulary speech reco...
Automatic speech recognition systems are devices or computer programs that convert human speech into...
Automatic speech recognition systems are devices or computer programs that convert human speech into...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
By definition, words that are not present in a recognition vocabulary are called out-of-vocabulary (...
It is practically impossible to build a word-based lexicon for speech recognition in agglutinative l...
In order to develop computer applications that successfully process natural language data (text and ...
We present two methods for unsupervised segmentation of words into morpheme-like units. The model ...
We present two methods for unsupervised segmentation of words into morpheme-like units. The model ut...
This paper compares two approaches to lexical compound word reconstruction from a speech recognizer ...
Automatic speech recognition will soon be a part of everyday life. Even today many people use the sp...
In the automatic speech recognition of agglutinative and morphologically rich languages, the recogni...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
This paper presents an algorithm for the unsuper-vised learning of a simple morphology of a nat-ural...
We study continuous speech recognition based on sub-word units found in an unsupervised fashion. For...
We study class-based n-gram and neural network language models for very large vocabulary speech reco...
Automatic speech recognition systems are devices or computer programs that convert human speech into...
Automatic speech recognition systems are devices or computer programs that convert human speech into...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
By definition, words that are not present in a recognition vocabulary are called out-of-vocabulary (...
It is practically impossible to build a word-based lexicon for speech recognition in agglutinative l...
In order to develop computer applications that successfully process natural language data (text and ...
We present two methods for unsupervised segmentation of words into morpheme-like units. The model ...
We present two methods for unsupervised segmentation of words into morpheme-like units. The model ut...
This paper compares two approaches to lexical compound word reconstruction from a speech recognizer ...
Automatic speech recognition will soon be a part of everyday life. Even today many people use the sp...
In the automatic speech recognition of agglutinative and morphologically rich languages, the recogni...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
This paper presents an algorithm for the unsuper-vised learning of a simple morphology of a nat-ural...