One particular problem in large vocabulary continuous speech recognition for low-resourced languages is finding relevant training data for the statistical language models. Large amount of data is required, because models should estimate the probability for all possible word sequences. For Finnish, Estonian and the other fenno-ugric languages a special problem with the data is the huge amount of different word forms that are common in normal speech. The same problem exists also in other language technology applications such as machine translation, information retrieval, and in some extent also in other morphologically rich languages. In this paper we present methods and evaluations in four recent language modeling topics: selecting conversat...
For resource rich languages, recent works have shown Neu-ral Network based Language Models (NNLMs) t...
We study continuous speech recognition based on sub-word units found in an unsupervised fashion. For...
For purposes of automated speech recognition in under-resourced environments, techniques used to sha...
The development of a speech recognition system requires at least three resources: a large labeled sp...
We study class-based n-gram and neural network language models for very large vocabulary speech reco...
Speech recognition is the process of converting acoustic waveforms into text. This requires models t...
<p>For resource rich languages, recent works have shown Neural Network based Language Models (NNLMs)...
Speech recognition is the task of decoding an acoustic speech signal into a written text. Large voca...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
Speech technology applications for major languages are becoming widely available, but for many other...
Automatic speech recognition systems with a large vocabulary and other natural language processing a...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
How can we effectively develop speech technology for languages where no transcribed data is availabl...
We describe a novel way to implement subword language models in speech recognition systems based on ...
By definition, words that are not present in a recognition vocabulary are called out-of-vocabulary (...
For resource rich languages, recent works have shown Neu-ral Network based Language Models (NNLMs) t...
We study continuous speech recognition based on sub-word units found in an unsupervised fashion. For...
For purposes of automated speech recognition in under-resourced environments, techniques used to sha...
The development of a speech recognition system requires at least three resources: a large labeled sp...
We study class-based n-gram and neural network language models for very large vocabulary speech reco...
Speech recognition is the process of converting acoustic waveforms into text. This requires models t...
<p>For resource rich languages, recent works have shown Neural Network based Language Models (NNLMs)...
Speech recognition is the task of decoding an acoustic speech signal into a written text. Large voca...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
Speech technology applications for major languages are becoming widely available, but for many other...
Automatic speech recognition systems with a large vocabulary and other natural language processing a...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
How can we effectively develop speech technology for languages where no transcribed data is availabl...
We describe a novel way to implement subword language models in speech recognition systems based on ...
By definition, words that are not present in a recognition vocabulary are called out-of-vocabulary (...
For resource rich languages, recent works have shown Neu-ral Network based Language Models (NNLMs) t...
We study continuous speech recognition based on sub-word units found in an unsupervised fashion. For...
For purposes of automated speech recognition in under-resourced environments, techniques used to sha...