Text corpus size is an important issue when building a language model (LM). This is a particularly important issue for languages where little data is available. This paper introduces an LM adaptation technique to improve an LM built using a small amount of task-dependent text with the help of a machine-translated text corpus. Icelandic speech recognition experiments were performed using data, machine translated (MT) from English to Icelandic on a word-by-word and sentence-by-sentence basis. LM interpolation using the baseline LM and an LM built from either word-by-word or sentence-by-sentence translated text reduced the word error rate significantly when manually obtained utterances used as a baseline were very sparse
The training data size is of utmost importance for statistical machine translation (SMT), since it a...
Statistical language model estimation requires large amounts of domain-specific text, which is diffi...
In this paper, we present an end-to-end solution to the development of an automatic speech recogniti...
Abstract—Text corpus size is an important issue when building a language model (LM) in particular wh...
Language modeling is an important part for both speech recognition and machine translation systems. ...
Copyright © 2015 ISCA. Direct integration of translation model (TM) probabilities into a language mo...
Building a stochastic language model (LM) for speech recog-nition requires a large corpus of target ...
Language modeling is critical and indispensable for many natural language ap-plications such as auto...
Spoken language translation (SLT) exists within one of the most challenging intersections of speech ...
Machine translation research has progressed in recent years thanks to statistical machine learning m...
Large amounts of bilingual corpora are used in the training process of statistical machine translati...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
Domain adaptation has recently gained interest in statistical machine translation to cope with the p...
Traditionally, statistical machine translation systems have relied on parallel bi-lingual data to tr...
We explore unsupervised language model adaptation techniques for Statistical Machine Translation. Th...
The training data size is of utmost importance for statistical machine translation (SMT), since it a...
Statistical language model estimation requires large amounts of domain-specific text, which is diffi...
In this paper, we present an end-to-end solution to the development of an automatic speech recogniti...
Abstract—Text corpus size is an important issue when building a language model (LM) in particular wh...
Language modeling is an important part for both speech recognition and machine translation systems. ...
Copyright © 2015 ISCA. Direct integration of translation model (TM) probabilities into a language mo...
Building a stochastic language model (LM) for speech recog-nition requires a large corpus of target ...
Language modeling is critical and indispensable for many natural language ap-plications such as auto...
Spoken language translation (SLT) exists within one of the most challenging intersections of speech ...
Machine translation research has progressed in recent years thanks to statistical machine learning m...
Large amounts of bilingual corpora are used in the training process of statistical machine translati...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
Domain adaptation has recently gained interest in statistical machine translation to cope with the p...
Traditionally, statistical machine translation systems have relied on parallel bi-lingual data to tr...
We explore unsupervised language model adaptation techniques for Statistical Machine Translation. Th...
The training data size is of utmost importance for statistical machine translation (SMT), since it a...
Statistical language model estimation requires large amounts of domain-specific text, which is diffi...
In this paper, we present an end-to-end solution to the development of an automatic speech recogniti...