Linear mixture models are a simple and effective technique for performing domain adaptation of translation models in statis-tical MT. In this paper, we identify and correct two weaknesses of this method. First, we show that standard maximum-likelihood weights are biased toward large corpora, and that a straightforward pre-processing step that down-samples phrase tables can be used to counter this bias. Second, we show that features inspired by prototypical linear mixtures can be used to loosely simulate discriminative training for mixture models, with results that are almost certainly superior to true discrimi-native training. Taken together, these en-hancements yield BLEU gains of approx-imately 1.5 over existing linear mixture techniques ...
Current state-of-the-art Statistical Machine Translation systems are based on log-linear models that...
This paper proposes a new approach to domain adaptation in statistical machine translation (SMT) bas...
Discriminative training, a.k.a. tuning, is an important part of Statistical Machine Translation. Thi...
As larger and more diverse parallel texts become available, how can we lever-age heterogeneous data ...
Previous research on domain adaptation (DA) for statistical machine translation (SMT) has mainly foc...
In this paper, we propose two extensions to the vector space model (VSM) adaptation tech-nique (Chen...
This paper reports experiments on adapting components of a Statistical Machine Trans-lation (SMT) sy...
Mixture modelling is a standard pattern classication technique. However, in statistical machine tran...
The problem of language model adaptation in statistical machine translation is considered. A mixtur...
We describe a new approach to SMT adapta-tion that weights out-of-domain phrase pairs according to t...
Statistical machine translation (SMT) systems use statistical learning methods to learn how to trans...
Machine translation is the application of machines to translate text or speech from one natural lang...
Machine translation represents one of the core tasks in natural language processing: performing an a...
Statistical machine translation is often faced with the problem of combining training data from many...
We describe a new approach to SMT adaptation that weights out-of-domain phrase pairs according to th...
Current state-of-the-art Statistical Machine Translation systems are based on log-linear models that...
This paper proposes a new approach to domain adaptation in statistical machine translation (SMT) bas...
Discriminative training, a.k.a. tuning, is an important part of Statistical Machine Translation. Thi...
As larger and more diverse parallel texts become available, how can we lever-age heterogeneous data ...
Previous research on domain adaptation (DA) for statistical machine translation (SMT) has mainly foc...
In this paper, we propose two extensions to the vector space model (VSM) adaptation tech-nique (Chen...
This paper reports experiments on adapting components of a Statistical Machine Trans-lation (SMT) sy...
Mixture modelling is a standard pattern classication technique. However, in statistical machine tran...
The problem of language model adaptation in statistical machine translation is considered. A mixtur...
We describe a new approach to SMT adapta-tion that weights out-of-domain phrase pairs according to t...
Statistical machine translation (SMT) systems use statistical learning methods to learn how to trans...
Machine translation is the application of machines to translate text or speech from one natural lang...
Machine translation represents one of the core tasks in natural language processing: performing an a...
Statistical machine translation is often faced with the problem of combining training data from many...
We describe a new approach to SMT adaptation that weights out-of-domain phrase pairs according to th...
Current state-of-the-art Statistical Machine Translation systems are based on log-linear models that...
This paper proposes a new approach to domain adaptation in statistical machine translation (SMT) bas...
Discriminative training, a.k.a. tuning, is an important part of Statistical Machine Translation. Thi...