As larger and more diverse parallel texts become available, how can we lever-age heterogeneous data to train robust machine translation systems that achieve good translation quality on various test domains? This challenge has been ad-dressed so far by repurposing techniques developed for domain adaptation, such as linear mixture models which combine estimates learned on homogeneous sub-domains. However, learning from large heterogeneous corpora is quite different from standard adaptation tasks with clear domain distinctions. In this paper, we show that linear mixture models can re-liably improve translation quality in very heterogeneous training conditions, even if the mixtures do not use any domain knowledge and attempt to learn generic mo...
A data-driven approach to model translation suffers from the data mismatch problem and demands domai...
Machine translation is the application of machines to translate text or speech from one natural lang...
Statistical machine translation (SMT) systems use statistical learning methods to learn how to trans...
As larger and more diverse parallel texts become available, how can we lever-age heterogeneous data ...
Linear mixture models are a simple and effective technique for performing domain adaptation of trans...
Previous research on domain adaptation (DA) for statistical machine translation (SMT) has mainly foc...
This paper reports experiments on adapting components of a Statistical Machine Trans-lation (SMT) sy...
In this paper, we propose two extensions to the vector space model (VSM) adaptation tech-nique (Chen...
We consider two problems of NMT domain adaptation using meta-learning. First, we want to reach domai...
Statistical machine translation is often faced with the problem of combining training data from many...
Joint models have recently shown to improve the state-of-the-art in machine translation (MT). We app...
Sequence to sequence (SEQ2SEQ) models often lack diversity in their generated translations. This can...
Mixture modelling is a standard pattern classication technique. However, in statistical machine tran...
The problem of language model adaptation in statistical machine translation is considered. A mixtur...
We propose the on-the-fly ensembling of a machine translation model with an LLM, prompted on the sam...
A data-driven approach to model translation suffers from the data mismatch problem and demands domai...
Machine translation is the application of machines to translate text or speech from one natural lang...
Statistical machine translation (SMT) systems use statistical learning methods to learn how to trans...
As larger and more diverse parallel texts become available, how can we lever-age heterogeneous data ...
Linear mixture models are a simple and effective technique for performing domain adaptation of trans...
Previous research on domain adaptation (DA) for statistical machine translation (SMT) has mainly foc...
This paper reports experiments on adapting components of a Statistical Machine Trans-lation (SMT) sy...
In this paper, we propose two extensions to the vector space model (VSM) adaptation tech-nique (Chen...
We consider two problems of NMT domain adaptation using meta-learning. First, we want to reach domai...
Statistical machine translation is often faced with the problem of combining training data from many...
Joint models have recently shown to improve the state-of-the-art in machine translation (MT). We app...
Sequence to sequence (SEQ2SEQ) models often lack diversity in their generated translations. This can...
Mixture modelling is a standard pattern classication technique. However, in statistical machine tran...
The problem of language model adaptation in statistical machine translation is considered. A mixtur...
We propose the on-the-fly ensembling of a machine translation model with an LLM, prompted on the sam...
A data-driven approach to model translation suffers from the data mismatch problem and demands domai...
Machine translation is the application of machines to translate text or speech from one natural lang...
Statistical machine translation (SMT) systems use statistical learning methods to learn how to trans...