This thesis explores the application of unsupervised clustering for domain adaptation of machine translation systems. As in many artificial intelligence areas, creating a system that generalizes to any domain is a hard problem in machine translation. Domain adaptation, in contrast, aims to specialize a generic (or otherwise intended) system for a particular domain and translate text within that domain better. In this thesis, experiments on using unsupervised learning as a first step in solving this problem are explored, posing the research questions a) how unstructured data could be used for domain adaptation and b) how a bespoke translation of an input document could be provided. In the first part of the thesis, background theory is presen...
We propose a domain specific model for statistical machine translation. It is well-known that domain...
An index or topic hierarchy of full-text documents can organize a domain and speed information retri...
International audienceWe present a technique to improve out-of-domain statistical parsing by reducin...
Domain Adaptation in Machine Translation means to take a machine translation system that is restrict...
This chapter describes a novel multistage method for linguistic clustering of large collections of t...
Translated texts are distinctively different from original ones, to the extent that su-pervised text...
We propose an unsupervised method for clus-tering the translations of a word, such that the translat...
Prior work has shown that generalization of data in an Example Based Machine Translation (EBMT) syst...
In this work, we tackle the problem of language and translation models domain-adaptation without exp...
In this work, we tackle the problem of language and translation models domain-adaptation without exp...
Supervised and unsupervised learning have been the focus of critical research in the areas of machin...
Supervised and unsupervised learning have been the focus of critical research in the areas of machin...
Supervised and unsupervised learning have been the focus of critical research in the areas of machin...
With the development of statistical machine translation, we have ready-to-use tools that can transla...
International audienceWe present a simple and effective way to perform out-of-domain statistical par...
We propose a domain specific model for statistical machine translation. It is well-known that domain...
An index or topic hierarchy of full-text documents can organize a domain and speed information retri...
International audienceWe present a technique to improve out-of-domain statistical parsing by reducin...
Domain Adaptation in Machine Translation means to take a machine translation system that is restrict...
This chapter describes a novel multistage method for linguistic clustering of large collections of t...
Translated texts are distinctively different from original ones, to the extent that su-pervised text...
We propose an unsupervised method for clus-tering the translations of a word, such that the translat...
Prior work has shown that generalization of data in an Example Based Machine Translation (EBMT) syst...
In this work, we tackle the problem of language and translation models domain-adaptation without exp...
In this work, we tackle the problem of language and translation models domain-adaptation without exp...
Supervised and unsupervised learning have been the focus of critical research in the areas of machin...
Supervised and unsupervised learning have been the focus of critical research in the areas of machin...
Supervised and unsupervised learning have been the focus of critical research in the areas of machin...
With the development of statistical machine translation, we have ready-to-use tools that can transla...
International audienceWe present a simple and effective way to perform out-of-domain statistical par...
We propose a domain specific model for statistical machine translation. It is well-known that domain...
An index or topic hierarchy of full-text documents can organize a domain and speed information retri...
International audienceWe present a technique to improve out-of-domain statistical parsing by reducin...