Given much recent discussion and the shift in focus of the field, it is becoming apparent that the incorporation of syntax is the way forward for the current state-of-the-art in machine translation (MT). Parallel treebanks are a relatively recent innovation and appear to be ideal candidates for MT training material. However, until recently there has been no other means to build them than by hand. In this paper, we describe how we make use of new tools to automatically build a large parallel treebank and extract a set of linguistically motivated phrase pairs from it. We show that adding these phrase pairs to the translation model of a baseline phrase-based statistical MT (PBSMT) system leads to significant improvements in translation qualit...
International audienceIn this paper, we introduce two new methods dedicated to phrase based machine ...
Statistical Machine Translation (SMT) developed in the late 1980s, based initially upon a word-to-wo...
State-of-the-art statistical machine translation systems make use of a large translation table obta...
Given much recent discussion and the shift in focus of the field, it is becoming apparent that the i...
Given much recent discussion and the shift in focus of the field, it is becoming apparent that the i...
We use existing tools to automatically build two parallel treebanks from existing parallel corpora. ...
Proceedings of the Sixth International Workshop on Treebanks and Linguistic Theories. Editors: Ko...
The benefits of syntax-based approaches to data-driven machine translation (MT) are clear: given the...
The need for syntactically annotated data for use in natural language processing has increased drama...
Parallel treebanks, which comprise paired source-target parse trees aligned at sub-sentential level,...
Phrase-based statistical machine translation (PBSMT) systems represent the dominant approach in MT t...
The need for syntactically annotated data for use in natural language processing has increased drama...
The goal of a machine translation (MT) system is to automatically translate a document written in so...
This paper presents a new method to evaluate machine translation (MT) systems against a parallel tre...
Statistical Machine Translation (SMT) systems rely heavily on the quality of the phrase pairs induce...
International audienceIn this paper, we introduce two new methods dedicated to phrase based machine ...
Statistical Machine Translation (SMT) developed in the late 1980s, based initially upon a word-to-wo...
State-of-the-art statistical machine translation systems make use of a large translation table obta...
Given much recent discussion and the shift in focus of the field, it is becoming apparent that the i...
Given much recent discussion and the shift in focus of the field, it is becoming apparent that the i...
We use existing tools to automatically build two parallel treebanks from existing parallel corpora. ...
Proceedings of the Sixth International Workshop on Treebanks and Linguistic Theories. Editors: Ko...
The benefits of syntax-based approaches to data-driven machine translation (MT) are clear: given the...
The need for syntactically annotated data for use in natural language processing has increased drama...
Parallel treebanks, which comprise paired source-target parse trees aligned at sub-sentential level,...
Phrase-based statistical machine translation (PBSMT) systems represent the dominant approach in MT t...
The need for syntactically annotated data for use in natural language processing has increased drama...
The goal of a machine translation (MT) system is to automatically translate a document written in so...
This paper presents a new method to evaluate machine translation (MT) systems against a parallel tre...
Statistical Machine Translation (SMT) systems rely heavily on the quality of the phrase pairs induce...
International audienceIn this paper, we introduce two new methods dedicated to phrase based machine ...
Statistical Machine Translation (SMT) developed in the late 1980s, based initially upon a word-to-wo...
State-of-the-art statistical machine translation systems make use of a large translation table obta...