Given much recent discussion and the shift in focus of the field, it is becoming apparent that the incorporation of syntax is the way forward for the current state-of-the-art in machine translation (MT). Parallel treebanks are a relatively recent innovation and appear to be ideal candidates for MT training material. However, until recently there has been no other means to build them than by hand. In this paper, we describe how we make use of new tools to automatically build a large parallel treebank and extract a set of linguistically motivated phrase pairs from it. We show that adding these phrase pairs to the translation model of a baseline phrase-based statistical MT (PBSMT) system leads to significant improvements in translation qualit...
Parallel treebanks, which comprise paired source-target parse trees aligned at sub-sentential level,...
Phrase-based statistical machine translation (PBSMT) systems represent the dominant approach in MT t...
We use existing tools to automatically build two parallel treebanks from existing parallel corpora. ...
Proceedings of the Sixth International Workshop on Treebanks and Linguistic Theories. Editors: Ko...
The benefits of syntax-based approaches to data-driven machine translation (MT) are clear: given the...
The need for syntactically annotated data for use in natural language processing has increased drama...