Parallel aligned treebanks (PATs) are linguistic corpora annotated with morphological and syntactic structures and aligned at both the sentence and sub-sentence levels. They are valuable resources for improving machine translation (MT) quality. Demand for such data has been increasing, especially for divergent language pairs. The Linguistic Data Consortium (LDC) and its academic partners have been developing Arabic-English and Chinese-English PATs for several years. This paper describes the PAT corpus creation effort for the GALE (Global Autonomous Language Exploitation) program and introduces the potential issues of scaling up this effort for the BOLT (Broad Operational Language Translation) program. Based on exist...
Prague Arabic Dependency Treebank (PADT) consists of refined multi-level linguistic annotations over...
Proceedings of the Sixth International Workshop on Treebanks and Linguistic Theories. Editors: Ko...
This paper discusses the construction of a parallel treebank currently involving ten languages from ...
Proceedings of the Workshop on Annotation and Exploitation of Parallel Corpora AEPC 2010. Editors:...
This study extends our initial efforts in building an English-Turkish parallel treebank corpus for s...
We describe a syntactically annotated parallel corpus containing typologically partly different lang...
In this paper, we report our preliminary efforts in building an English-Turkish parallel treebank co...
The availability of large multi-parallel corpora offers an enormous wealth of material to contrastiv...
We use existing tools to automatically build two parallel treebanks from existing parallel corpora. ...
This paper discusses the role played by parallel corpora in the design and implementation of fully a...
This chapter gives an overview of parallel corpora, i.e. corpora containing source texts in a given ...