The empirical adequacy of synchronous context-free grammars of rank two (2-SCFGs) (Satta and Peserico, 2005), used in syntax-based machine translation systems such as Wu (1997), Zhang et al. (2006) and Chi-ang (2007), in terms of what alignments they induce, has been discussed in Wu (1997) and Wellington et al. (2006), but with a one-sided focus on so-called “inside-out alignments”. Other alignment configurations that cannot be induced by 2-SCFGs are identified in this pa-per, and their frequencies across a wide col-lection of hand-aligned parallel corpora are examined. Empirical lower bounds on two measures of alignment error rate, i.e. the one introduced in Och and Ney (2000) and one where only complete translation units are con-sidered, ...
Syntactic machine translation systems extract rules from bilingual, word-aligned, syntacti-cally par...
The parameters of statistical translation models are typically estimated from sentence-aligned paral...
In most statistical machine translation (SMT) systems, bilingual segments are extracted via word a...
When parallel or comparable corpora are harvested from the web, there is typically a tradeoff betwee...
UnrestrictedAll state of the art statistical machine translation systems and many example-based mach...
Most statistical machine translation systems employ a word-based alignment model. In this paper we d...
Automatic word alignment is a key step in training statistical machine translation systems. Despite ...
In most statistical machine translation (SMT) systems, bilingual segments are extracted via word ali...
Statistical Word Alignments represent lexical word-to-word translations between source and target la...
In most statistical machine translation (SMT) systems, bilingual segments are extracted via word ali...
In most statistical machine translation (SMT) systems, bilingual segments are ex-tracted via word al...
We introduce a word alignment framework that facilitates the incorporation of syntax en-coded in bil...
We present a method for improving word alignment for statistical syntax-based ma-chine translation t...
In machine translation, the alignment of corpora has evolved into a mature research area, aimed at p...
Training a state-of-the-art syntax-based statistical machine translation (MT) system to translate fr...
Syntactic machine translation systems extract rules from bilingual, word-aligned, syntacti-cally par...
The parameters of statistical translation models are typically estimated from sentence-aligned paral...
In most statistical machine translation (SMT) systems, bilingual segments are extracted via word a...
When parallel or comparable corpora are harvested from the web, there is typically a tradeoff betwee...
UnrestrictedAll state of the art statistical machine translation systems and many example-based mach...
Most statistical machine translation systems employ a word-based alignment model. In this paper we d...
Automatic word alignment is a key step in training statistical machine translation systems. Despite ...
In most statistical machine translation (SMT) systems, bilingual segments are extracted via word ali...
Statistical Word Alignments represent lexical word-to-word translations between source and target la...
In most statistical machine translation (SMT) systems, bilingual segments are extracted via word ali...
In most statistical machine translation (SMT) systems, bilingual segments are ex-tracted via word al...
We introduce a word alignment framework that facilitates the incorporation of syntax en-coded in bil...
We present a method for improving word alignment for statistical syntax-based ma-chine translation t...
In machine translation, the alignment of corpora has evolved into a mature research area, aimed at p...
Training a state-of-the-art syntax-based statistical machine translation (MT) system to translate fr...
Syntactic machine translation systems extract rules from bilingual, word-aligned, syntacti-cally par...
The parameters of statistical translation models are typically estimated from sentence-aligned paral...
In most statistical machine translation (SMT) systems, bilingual segments are extracted via word a...