Most statistical machine translation systems employ a word-based alignment model. In this paper we demonstrate that word-based align-ment is a major cause of translation errors. We propose a new alignment model based on shal-low phrase structures, and the structures can be automatically acquired from parallel corpus. This new model achieved over 10 % error reduc-tion for our spoken language translation task. 1 In t roduct ion Most (if not all) statistical machine translation systems employ a word-based alignment model (Brown et al., 1993; Vogel, Ney, and Tillman, 1996; Wang and Waibel, 1997), which treats words in a sentence as independent entities an
This article presents a method for aligning words between translations, that imposes a composition...
Current word alignment models for statistical machine translation do not address morphology beyond m...
In this paper, we present a novel distortion model for phrase-based statistical machine translation....
Statistical Word Alignments represent lexical word-to-word translations between source and target la...
UnrestrictedAll state of the art statistical machine translation systems and many example-based mach...
The goal of a machine translation (MT) system is to automatically translate a document written in so...
We present a method for improving word alignment for statistical syntax-based ma-chine translation t...
In most statistical machine translation (SMT) systems, bilingual segments are ex-tracted via word al...
When parallel or comparable corpora are harvested from the web, there is typically a tradeoff betwee...
Automatic word alignment is a key step in training statistical machine translation systems. Despite ...
The main problems of statistical word alignment lie in the facts that source words can only be align...
The parameters of statistical translation models are typically estimated from sentence-aligned paral...
In machine translation, the alignment of corpora has evolved into a mature research area, aimed at p...
Current word alignment models for statisti-cal machine translation do not address mor-phology beyond...
We argue that learning word alignments through a compositionally-structured, joint process yields hi...
This article presents a method for aligning words between translations, that imposes a composition...
Current word alignment models for statistical machine translation do not address morphology beyond m...
In this paper, we present a novel distortion model for phrase-based statistical machine translation....
Statistical Word Alignments represent lexical word-to-word translations between source and target la...
UnrestrictedAll state of the art statistical machine translation systems and many example-based mach...
The goal of a machine translation (MT) system is to automatically translate a document written in so...
We present a method for improving word alignment for statistical syntax-based ma-chine translation t...
In most statistical machine translation (SMT) systems, bilingual segments are ex-tracted via word al...
When parallel or comparable corpora are harvested from the web, there is typically a tradeoff betwee...
Automatic word alignment is a key step in training statistical machine translation systems. Despite ...
The main problems of statistical word alignment lie in the facts that source words can only be align...
The parameters of statistical translation models are typically estimated from sentence-aligned paral...
In machine translation, the alignment of corpora has evolved into a mature research area, aimed at p...
Current word alignment models for statisti-cal machine translation do not address mor-phology beyond...
We argue that learning word alignments through a compositionally-structured, joint process yields hi...
This article presents a method for aligning words between translations, that imposes a composition...
Current word alignment models for statistical machine translation do not address morphology beyond m...
In this paper, we present a novel distortion model for phrase-based statistical machine translation....