fast align is a simple, fast, and effi-cient approach for word alignment based on the IBM model 2. fast align per-forms well for language pairs with rel-atively similar word orders; however, it does not perform well for language pairs with drastically different word orders. We propose a segmenting-reversing reorder-ing process to solve this problem by al-ternately applying fast align and re-ordering source sentences during train-ing. Experimental results with Japanese-English translation demonstrate that the proposed approach improves the per-formance of fast align significantly without the loss of efficiency. Experiments using other languages are also reported.
International audienceWord alignments identify translational correspondences between words in a para...
We describe two methods to improve SMT accuracy using shallow syntax information. First, we use chun...
Generative word alignment models, such as IBM Models, are restricted to one-to-many alignment, and c...
We present an improved method for automated word alignment of parallel texts which takes advantage o...
We introduce a simple method to pack words for statistical word alignment. Our goal is to simplify t...
Previous studies of the effect of word alignment on translation quality in SMT generally explore lin...
Abstract. We propose a lexicalized syntactic reordering framework for cross-language word aligning a...
Fast alignment is essential for many nat-ural language tasks. But in the setting of monolingual alig...
In this paper, we present a new word alignment combination approach on language pairs where one lang...
This paper describes how word alignment information makes machine translation more efficient. Follow...
This paper compares four preprocess-ing approaches for word alignment: 1) sentence removal approach,...
We present a word alignment framework that can incorporate partial manual align-ments. The core of t...
In this paper we describe a statistical tech-nique for aligning sentences with their translations in...
We describe two methods to improve SMT accuracy using shallow syntax information. First, we use chun...
International audienceThis paper describes a new alignment method that extracts high quality multi-w...
International audienceWord alignments identify translational correspondences between words in a para...
We describe two methods to improve SMT accuracy using shallow syntax information. First, we use chun...
Generative word alignment models, such as IBM Models, are restricted to one-to-many alignment, and c...
We present an improved method for automated word alignment of parallel texts which takes advantage o...
We introduce a simple method to pack words for statistical word alignment. Our goal is to simplify t...
Previous studies of the effect of word alignment on translation quality in SMT generally explore lin...
Abstract. We propose a lexicalized syntactic reordering framework for cross-language word aligning a...
Fast alignment is essential for many nat-ural language tasks. But in the setting of monolingual alig...
In this paper, we present a new word alignment combination approach on language pairs where one lang...
This paper describes how word alignment information makes machine translation more efficient. Follow...
This paper compares four preprocess-ing approaches for word alignment: 1) sentence removal approach,...
We present a word alignment framework that can incorporate partial manual align-ments. The core of t...
In this paper we describe a statistical tech-nique for aligning sentences with their translations in...
We describe two methods to improve SMT accuracy using shallow syntax information. First, we use chun...
International audienceThis paper describes a new alignment method that extracts high quality multi-w...
International audienceWord alignments identify translational correspondences between words in a para...
We describe two methods to improve SMT accuracy using shallow syntax information. First, we use chun...
Generative word alignment models, such as IBM Models, are restricted to one-to-many alignment, and c...