Sentence alignment is central to aligning Chinese-English bilingual corpora. This paper analyzes the characteristics of the length-based method and the lexical-information method. First, we propose the N-Path method, which combines length and lexical information and thereby avoids both the shortcomings of the length-based method and the limitations of lexical-information methods. Second, the N-Path alignment method yields an expanded set of candidate alignments, which improves alignment quality. Finally, synonym-expansion alignment selects the best candidate from this set. Experimental results show that our methods can be applied to general text processing and improve alignment accuracy. © 2014 Taylor & Francis Group, London
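The abstract gives no algorithmic detail, so the following is only an illustrative sketch of how a length score and a lexical score might be combined while keeping several candidate alignment paths. It is not the paper's N-Path implementation. The function names, the bead types (1-1, 1-2, 2-1), the `weight` and `ratio` parameters, and the toy lexicon are all assumptions made for illustration.

```python
import heapq
from typing import Dict, List, Set, Tuple

# A bead links a span of source sentences to a span of target sentences:
# ((src_start, src_end), (tgt_start, tgt_end)), end-exclusive.
Bead = Tuple[Tuple[int, int], Tuple[int, int]]


def length_score(src: str, tgt: str, ratio: float = 1.0) -> float:
    """Return a value in [0, 1]; 1.0 when len(tgt) equals ratio * len(src)."""
    expected = ratio * len(src)
    longest = max(expected, len(tgt), 1)
    return 1.0 - abs(expected - len(tgt)) / longest


def lexical_score(src: str, tgt: str, lexicon: Dict[str, Set[str]]) -> float:
    """Fraction of source words with a lexicon translation found in tgt."""
    words = src.lower().split()
    if not words:
        return 0.0
    hits = sum(1 for w in words if any(t in tgt for t in lexicon.get(w, ())))
    return hits / len(words)


def n_best_alignments(
    src: List[str],
    tgt: List[str],
    lexicon: Dict[str, Set[str]],
    n: int = 3,
    weight: float = 0.5,   # assumed mixing weight between the two scores
    ratio: float = 1.0,    # assumed target/source length ratio
) -> List[Tuple[float, List[Bead]]]:
    """Keep up to n highest-scoring alignment paths (hypothetical sketch)."""
    moves = [(1, 1), (1, 2), (2, 1)]  # assumed bead types
    # best[i][j]: up to n (score, beads) paths covering src[:i] and tgt[:j]
    best = [[[] for _ in range(len(tgt) + 1)] for _ in range(len(src) + 1)]
    best[0][0] = [(0.0, [])]
    for i in range(len(src) + 1):
        for j in range(len(tgt) + 1):
            if i == 0 and j == 0:
                continue
            candidates = []
            for di, dj in moves:
                pi, pj = i - di, j - dj
                if pi < 0 or pj < 0:
                    continue
                s = " ".join(src[pi:i])
                t = "".join(tgt[pj:j])
                bead_score = (weight * length_score(s, t, ratio)
                              + (1 - weight) * lexical_score(s, t, lexicon))
                for score, beads in best[pi][pj]:
                    bead = ((pi, i), (pj, j))
                    candidates.append((score + bead_score, beads + [bead]))
            best[i][j] = heapq.nlargest(n, candidates, key=lambda c: c[0])
    return best[len(src)][len(tgt)]


# Toy data, invented for illustration only.
lexicon = {"i": {"我"}, "like": {"喜欢"}, "tea": {"茶"},
           "he": {"他"}, "reads": {"读"}, "books": {"书"}}
paths = n_best_alignments(["I like tea", "He reads books"],
                          ["我喜欢茶", "他读书"], lexicon)
```

In this sketch, a later selection step could rescore the returned candidate paths with a lexicon whose entries have been widened with synonyms, which is one possible reading of the synonym-expansion step described above.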
We introduce a word alignment framework that facilitates the incorporation of syntax encoded in bil...
Most of the current Chinese word alignment tasks often adopt word segmentation systems firstly to id...
Parallel corpora are critical resources for machine translation research and development since paral...
In this paper, we use a lexical method to do sentence alignment for an English-Chinese co...
One of the bilingual corpus processing methods is the alignment of two languages on each linguistic ...
Bilingual alignment is a crucial problem in the research of natural language processing, and word al...
In this paper, a method for the word alignment of English-Chinese corpus based on chunks is proposed...
Word alignment in bilingual or multilingual parallel corpora has been a challenging issue for natura...
We describe our experience with automatic alignment of sentences in parallel English-Chinese texts. ...
information in languages other than English has grown significantly in recent years. This highlights...
This paper first describes an experiment to construct an English-Chinese parallel corpus, then apply...
This paper describes an accurate and robust text alignment system for structurally different langua...
©1997 MIT. This paper presents an algorithm capable of identifying the translation for ea...
Automatic word alignment is an important technology for extracting translation knowledge from parall...