Several authors (Simard et al.; Melamed; Danielsson & Mühlenbock) have proposed measures of similarity between words in different languages as additional clues for aligning parallel texts. Cognate words, such as ‘Parliament’ in English and ‘Parlement’ in French, provide extra anchors that help improve alignment quality. In this paper, we extend an alignment algorithm proposed by Ribeiro et al. by using typical contiguous and non-contiguous sequences of characters, extracted with a statistically sound method (Dias et al.). With these typical sequences, we are able to find more reliable correspondence points and improve alignment quality without resorting to heuristics to identify cognates.
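To make the idea of an orthographic word-similarity measure concrete, the following minimal sketch computes the longest common subsequence ratio (LCSR), one widely used heuristic for flagging cognate candidates. It illustrates the kind of heuristic this paper aims to avoid and is not the method proposed here; the function names `lcs_length` and `lcsr` and the lower-casing step are assumptions made for this example.

```python
def lcs_length(a: str, b: str) -> int:
    """Return the length of the longest common subsequence of a and b."""
    # prev[j] holds the LCS length of the processed prefix of a and b[:j].
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcsr(word_a: str, word_b: str) -> float:
    """Longest common subsequence ratio: LCS length over the longer word's length."""
    # Case folding is an assumption of this sketch, not something the paper specifies.
    a, b = word_a.lower(), word_b.lower()
    longest = max(len(a), len(b))
    if longest == 0:
        return 0.0
    return lcs_length(a, b) / longest


if __name__ == "__main__":
    # 'parliament' and 'parlement' share the subsequence 'parlment' (8 of 10 characters).
    print(lcsr("Parliament", "Parlement"))
```

A pair scoring above some threshold would be treated as a candidate cognate; choosing that threshold is itself a heuristic decision, which is the kind of step the approach described above seeks to avoid.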
Various methods have been proposed for aligning texts in two or more languages such as the Canadian ...
This paper describes the opportunities that arise from automatic word alignment for bilingual concor...
The identification of cognates between two distinct languages has recently started to attract the ...
This paper describes a method for extracting Portuguese-Spanish word translation equivalents from al...
We present an algorithm for bilingual word alignment that extends previous work by treating multi-wo...
Parallel corpora such as the Canadian Hansard corpus and the International Telecommunications Union ...
Abstract. Parallel text alignment is a special type of pattern recognition task aimed to discover th...
This paper describes a cognate identification method, used by a lexical alignment system for French...
This paper reports an experience on producing manual word alignments over six different language pai...
We evaluate several orthographic word similarity measures in the context of bitext word alignment. W...
In a global scenario of rapid expansion of intercultural relations and of the transmission of knowledge ...
This paper describes a method for searching word correspondences between pairs of translation sen-te...
This paper describes the work achieved in the first half of a 4-year cooperative research project #AR...
Word alignment in bilingual or multilingual parallel corpora has been a challenging issue for natura...
This paper describes an accurate and robust text alignment system for structurally different langua...