This paper presents a methodology for calculating a modified Levenshtein edit distance between character strings, and applies it to the task of automated cognate identification from non-parallel (comparable) corpora. This task is an important stage in developing MT systems and bilingual dictionaries beyond the coverage of traditionally used aligned parallel corpora, which can be used for finding translation equivalents for the ‘long tail’ in Zipfian distribution: low-frequency and usually unambiguous lexical items in closely-related languages (many of those often under-resourced). Graphonological Levenshtein edit distance relies on editing hierarchical representations of phonological features for graphemes (graphonological representations) ...
In this paper we describe an approach to automatic cognate identification in mono-lingual texts usin...
This paper presents a solution to the problem of matching personal names in English to the same name...
In this paper a range of methods for measuring the phonetic distance between dialectal variants are ...
We present three methods for weighting edit distance algorithms based on linguistic information. The...
Researchers on bilingual processing can benefit from computational tools developed in artificial int...
Identification of cognates is an important component of computer assisted second language learning s...
Lexical Similarity (LS) between two languages uncovers many interesting linguistic insights such as ...
The Levenshtein distance is an established metric to represent phonological distances between dialec...
Abstract: This article proposes the use of an extended weighted Levenshtein distance to model the ti...
In this paper we describe an approach to automatic cognate identification in monolingual texts using...
The idea of measuring distance between languages seems to have its roots in the work of the French e...
Levenshtein is a Minimum Edit Distance method; it is usually used in spell checking applications for...
The Levenshtein distance is a simple distance metric derived from the number of edit operations need...
We present a new automatic learning system for cognate identification. We design a linguistic-inspir...
This paper tests the influence of the training dataset dimension on a recently proposed orthographic...
In this paper we describe an approach to automatic cognate identification in mono-lingual texts usin...
This paper presents a solution to the problem of matching personal names in English to the same name...
In this paper a range of methods for measuring the phonetic distance between dialectal variants are ...
We present three methods for weighting edit distance algorithms based on linguistic information. The...
Researchers on bilingual processing can benefit from computational tools developed in artificial int...
Identification of cognates is an important component of computer assisted second language learning s...
Lexical Similarity (LS) between two languages uncovers many interesting linguistic insights such as ...
The Levenshtein distance is an established metric to represent phonological distances between dialec...
Abstract: This article proposes the use of an extended weighted Levenshtein distance to model the ti...
In this paper we describe an approach to automatic cognate identification in monolingual texts using...
The idea of measuring distance between languages seems to have its roots in the work of the French e...
Levenshtein is a Minimum Edit Distance method; it is usually used in spell checking applications for...
The Levenshtein distance is a simple distance metric derived from the number of edit operations need...
We present a new automatic learning system for cognate identification. We design a linguistic-inspir...
This paper tests the influence of the training dataset dimension on a recently proposed orthographic...
In this paper we describe an approach to automatic cognate identification in mono-lingual texts usin...
This paper presents a solution to the problem of matching personal names in English to the same name...
In this paper a range of methods for measuring the phonetic distance between dialectal variants are ...