We present a new automatic learning system for cognate identification. We design a linguistic-inspired substitution matrix to align sensibly our training dataset. We introduce a PAM-like technique, similar to the one successfully used in biological sequence analysis, in order to learn substitution parameters. We propose a novel family of parameterised string similarity measures and we apply them together with the PAM-like matrices to the task of cognate identification. We train and test our proposal on standard datasets of Indo-European languages in orthographic format based on the Latin alphabet, but it could easily be adapted to datasets using any other alphabet, including the phonetic alphabet if data was available. We compare our system...
We present a system for computing similarity between pairs of words. Our system is based on Pair H...
This paper presents a methodology for calculating a modified Levenshtein edit distance between chara...
The identification of cognates in natural languages is a crucial part of automatic translation lexic...
This paper tests the influence of the training dataset dimension on a recently proposed orthographic...
We investigate the problem of measuring phonetic similarity, focusing on the identification of cogna...
Natural languages that originate from a common ancestor are genetically related, words are the core ...
We apply to the task of linguistic phylogenetic inference a successful cognate identification learni...
In this paper we describe an approach to automatic cognate identification in monolingual texts using...
In this paper we describe an approach to automatic cognate identification in mono-lingual texts usin...
AbstractWe propose a sequence labeling approach to cognate production based on the orthography of th...
The identification of cognate word pairs has recently started to attract the attention of NLP resear...
The coinciding form and meaning similarity of cognates, e.g. 'flamme' (French), 'Flamme' (German), '...
This paper describes a cognate identifica-tion method, used by a lexical alignment system for French...
The coinciding form and meaning similarity of cognates, e.g. ‘flamme’ (French), ‘Flamme’ (German), ‘...
<div><p>The coinciding form and meaning similarity of cognates, e.g. ‘flamme’ (French), ‘Flamme’ (Ge...
We present a system for computing similarity between pairs of words. Our system is based on Pair H...
This paper presents a methodology for calculating a modified Levenshtein edit distance between chara...
The identification of cognates in natural languages is a crucial part of automatic translation lexic...
This paper tests the influence of the training dataset dimension on a recently proposed orthographic...
We investigate the problem of measuring phonetic similarity, focusing on the identification of cogna...
Natural languages that originate from a common ancestor are genetically related, words are the core ...
We apply to the task of linguistic phylogenetic inference a successful cognate identification learni...
In this paper we describe an approach to automatic cognate identification in monolingual texts using...
In this paper we describe an approach to automatic cognate identification in mono-lingual texts usin...
AbstractWe propose a sequence labeling approach to cognate production based on the orthography of th...
The identification of cognate word pairs has recently started to attract the attention of NLP resear...
The coinciding form and meaning similarity of cognates, e.g. 'flamme' (French), 'Flamme' (German), '...
This paper describes a cognate identifica-tion method, used by a lexical alignment system for French...
The coinciding form and meaning similarity of cognates, e.g. ‘flamme’ (French), ‘Flamme’ (German), ‘...
<div><p>The coinciding form and meaning similarity of cognates, e.g. ‘flamme’ (French), ‘Flamme’ (Ge...
We present a system for computing similarity between pairs of words. Our system is based on Pair H...
This paper presents a methodology for calculating a modified Levenshtein edit distance between chara...
The identification of cognates in natural languages is a crucial part of automatic translation lexic...