This paper tests the influence of the training dataset dimension on a recently proposed orthographic learning system, inspired from biological sequence analysis and successfully applied to cognate identification. This system automatically aligns a given set of cognate pairs producing a meaningful training dataset, learns from it substitution parameters using a PAM-like technique and utilises them to recognise cognate pairs. The results show no difference in the performance when training the system with about 650 cognate pairs extracted from 6 Indo-European languages or with about 62,000 cognate pairs extracted from 76 Indo-European languages. In both cases the system outperforms all comparable orthographic and phonetic methods previously pr...
International audienceCognate prediction is the task of generating, in a given language, the likely ...
With increasing amounts of digitally available data from all over the world, manual annotation of co...
This study describes the structure and the results of the SIGTYP 2022 shared task on the prediction ...
We present a new automatic learning system for cognate identification. We design a linguistic-inspir...
Natural languages that originate from a common ancestor are genetically related, words are the core ...
We investigate the problem of measuring phonetic similarity, focusing on the identification of cogna...
AbstractWe propose a sequence labeling approach to cognate production based on the orthography of th...
We apply to the task of linguistic phylogenetic inference a successful cognate identification learni...
The identification of cognate word pairs has recently started to attract the attention of NLP resear...
In this paper we describe an approach to automatic cognate identification in monolingual texts using...
In this paper we describe an approach to automatic cognate identification in mono-lingual texts usin...
This paper describes an algorithm to automatically generate a list of cognates in a target language ...
This paper describes a cognate identifica-tion method, used by a lexical alignment system for French...
The identification of cognates in natural languages is a crucial part of automatic translation lexic...
We evaluate the performance of state-of-the-art algorithms for automatic cognate detection by compar...
International audienceCognate prediction is the task of generating, in a given language, the likely ...
With increasing amounts of digitally available data from all over the world, manual annotation of co...
This study describes the structure and the results of the SIGTYP 2022 shared task on the prediction ...
We present a new automatic learning system for cognate identification. We design a linguistic-inspir...
Natural languages that originate from a common ancestor are genetically related, words are the core ...
We investigate the problem of measuring phonetic similarity, focusing on the identification of cogna...
AbstractWe propose a sequence labeling approach to cognate production based on the orthography of th...
We apply to the task of linguistic phylogenetic inference a successful cognate identification learni...
The identification of cognate word pairs has recently started to attract the attention of NLP resear...
In this paper we describe an approach to automatic cognate identification in monolingual texts using...
In this paper we describe an approach to automatic cognate identification in mono-lingual texts usin...
This paper describes an algorithm to automatically generate a list of cognates in a target language ...
This paper describes a cognate identifica-tion method, used by a lexical alignment system for French...
The identification of cognates in natural languages is a crucial part of automatic translation lexic...
We evaluate the performance of state-of-the-art algorithms for automatic cognate detection by compar...
International audienceCognate prediction is the task of generating, in a given language, the likely ...
With increasing amounts of digitally available data from all over the world, manual annotation of co...
This study describes the structure and the results of the SIGTYP 2022 shared task on the prediction ...