This paper describes a cognate identification method used by a lexical alignment system for French and Romanian. We combine statistical techniques and linguistic information to extract cognates from lemmatized, tagged, and sentence-aligned parallel corpora. We evaluate the cognate identification model and compare it to other methods that use purely statistical techniques. We show that the use of linguistic information in the cognate identification system significantly improves the results.
Parallel corpora such as the Canadian Hansard corpus and the International Telecommunications Union ...
This paper presents an overview of the 6th ALTA shared task that ran in 2015. The task was to identi...
The identification of cognates in natural languages is a crucial part of automatic translation lexic...
In this paper we describe an approach to automatic cognate identification in monolingual texts usin...
In this paper we describe an approach to automatic cognate identification in monolingual texts using...
The identification of cognate word pairs has recently started to attract the attention of NLP resear...
Some authors (Simard et al.; Melamed; Danielsson & Mühlenbock) have suggested measures of simila...
The identification of cognates between two distinct languages has recently started to attract the ...
One of the most common lexical transformations between cognates in French and English is the presen...
Cognates are words in different languages that have similar spelling and meaning. They can help seco...
This paper tests the influence of the training dataset dimension on a recently proposed orthographic...
We present a new automatic learning system for cognate identification. We design a linguistic-inspir...
We investigate the problem of measuring phonetic similarity, focusing on the identification of cogna...
Identification of cognates is an important component of computer assisted second language learning s...
This paper introduces CogNet, a new, large-scale lexical database that provides cognates—words of co...