We introduce a new measure of distance between languages based on word embedding, called word embedding language divergence (WELD). WELD is defined as divergence between unified similarity distribution of words between languages. Using such a measure, we perform language comparison for fifty natural languages and twelve genetic languages. Our natural language dataset is a collection of sentence-aligned parallel corpora from bible translations for fifty languages spanning a variety of language families. Although we use parallel corpora, which guarantees having the same content in all languages, interestingly in many cases languages within the same family cluster together. In addition to natural languages, we perform language comparison for t...
In Ref. [13], Petroni and Serva discuss the use of Levenshtein distances (LD) between words referrin...
The idea of measuring distance between languages seems to have its roots in the work of the French e...
Phylogenetic analyses of languages need to explicitly address whether the languages un-der considera...
Languages evolve over time according to a process in which reproduction, mutation and extinction are...
none7Objective: To propose a new approach for comparing genetic and linguistic diversity in populati...
Objective: To propose a new approach for comparing genetic and linguistic diversity in populations b...
Objective: To propose a new approach for comparing genetic and linguistic diversity in populations ...
The evolution of languages closely resembles the evolution of haploid organisms. This similarity has...
Objectives: The notion that patterns of linguistic and biological variation may cast light on each ...
The recent availability of typological databases such as World Atlas of Language Structures(WALS) ha...
The idea of measuring distance between languages seems to have its roots in the work of the French e...
The notion that patterns of linguistic and biological variation may cast light on each other and on ...
We consider a large size population which evolves according to neutral haploid reproduction. The gen...
In this study we relate language differences on a global scale with genetic distances for the same p...
Do all languages convey semantic knowledge in the same way? If language simply mirrors the structure...
In Ref. [13], Petroni and Serva discuss the use of Levenshtein distances (LD) between words referrin...
The idea of measuring distance between languages seems to have its roots in the work of the French e...
Phylogenetic analyses of languages need to explicitly address whether the languages un-der considera...
Languages evolve over time according to a process in which reproduction, mutation and extinction are...
none7Objective: To propose a new approach for comparing genetic and linguistic diversity in populati...
Objective: To propose a new approach for comparing genetic and linguistic diversity in populations b...
Objective: To propose a new approach for comparing genetic and linguistic diversity in populations ...
The evolution of languages closely resembles the evolution of haploid organisms. This similarity has...
Objectives: The notion that patterns of linguistic and biological variation may cast light on each ...
The recent availability of typological databases such as World Atlas of Language Structures(WALS) ha...
The idea of measuring distance between languages seems to have its roots in the work of the French e...
The notion that patterns of linguistic and biological variation may cast light on each other and on ...
We consider a large size population which evolves according to neutral haploid reproduction. The gen...
In this study we relate language differences on a global scale with genetic distances for the same p...
Do all languages convey semantic knowledge in the same way? If language simply mirrors the structure...
In Ref. [13], Petroni and Serva discuss the use of Levenshtein distances (LD) between words referrin...
The idea of measuring distance between languages seems to have its roots in the work of the French e...
Phylogenetic analyses of languages need to explicitly address whether the languages un-der considera...