This paper will focus on automatic methods for quantifying language similarity. This is achieved by ascribing language similarity to the similarity of text corpora. This corpus similarity will first be determined by the resemblance of the vocabulary of languages. Thereto words or parts of them such as letter n-grams are examined. Extensions like transliteration of the text data will ensure the independence of the methods from text characteristics such as the writing system used. Further analyzes will show to what extent knowledge about the distribution of words in parallel text can be used in the context of language similarity
Computing the semantic similarity between terms (or short text expressions) that have the same meani...
This paper presents a method for measuring the semantic similarity of texts, using corpus-based and ...
The present paper discusses how to measure the degree of similarity or difference in the vocabulary ...
This research addresses the problem of deriving semantic similarity between words of language using ...
In this paper we show how a single framework for computational modeling of linguistic similarity can...
In this paper we inspect a series of methods for language identification on web data. We start from ...
Deep processing of natural language requires large scale lexical resources that have sufficient cove...
Quantifying the similarity or dissimilarity between documents is an important task in authorship att...
Measuring semantic similarity between texts is calculating semantic relatedness between texts using ...
Deep processing of natural language requires large scale lexical resources that have sufficient cove...
Describing, comparing and evaluating corpora are key issues in corpus-based translation and corpus l...
Computing the semantic similarity between terms (or short text expressions) that have the same mean...
In this study, we present Dice\u27s coefficient on trigram profiles as metric for language similarit...
We present a comprehensive study of computing similarity between texts. We start from the observatio...
In this study we consider the problem of determining whether an English corpus constructed from a gi...
Computing the semantic similarity between terms (or short text expressions) that have the same meani...
This paper presents a method for measuring the semantic similarity of texts, using corpus-based and ...
The present paper discusses how to measure the degree of similarity or difference in the vocabulary ...
This research addresses the problem of deriving semantic similarity between words of language using ...
In this paper we show how a single framework for computational modeling of linguistic similarity can...
In this paper we inspect a series of methods for language identification on web data. We start from ...
Deep processing of natural language requires large scale lexical resources that have sufficient cove...
Quantifying the similarity or dissimilarity between documents is an important task in authorship att...
Measuring semantic similarity between texts is calculating semantic relatedness between texts using ...
Deep processing of natural language requires large scale lexical resources that have sufficient cove...
Describing, comparing and evaluating corpora are key issues in corpus-based translation and corpus l...
Computing the semantic similarity between terms (or short text expressions) that have the same mean...
In this study, we present Dice\u27s coefficient on trigram profiles as metric for language similarit...
We present a comprehensive study of computing similarity between texts. We start from the observatio...
In this study we consider the problem of determining whether an English corpus constructed from a gi...
Computing the semantic similarity between terms (or short text expressions) that have the same meani...
This paper presents a method for measuring the semantic similarity of texts, using corpus-based and ...
The present paper discusses how to measure the degree of similarity or difference in the vocabulary ...