Abstract. The paper discusses the application of a similarity metric based on compression to the measurement of the distance among Bulgarian dialects. The similarity metric is defined on the basis of the notion of Kolmogorov complexity of a file (or binary string). The application of Kolmogorov complexity in practice is not possible because its calculation over a file is an undecidable problem. Thus, the actual similarity metric is based on a real-life compressor which only approximates the Kolmogorov complexity. To use the metric for distance measurement of Bulgarian dialects we first represent the dialectological data in such a way that the metric is applicable. We propose two such representations which are compared to a baseline distance betwe...
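As an illustration of how a real compressor can stand in for Kolmogorov complexity, the following minimal sketch computes a normalized compression distance between byte strings. The abstract does not state the exact formula or compressor used, so the NCD formula, the choice of zlib, the function names, and the three word lists are all assumptions made for illustration; the word lists are made-up placeholders, not data from the paper.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Length of the zlib-compressed data, used as a rough, computable
    stand-in for the (uncomputable) Kolmogorov complexity."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance (assumed formula):
    (C(xy) - min(C(x), C(y))) / max(C(x), C(y))."""
    cx = compressed_size(x)
    cy = compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


# Hypothetical placeholder word lists for three sites (NOT from the paper).
# Each site is represented as one newline-separated string of word forms.
site_a = "mlyako\nhlyab\nvoda\ndyado\nbaba\nzvezda\nbyal\ngolyam\n".encode("utf-8")
site_b = "mleko\nhleb\nvoda\ndedo\nbaba\nzvezda\nbel\ngolem\n".encode("utf-8")
site_c = "qwxz\nprtk\nufgj\nnnoo\nzzyy\nkklm\nvvbx\ntrwq\n".encode("utf-8")

if __name__ == "__main__":
    print("a vs a:", ncd(site_a, site_a))
    print("a vs b:", ncd(site_a, site_b))
    print("a vs c:", ncd(site_a, site_c))
```

On inputs this short, compressor headers and block overhead dominate the result, so the values are only indicative; a realistic experiment would need much longer representations of each site.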
The Levenshtein distance is an established metric to represent phonological distances bet...
The idea of measuring distance between languages seems to have its roots in the work of the French e...
Traditional dialectology relies on identifying language features which are common to one dialect are...
Dialect classification is a classical problem in traditional dialectology. In the course of the last...
In this paper a range of methods for measuring the phonetic distance between dialectal variants are ...
We present a new similarity measure based on information-theoretic measures which is superior to N...
We examine various string distance measures for suitability in modeling dialect distance, especially...
This paper proposes a simple metric of dialect distance, based on the ratio between identical word p...
Dialectometry is a multidisciplinary field that uses quantitative methods in the analysis of dialect...
The Levenshtein distance is an established metric to represent phonological distances between dialec...
First we consider pair-wise distances for literal objects consisting of finite binary files. These f...
Information distance is a parameter-free similarity measure based on compression, used in pattern re...
Normalized information distance (NID) uses the theoretical notion of Kolmogorov complexity, ...