Abstract: The Levenshtein distance is an established metric to represent phono-logical distances between dialects. So far, this metric has usually been applied on manually transcribed word lists. In this study we introduce several extensions of the Levenshtein distance by incorporating probabilistic edit costs as well as temporal alignment costs. We tested all variants for compliance with the axioms that within-dialect utterance pairs are phonologically more similar than across-dialect ones. In contrast to former studies we are not applying the metrics on preselected, prototypi-cal word lists but on real connected speech data which was automatically segmented and labeled. It turned out, that the transcription edit distances already performe...
This article proposes the use of an extended weighted Levenshtein distance to model the time depth b...
The primary data on pronunciation variation — e.g., dialect atlas data — is often recorded incommens...
This paper evaluates various character alignment methods on the task of sentence-level standardizati...
The Levenshtein distance is an established metric to represent phonological distances between dialec...
Traditional dialectology relies on identifying language features which are common to one dialect are...
We examine various string distance measures for suitability in modeling dialect distance, especially...
We examine various string distance measures for suitability in modeling dialect distance, especially...
Structuralists famously observed that language is "un systeme oil tout se tient" (Meillet, 1903, p.4...
In this paper a range of methods for measuring the phonetic distance between dialectal variants are ...
This article surveys recent developments furthering dialectometric research which the authors have b...
Gooskens (2003) described an experiment which determined linguistic distances between 15 Norwegian d...
Measuring dialect distances can be based on the comparison of words, and the comparison words should...
This paper proposes a simple metric of dialect distance, based on the ratio between identical word p...
This project measures and classifies language variation. In contrast to earlier dialectology, we see...
This article proposes the use of an extended weighted Levenshtein distance to model the time depth b...
The primary data on pronunciation variation — e.g., dialect atlas data — is often recorded incommens...
This paper evaluates various character alignment methods on the task of sentence-level standardizati...
The Levenshtein distance is an established metric to represent phonological distances between dialec...
Traditional dialectology relies on identifying language features which are common to one dialect are...
We examine various string distance measures for suitability in modeling dialect distance, especially...
We examine various string distance measures for suitability in modeling dialect distance, especially...
Structuralists famously observed that language is "un systeme oil tout se tient" (Meillet, 1903, p.4...
In this paper a range of methods for measuring the phonetic distance between dialectal variants are ...
This article surveys recent developments furthering dialectometric research which the authors have b...
Gooskens (2003) described an experiment which determined linguistic distances between 15 Norwegian d...
Measuring dialect distances can be based on the comparison of words, and the comparison words should...
This paper proposes a simple metric of dialect distance, based on the ratio between identical word p...
This project measures and classifies language variation. In contrast to earlier dialectology, we see...
This article proposes the use of an extended weighted Levenshtein distance to model the time depth b...
The primary data on pronunciation variation — e.g., dialect atlas data — is often recorded incommens...
This paper evaluates various character alignment methods on the task of sentence-level standardizati...