There has been a lot of recent interest in the natural language processing (NLP) community in the computational processing of language varieties and dialects, with the aim to improve the performance of applications such as machine translation, speech recognition, and dialogue systems. Here, we attempt to survey this growing field of research, with focus on computational methods for processing similar languages, varieties, and dialects. In particular, we discuss the most important challenges when dealing with diatopic language variation, and we present some of the available datasets, the process of data collection, and the most common data collection strategies used to compile datasets for similar languages, varieties, and dialects. We furth...
In this paper, we aim to explore the degree to which translated texts preserve linguistic features o...
Language variationists, study how languages vary along geographical or social lines or along lines o...
Building NLP systems that serve everyone requires accounting for dialect differences. But dialects a...
There has been a lot of recent interest in the natural language processing (NLP) community in the co...
Most Natural Language Processing (NLP) applications focus on standardized, written language varietie...
Languages are fundamental to human communication and serve as a means to express social and cultural...
Addressing the cross-lingual variation of grammatical structures and meaning categorization is a key...
The goal of this paper is to provide a complete representation of regional linguistic variation on a...
This project measures and classifies language variation. In contrast to earlier dialectology, we see...
Language label tokens are often used in multilingual neural language modeling and sequence-to-sequen...
Even now techniques are in common use in computational linguistics which could lead to important adv...
In this paper a range of methods for measuring the phonetic distance between dialectal variants are ...
This paper provides an overview of computational work in dialectology. Wehave published similar surv...
Dialectology is the study of dialects, and dialectometry is the measurement of dialect differences, ...
The most important reasons for examining “non-standard data” with CL methods are the facts that this...
In this paper, we aim to explore the degree to which translated texts preserve linguistic features o...
Language variationists, study how languages vary along geographical or social lines or along lines o...
Building NLP systems that serve everyone requires accounting for dialect differences. But dialects a...
There has been a lot of recent interest in the natural language processing (NLP) community in the co...
Most Natural Language Processing (NLP) applications focus on standardized, written language varietie...
Languages are fundamental to human communication and serve as a means to express social and cultural...
Addressing the cross-lingual variation of grammatical structures and meaning categorization is a key...
The goal of this paper is to provide a complete representation of regional linguistic variation on a...
This project measures and classifies language variation. In contrast to earlier dialectology, we see...
Language label tokens are often used in multilingual neural language modeling and sequence-to-sequen...
Even now techniques are in common use in computational linguistics which could lead to important adv...
In this paper a range of methods for measuring the phonetic distance between dialectal variants are ...
This paper provides an overview of computational work in dialectology. Wehave published similar surv...
Dialectology is the study of dialects, and dialectometry is the measurement of dialect differences, ...
The most important reasons for examining “non-standard data” with CL methods are the facts that this...
In this paper, we aim to explore the degree to which translated texts preserve linguistic features o...
Language variationists, study how languages vary along geographical or social lines or along lines o...
Building NLP systems that serve everyone requires accounting for dialect differences. But dialects a...