Computational measures of linguistic diversity help us understand the linguistic landscape using digital language data. The contribution of this paper is to calibrate measures of linguistic diversity using restrictions on international travel resulting from the COVID-19 pandemic. Previous work has mapped the distribution of languages using geo-referenced social media and web data. The goal, however, has been to describe these corpora themselves rather than to make inferences about underlying populations. This paper shows that a difference-in-differences method based on the Herfindahl-Hirschman Index can identify the bias in digital corpora that is introduced by non-local populations. These methods tell us where significant changes have taken p...
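The abstract names two ingredients, the Herfindahl-Hirschman Index (HHI) and a difference-in-differences comparison, without spelling out how they are combined. The following is a minimal sketch of one plausible reading, not the paper's implementation. It assumes the HHI is computed as the sum of squared language shares for one location in one period, and that the difference-in-differences contrasts the change in HHI during travel restrictions with the change over a comparison period. The function names and all counts are hypothetical, chosen only for illustration.

```python
def hhi(language_counts):
    """Herfindahl-Hirschman Index over language shares.

    Returns the sum of squared shares: 1.0 when a single language
    accounts for all observations, and lower values as usage is
    spread across more languages.
    """
    total = sum(language_counts.values())
    if total == 0:
        raise ValueError("no observations to compute shares from")
    return sum((n / total) ** 2 for n in language_counts.values())


def diff_in_diff(affected_before, affected_after, baseline_before, baseline_after):
    """Change in the affected series minus change in the baseline series."""
    return (affected_after - affected_before) - (baseline_after - baseline_before)


# Hypothetical message counts per language for one location (illustrative only).
before_restrictions = {"en": 600, "de": 250, "zh": 150}
during_restrictions = {"en": 800, "de": 150, "zh": 50}

# Hypothetical counts for the same months in an earlier, unrestricted pair of
# years, used here as the baseline change (an assumption of this sketch).
baseline_first = {"en": 610, "de": 240, "zh": 150}
baseline_second = {"en": 620, "de": 235, "zh": 145}

effect = diff_in_diff(
    hhi(before_restrictions),
    hhi(during_restrictions),
    hhi(baseline_first),
    hhi(baseline_second),
)
print(f"Estimated change in HHI attributable to restrictions: {effect:.4f}")
```

Under these assumptions, a positive estimate means language use became more concentrated once non-local populations were absent, which is the kind of bias the abstract describes.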
The Index of Linguistic Diversity (ILD) is a new quantitative measure of trends in linguistic divers...
This article develops new indices to measure linguistic diversity. It is new in two respects: firstl...
Twitter is a popular social media platform for scholarly research, because the user-generated conten...
This paper evaluates large georeferenced corpora, taken from both web-crawled and social media sour...
The COVID-19 pandemic has massively limited how linguists can collect data, and out of necessity, re...
This thesis discusses the methods used to assess the linguistic diversity of the internet. I crit...
UNESCO has been emphasizing the concept of “knowledge societies”, which stresses plurality and diver...
There is a growing trend in sociolinguistics and dialectology to analyse large corpora of social med...
Computer-mediated communication is driving fundamental changes in the nature of written language. We...
In this paper we present a new computational technique to detect and analyze statistically significa...
This paper measures similarity both within and between 84 language varieties across nine languages....
Globalization, urbanization and international mobility have led to increasingly diverse urban popula...
We perform a large-scale analysis of language diatopic variation using geotagg...
It is well known that AI-based language technology—large language models, mach...