Large-scale dialect surveys have long been a fundamental component of sociolinguistics and variation studies, but they have traditionally required significant investments of time and resources to collect relatively small amounts of data. In this study, I examine whether textual corpora collected from the Internet, particularly the social-networking website Twitter, can be used to conduct such surveys more quickly with less effort. I discuss the utility of Twitter as a linguistic data source, explain the computational and linguistic methods necessary to collect and process worthwhile data, and use corpora from Twitter to plot the distribution of three regional variables in American English: soft drink terminology, the use of ‘hella’ a...
This article presents a new method for data collection in regional dialectology based on site-restri...
64 pages. Presented to the Department of Linguistics and the Robert D. Clark Honors College in parti...
Twitter has become a staple social media platform for millions of English speakers of different soci...
There is a growing trend in sociolinguistics and dialectology to analyse large corpora of social med...
Electronic social media offers new opportunities for informal communication in written language, whi...
Computer-mediated communication is driving fundamental changes in the nature of written language. We...
We analyze a Big Data set of geo-tagged tweets for a year (Oct. 2013–Oct. 2014) to understand the re...
Computer-mediated communication is driving fundamental changes in the nature of written language. We...
Recent research on dialect variation using social media data has so far provided evidence that spell...
The use of both production and perceptual data has the potential to provide a more complete picture ...
Established models of the spatial diffusion of linguistic innovations vary in their relationship to ...
Geotagged Twitter data allows us to investigate correlations of geographic language variation, both ...
Having access to content of messages sent by some given group of subscribers of a social ne...
This dissertation takes a quantitative perspective on variation in English world-wide. It applies a ...
We perform a large-scale analysis of language diatopic variation using geotagg...