International audienceLarge scale analysis and statistics of socio-technical systems that just a few short years ago would have required the use of consistent economic and human resources can nowadays be conveniently performed by mining the enormous amount of digital data produced by human activities. Although a characterization of several aspects of our societies is emerging from the data revolution, a number of questions concerning the reliability and the biases inherent to the big data "proxies" of social life are still open. Here, we survey worldwide linguistic indicators and trends through the analysis of a large-scale dataset of microblogging posts. We show that available data allow for the study of language geography at scales rangin...
Recently, numerous approaches have emerged in the social sciences to exploit the opportunities made ...
<p>A) Raw Twitter signal. Each color corresponds to a language. Densely populated areas are easily i...
Trabajo presentado en el Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDi...
Large scale analysis and statistics of socio-technical systems that just a few short years ago would...
International audienceLarge scale analysis and statistics of socio-technical systems that just a few...
Geotagged Twitter data allows us to investigate correlations of geographic language variation, both ...
Abstract—Having access to content of messages sent by some given group of subscribers of a social ne...
International audienceWe perform a large-scale analysis of language diatopic variation using geotagg...
This paper evaluates large georeferenced corpora, taken from both web-crawled and social media sour...
Twitter is a popular social media platform for scholarly research, because the user-generated conten...
The movements of ideas and content between locations and languages are unquestionably crucial concer...
Computer-mediated communication is driving fundamental changes in the nature of written language. We...
Cities are growing as melting pots of people with different culture, religion, and language. In this...
<p>Computer-mediated communication is driving fundamental changes in the nature of written language....
International audienceOur usage of language is not solely reliant on cognition but is arguably deter...
Recently, numerous approaches have emerged in the social sciences to exploit the opportunities made ...
<p>A) Raw Twitter signal. Each color corresponds to a language. Densely populated areas are easily i...
Trabajo presentado en el Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDi...
Large scale analysis and statistics of socio-technical systems that just a few short years ago would...
International audienceLarge scale analysis and statistics of socio-technical systems that just a few...
Geotagged Twitter data allows us to investigate correlations of geographic language variation, both ...
Abstract—Having access to content of messages sent by some given group of subscribers of a social ne...
International audienceWe perform a large-scale analysis of language diatopic variation using geotagg...
This paper evaluates large georeferenced corpora, taken from both web-crawled and social media sour...
Twitter is a popular social media platform for scholarly research, because the user-generated conten...
The movements of ideas and content between locations and languages are unquestionably crucial concer...
Computer-mediated communication is driving fundamental changes in the nature of written language. We...
Cities are growing as melting pots of people with different culture, religion, and language. In this...
<p>Computer-mediated communication is driving fundamental changes in the nature of written language....
International audienceOur usage of language is not solely reliant on cognition but is arguably deter...
Recently, numerous approaches have emerged in the social sciences to exploit the opportunities made ...
<p>A) Raw Twitter signal. Each color corresponds to a language. Densely populated areas are easily i...
Trabajo presentado en el Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDi...