This article presents the Nordic Tweet Stream (NTS), a cross-disciplinarycorpus project of computer scientists and a group of sociolinguists interestedin language variability and in the global spread of English. Our research integratestwo types of empirical data: We not only rely on traditional structured corpusdata but also use unstructured data sources that are often big and rich inmetadata, such as Twitter streams. The NTS downloads tweets and associatedmetadata from Denmark, Finland, Iceland, Norway and Sweden. We first introducesome technical aspects in creating a dynamic real-time monitor corpus, andthe following case study illustrates how the corpus could be used as empiricalevidence in sociolinguistic studies focusing on the global ...
Large scale analysis and statistics of socio-technical systems that just a few short years ago would...
International audienceOur usage of language is not solely reliant on cognition but is arguably deter...
International audienceLarge scale analysis and statistics of socio-technical systems that just a few...
This article presents the Nordic Tweet Stream (NTS), a cross-disciplinarycorpus project of computer ...
This paper presents the Nordic Tweet Stream, a cross-disciplinary digital humanities project that do...
Twitter is a popular social media platform for scholarly research, because the user-generated conten...
Geotagged Twitter data allows us to investigate correlations of geographic language variation, both ...
A trilingual Latvian-Russian-English corpus of tweets is presented with an analysis of users, langua...
This paper investigates the usability of Twitter as a resource for the study of language change in p...
This study is intended to unveil the difference of social mediated world via major languages and inv...
<p>Twitter has become a rich source for linguistic data. Here, a possibility of building a trilingua...
Large-scale dialect surveys have long been a fundamental component of sociolin-guistics and variatio...
We carried out a study in which we explored the feasibility of machine translation for Twitter for t...
The cross-disciplinary Nordic Tweet Stream (NTS) is a project aiming at creating a multilingual text...
Social networks like Twitter are increasingly important in the creation of new ways of communication...
Large scale analysis and statistics of socio-technical systems that just a few short years ago would...
International audienceOur usage of language is not solely reliant on cognition but is arguably deter...
International audienceLarge scale analysis and statistics of socio-technical systems that just a few...
This article presents the Nordic Tweet Stream (NTS), a cross-disciplinarycorpus project of computer ...
This paper presents the Nordic Tweet Stream, a cross-disciplinary digital humanities project that do...
Twitter is a popular social media platform for scholarly research, because the user-generated conten...
Geotagged Twitter data allows us to investigate correlations of geographic language variation, both ...
A trilingual Latvian-Russian-English corpus of tweets is presented with an analysis of users, langua...
This paper investigates the usability of Twitter as a resource for the study of language change in p...
This study is intended to unveil the difference of social mediated world via major languages and inv...
<p>Twitter has become a rich source for linguistic data. Here, a possibility of building a trilingua...
Large-scale dialect surveys have long been a fundamental component of sociolin-guistics and variatio...
We carried out a study in which we explored the feasibility of machine translation for Twitter for t...
The cross-disciplinary Nordic Tweet Stream (NTS) is a project aiming at creating a multilingual text...
Social networks like Twitter are increasingly important in the creation of new ways of communication...
Large scale analysis and statistics of socio-technical systems that just a few short years ago would...
International audienceOur usage of language is not solely reliant on cognition but is arguably deter...
International audienceLarge scale analysis and statistics of socio-technical systems that just a few...