This paper describes a multilingual study on how much information is contained in a single post of microblog text from Twitter in 26 different languages. In order to answer this question in a quantitative fashion, we take an information-theoretic approach, using entropy as our criterion for quantifying “how much is said ” in a tweet. Our results find that, as expected, languages with larger character sets such as Chinese and Japanese contain more information per character than other languages. However, we also find that, somewhat surprisingly, in-formation per character does not have a strong corre-lation with information per microblog post, as authors of microblog posts in languages with more information per character do not necessarily us...
<div><p>The increasing usage of social media for conversations, together with the availability of it...
International audienceMicroblog (e.g. Twitter) is fast emerging social medium for information diffus...
Microblogging websites, such as Twitter, provide seemingly endless amount of textual information on ...
This paper presents a multilingual study on, per single post of microblog text, (a) how much can be ...
The purpose of this study is to estimate and compare the entropy and redundancy of written English a...
Twitter has become the de facto information sharing and communication platform. Given the factors th...
Wikipedia has long become a standard source of informa-tion on the web and as such is widely referen...
While various claims have been made about text in social media text being noisy, there has never bee...
The increasing usage of social media for conversations, together with the availability of its data t...
This study is intended to unveil the difference of social mediated world via major languages and inv...
The fast increase in the ease of access to computing, coupled with the rapid growth of social media ...
This paper applies Multi-Dimensional Analysis (MDA) to a corpus of English tweets to uncover the mos...
The choice associated with words is a fundamental property of natural languages. It lies at the hear...
Despite the widespread adoption of Twitter internationally, little research has investigated the dif...
The choice associated with words is a fundamental property of natural languages. It lies at the hear...
<div><p>The increasing usage of social media for conversations, together with the availability of it...
International audienceMicroblog (e.g. Twitter) is fast emerging social medium for information diffus...
Microblogging websites, such as Twitter, provide seemingly endless amount of textual information on ...
This paper presents a multilingual study on, per single post of microblog text, (a) how much can be ...
The purpose of this study is to estimate and compare the entropy and redundancy of written English a...
Twitter has become the de facto information sharing and communication platform. Given the factors th...
Wikipedia has long become a standard source of informa-tion on the web and as such is widely referen...
While various claims have been made about text in social media text being noisy, there has never bee...
The increasing usage of social media for conversations, together with the availability of its data t...
This study is intended to unveil the difference of social mediated world via major languages and inv...
The fast increase in the ease of access to computing, coupled with the rapid growth of social media ...
This paper applies Multi-Dimensional Analysis (MDA) to a corpus of English tweets to uncover the mos...
The choice associated with words is a fundamental property of natural languages. It lies at the hear...
Despite the widespread adoption of Twitter internationally, little research has investigated the dif...
The choice associated with words is a fundamental property of natural languages. It lies at the hear...
<div><p>The increasing usage of social media for conversations, together with the availability of it...
International audienceMicroblog (e.g. Twitter) is fast emerging social medium for information diffus...
Microblogging websites, such as Twitter, provide seemingly endless amount of textual information on ...