Abstract—Over the last years, many papers have been published about how to use machine learning for classifying postings on microblogging platforms like Twitter, e.g., in order to assist users to reach tweets that interest them. Typically, the automatic classification results are then evaluated against a gold standard classification which consists of either (i) the hashtags of the tweets' authors, or (ii) manual annotations of independent human annotators. In this paper, we show that there are fundamental differences between these two kinds of gold standard classifications, i.e., human annotators are more likely to classify tweets like other human annotators than like the tweets' authors. Furthermore, we discuss how these differences ma...
KDIR is part of IC3K, the International Joint Conference on Knowledge Discovery, Knowledge Engineeri...
Crowdsourcing is a popular means to obtain human-crafted information, for example labels of tweets, ...
Hashtags in Twitter posts may carry different semantic payloads. Their dual form (word and label) m...
The use of bots to influence public debate, spread disinformation and spam, creates a need for efficie...
This paper deals with the quality of textual features in messages in order to classify tweets. The a...
This paper addresses the task of building a classifier that would categorise tweets in Twitter. Micr...
Classification of data is an important aspect of getting vigorous knowledge and help to analyze and...
The recent advent and evolution of deep learning models and pre-trained embedding techniques have cr...
The rapid growth in social media data has motivated the development of a real time framework to unde...
Machine learning has a wide range of uses, and one of its key uses is classification. A new observat...