This repository contains the data set developed for the paper: “Shruti Rijhwani and Daniel Preoțiuc-Pietro. Temporally-Informed Analysis of Named Entity Recognition. In Proceedings of the Association for Computational Linguistics (ACL). 2020.” It includes 12,000 tweets annotated for the named entity recognition task. The tweets are uniformly distributed over the years 2014-2019, with 2,000 tweets from each year. The goal is to have a temporally diverse corpus to account for data drift over time when building NER models. The entity types annotated are locations (LOC), persons (PER) and organizations (ORG). The tweets are preprocessed to replace usernames and URLs with a unique token. Hashtags are left intact and can be annotated as named ...
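The username and URL replacement described above can be illustrated with a minimal sketch. It is not the authors' preprocessing code: the placeholder strings `<USER>` and `<URL>`, the regular expressions, and the function name `normalize_tweet` are all assumptions made for illustration, and the tokens actually used in the corpus may differ. Hashtags are left untouched, as in the description.

```python
import re

# Hypothetical placeholder tokens; the corpus's actual tokens may differ.
USER_TOKEN = "<USER>"
URL_TOKEN = "<URL>"

# Assumed patterns: http(s) links or bare www. links, and @-mentions that
# are not preceded by a word character (so e-mail addresses are skipped).
URL_RE = re.compile(r"https?://\S+|www\.\S+")
USER_RE = re.compile(r"(?<!\w)@\w+")


def normalize_tweet(text: str) -> str:
    """Replace URLs and @-mentions with placeholder tokens; keep hashtags."""
    # URLs are replaced first so that an '@' inside a link is not
    # mistaken for a username mention.
    text = URL_RE.sub(URL_TOKEN, text)
    text = USER_RE.sub(USER_TOKEN, text)
    return text


if __name__ == "__main__":
    print(normalize_tweet("@alice check https://t.co/abc #NYC"))
```

Replacing URLs before usernames is a design choice in this sketch; it keeps the two patterns from interfering with each other.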
Social media texts are significant information sources for several application areas including trend...
We present a memory-based named entity recognition system that participated in the MSM-2013 Concept ...
Named Entity Disambiguation (NED) is a Natural Language Processing task of linking mentions of named...
In recent years, social media outlets such as Twitter and Facebook have drawn attention from compani...
The large number of tweets generated daily is providing decision makers with means to obtain insight...
Applying natural language processing for mining and intelligent information access to tweets (a form...
Social media data such as Twitter messages ("tweets") pose a particular challenge to NLP systems bec...
The large number of tweets generated daily is providing policy makers with means to obtain insights ...
Social media texts are significant information sources for several application areas including tren...
Named Entity Recognition (NER) is an important subtask of information extraction that seeks to locat...
The data on Social Network Services (SNSs) has recently become an interesting source for researchers...
Named Entity Recognition (NER) is an important subtask of information extraction that seeks to locate...
Named entity recognition (NER) systems trained on newswire perform very badly when tested on Twitter...
Many private and/or public organizations have been reported to create and monitor targeted Twitter s...