This paper describes a pilot NER system for Twitter, comprising the USFD system en-try to the W-NUT 2015 NER shared task. The goal is to correctly label entities in a tweet dataset, using an inventory of ten types. We employ structured learning, drawing on gazetteers taken from Linked Data, and on un-supervised clustering features, and attempting to compensate for stylistic and topic drift – a key challenge in social media text. Our result is competitive; we provide an analysis of the components of our methodology, and an ex-amination of the target dataset in the context of this task.
The content posted by users on Social Networks represents an important source of information for a m...
Named Entity Recognition (NER) is an important subtask of information extraction that seeks to locat...
Motivated by the increasing need of information retrieval from social media, a lexicon-based approac...
This paper describes a pilot NER system for Twitter, comprising the USFD system entry to the W-NUT 2...
We present two new NER datasets for Twitter; a manually annotated set of 1,467 tweets (kappa=0.942) ...
Named entity recognition (NER) systems trained on newswire perform very badly when tested on Twitter...
The data on Social Network Services (SNSs) has recently become an interesting source for researchers...
International audienceNamed Entity Recognition (NER) is a traditional Natural Language Processing (N...
In recent years, social media outlets such as Twitter and Facebook have drawn attention from compani...
In recent years, social media outlets such as Twitter and Facebook have drawn attention from compani...
amed Entity Recognition (NER) is an important subtask of information extraction that seeks to locate...
Social media data such as Twitter messages ("tweets") pose a particular challenge to NLP systems bec...
We experiment with using different sources of distant supervision to guide unsupervised and semi-sup...
Twitter scales 500 million tweets per day and has 316 million monthly active users. The majority of ...
Various recent studies show that the performance of named entity recognition (NER) systems developed...
The content posted by users on Social Networks represents an important source of information for a m...
Named Entity Recognition (NER) is an important subtask of information extraction that seeks to locat...
Motivated by the increasing need of information retrieval from social media, a lexicon-based approac...
This paper describes a pilot NER system for Twitter, comprising the USFD system entry to the W-NUT 2...
We present two new NER datasets for Twitter; a manually annotated set of 1,467 tweets (kappa=0.942) ...
Named entity recognition (NER) systems trained on newswire perform very badly when tested on Twitter...
The data on Social Network Services (SNSs) has recently become an interesting source for researchers...
International audienceNamed Entity Recognition (NER) is a traditional Natural Language Processing (N...
In recent years, social media outlets such as Twitter and Facebook have drawn attention from compani...
In recent years, social media outlets such as Twitter and Facebook have drawn attention from compani...
amed Entity Recognition (NER) is an important subtask of information extraction that seeks to locate...
Social media data such as Twitter messages ("tweets") pose a particular challenge to NLP systems bec...
We experiment with using different sources of distant supervision to guide unsupervised and semi-sup...
Twitter scales 500 million tweets per day and has 316 million monthly active users. The majority of ...
Various recent studies show that the performance of named entity recognition (NER) systems developed...
The content posted by users on Social Networks represents an important source of information for a m...
Named Entity Recognition (NER) is an important subtask of information extraction that seeks to locat...
Motivated by the increasing need of information retrieval from social media, a lexicon-based approac...