Bootstrapped pattern learning for entity extraction usually starts with seed entities and iteratively learns patterns and entities from unlabeled text. Patterns are scored by their ability to extract more positive en-tities and less negative entities. A prob-lem is that due to the lack of labeled data, unlabeled entities are either assumed to be negative or are ignored by the existing pat-tern scoring measures. In this paper, we improve pattern scoring by predicting the labels of unlabeled entities. We use var-ious unsupervised features based on con-trasting domain-specific and general text, and exploiting distributional similarity and edit distances to learned entities. Our system outperforms existing pattern scor-ing algorithms for extrac...
One of the central tasks of medical text analysis is to extract and structure meaningful information...
Entity Recognition (ER) is a key component of relation extraction systems and many other natural-lan...
Entity resolution is one of the central challenges when integrating data from large numbers of data ...
Bootstrapped pattern learning for entity extraction usually starts with seed entities and iterativel...
With the rise in social media platform usage, the average number of people profiles has increased as...
This paper discusses the use of unlabeled examples for the problem of named entity classification. A...
In this paper, we describe a new method for the problem of named entity classifica-tion for speciali...
AbstractNamed entity recognition is a crucial component of biomedical natural language processing, e...
This paper aims to provide an effective interface for progressive refinement of pattern-based inform...
Named Entity Recognition is a basic task in Information Extraction that aims at identifying entities...
We investigate a largely unsupervised approach to learning interpretable, domain-specific entity typ...
Bootstrapping is the process of improving the performance of a trained classifier by iteratively add...
The rapidly growing body of literature in the biomedical domain presents opportunities for the devel...
An accurate Named Entity Recognition (NER) is important for knowledge discovery in text mining. This...
International audienceBiomedical named entity recognition (NER) is a challenging problem. In this pa...
One of the central tasks of medical text analysis is to extract and structure meaningful information...
Entity Recognition (ER) is a key component of relation extraction systems and many other natural-lan...
Entity resolution is one of the central challenges when integrating data from large numbers of data ...
Bootstrapped pattern learning for entity extraction usually starts with seed entities and iterativel...
With the rise in social media platform usage, the average number of people profiles has increased as...
This paper discusses the use of unlabeled examples for the problem of named entity classification. A...
In this paper, we describe a new method for the problem of named entity classifica-tion for speciali...
AbstractNamed entity recognition is a crucial component of biomedical natural language processing, e...
This paper aims to provide an effective interface for progressive refinement of pattern-based inform...
Named Entity Recognition is a basic task in Information Extraction that aims at identifying entities...
We investigate a largely unsupervised approach to learning interpretable, domain-specific entity typ...
Bootstrapping is the process of improving the performance of a trained classifier by iteratively add...
The rapidly growing body of literature in the biomedical domain presents opportunities for the devel...
An accurate Named Entity Recognition (NER) is important for knowledge discovery in text mining. This...
International audienceBiomedical named entity recognition (NER) is a challenging problem. In this pa...
One of the central tasks of medical text analysis is to extract and structure meaningful information...
Entity Recognition (ER) is a key component of relation extraction systems and many other natural-lan...
Entity resolution is one of the central challenges when integrating data from large numbers of data ...