This paper discusses the use of unlabeled examples for the problem of named entity classification. A large number of rules is needed for coverage of the domain, suggesting that a fairly large number of la-beled examples should be required to train a classi-fier. However, we show that the use of unlabeled data can reduce the requirements for supervision to just 7 simple “seed ” rules. The approach gains leverage from natural redundancy in the data: for many named-entity instances both the spelling of the name and the context in which it appears are sufficient to determine its type. We present two algorithms. The first method uses a similar algorithm to that of (Yarowsky 95), with modifications motivated by (Blum and Mitchell 98). The second ...
Entity Linking (EL) systems have achieved impressive results on standard benchmarks, mainly thanks t...
This paper presents a named entity classifica-tion system that utilises both orthographic and contex...
The preservation of the privacy of persons mentioned in text requires the ability to automatically ...
Named Entity Recognition (NER) aims to extract and to classify rigid designators in text such as pro...
In this paper, we describe a new method for the problem of named entity classifica-tion for speciali...
In this paper, we present a simple yet novel method of exploiting unlabeled text to further improve ...
In this paper, we propose a named-entity recognition (NER) system that addresses two major limitatio...
The regularity of named entities is used to learn names and to extract named entities. Having only a...
Cimiano P, Völker J. Towards large-scale, open-domain and ontology-based named entity classification...
The task of Named Entity Recognition (NER) is aimed at identifying named entities in a given text an...
Bootstrapping is the process of improving the performance of a trained classifier by iteratively add...
This paper investigates stacking and voting methods for combining strong classifiers like boosting...
This paper investigates stacking and voting methods for combining strong classifiers like boosting, ...
While good results have been achieved for named entity recognition (NER) in supervised settings, it ...
It is often claimed that Named En-tity recognition systems need extensive gazetteers|lists of names ...
Entity Linking (EL) systems have achieved impressive results on standard benchmarks, mainly thanks t...
This paper presents a named entity classifica-tion system that utilises both orthographic and contex...
The preservation of the privacy of persons mentioned in text requires the ability to automatically ...
Named Entity Recognition (NER) aims to extract and to classify rigid designators in text such as pro...
In this paper, we describe a new method for the problem of named entity classifica-tion for speciali...
In this paper, we present a simple yet novel method of exploiting unlabeled text to further improve ...
In this paper, we propose a named-entity recognition (NER) system that addresses two major limitatio...
The regularity of named entities is used to learn names and to extract named entities. Having only a...
Cimiano P, Völker J. Towards large-scale, open-domain and ontology-based named entity classification...
The task of Named Entity Recognition (NER) is aimed at identifying named entities in a given text an...
Bootstrapping is the process of improving the performance of a trained classifier by iteratively add...
This paper investigates stacking and voting methods for combining strong classifiers like boosting...
This paper investigates stacking and voting methods for combining strong classifiers like boosting, ...
While good results have been achieved for named entity recognition (NER) in supervised settings, it ...
It is often claimed that Named En-tity recognition systems need extensive gazetteers|lists of names ...
Entity Linking (EL) systems have achieved impressive results on standard benchmarks, mainly thanks t...
This paper presents a named entity classifica-tion system that utilises both orthographic and contex...
The preservation of the privacy of persons mentioned in text requires the ability to automatically ...