The preservation of the privacy of persons mentioned in text requires the ability to automatically recognize and identify names. Named entity recognition is a mature field and most current approaches are based on supervised machine learning techniques. Such learning requires the presence of labeled examples on which to train; training examples are usually provided to the learner on the form of annotated corpora. Creating and annotating corpora is a tedious, meticulous and error prone process; obtaining good training examples is a hard task in itself. This paper describes the development and in-depth empirical investigation of a method, called BootMark, for bootstrapping the marking up of named entities in textual documents. Exper...
casual reader, careful observer or scientific analyst it is becoming more important to quickly recog...
International audienceSince the Message Understanding Conferences on Information Extraction in the 8...
International audienceA vast amount of crucial information about patients resides solely in unstruct...
This thesis describes the development and in-depth empirical investigation of a method, called BootM...
The regularity of named entities is used to learn names and to extract named entities. Having only a...
Sharing data is an important part of the progress of science in many fields. In the largely deep lea...
Text data collections enable the deployment of artificial intelligence algorithms for novel tasks. S...
Named Entity Recognition (NER) aims to extract and to classify rigid designators in text such as pro...
This paper discusses the use of unlabeled examples for the problem of named entity classification. A...
Scholars in inter-disciplinary fields like the Digital Humanities are increasingly interested in sem...
In this paper, we propose a named-entity recognition (NER) system that addresses two major limitatio...
Abstract—The rise of the social web has brought a series of privacy concerns and threats. In particu...
Named Entity Recognition is a basic task in Information Extraction that aims at identifying entities...
One of issues in the bootstrapping for named entity recognition is how to control annotation errors ...
The development of Named Entity Recognition (NER) in recent years is partially attributed to the ava...
casual reader, careful observer or scientific analyst it is becoming more important to quickly recog...
International audienceSince the Message Understanding Conferences on Information Extraction in the 8...
International audienceA vast amount of crucial information about patients resides solely in unstruct...
This thesis describes the development and in-depth empirical investigation of a method, called BootM...
The regularity of named entities is used to learn names and to extract named entities. Having only a...
Sharing data is an important part of the progress of science in many fields. In the largely deep lea...
Text data collections enable the deployment of artificial intelligence algorithms for novel tasks. S...
Named Entity Recognition (NER) aims to extract and to classify rigid designators in text such as pro...
This paper discusses the use of unlabeled examples for the problem of named entity classification. A...
Scholars in inter-disciplinary fields like the Digital Humanities are increasingly interested in sem...
In this paper, we propose a named-entity recognition (NER) system that addresses two major limitatio...
Abstract—The rise of the social web has brought a series of privacy concerns and threats. In particu...
Named Entity Recognition is a basic task in Information Extraction that aims at identifying entities...
One of issues in the bootstrapping for named entity recognition is how to control annotation errors ...
The development of Named Entity Recognition (NER) in recent years is partially attributed to the ava...
casual reader, careful observer or scientific analyst it is becoming more important to quickly recog...
International audienceSince the Message Understanding Conferences on Information Extraction in the 8...
International audienceA vast amount of crucial information about patients resides solely in unstruct...