This thesis describes the development and in-depth empirical investigation of a method, called BootMark, for bootstrapping the marking up of named entities in textual documents. The reason for working with documents, as opposed to for instance sentences or phrases, is that the BootMark method is concerned with the creation of corpora. The claim made in the thesis is that BootMark requires a human annotator to manually annotate fewer documents in order to produce a named entity recognizer with a given performance, than would be needed if the documents forming the basis for the recognizer were randomly drawn from the same corpus. The intention is then to use the created named en- tity recognizer as a pre-tagger and thus eventually turn the ma...
We have trained a named entity recognition (NER) model that screens Swedish job ads for different ki...
This poster proposes the use of Named Entity Recognition as a heuristic tool for improving manual do...
This paper describes a new method, COMBI-BOOTSTRAP, to exploit existing taggers and lexical resource...
This thesis describes the development and in-depth empirical investigation of a method, called BootM...
The preservation of the privacy of persons mentioned in text requires the ability to automatically ...
Scholars in inter-disciplinary fields like the Digital Humanities are increasingly interested in sem...
One of issues in the bootstrapping for named entity recognition is how to control annotation errors ...
Automatic pre-annotation is often used to improve human annotation speed and ac-curacy. We address h...
A novel bootstrapping approach to Named Entity (NE)tagging using concept-based seeds and successive ...
PosterInternational audienceToday, the named entity recognition task is considered as fundamental, b...
Named Entity Recognition (NER) is an essential step for many natural language processing tasks, incl...
Named Entity Recognition is a basic task in Information Extraction that aims at identifying entities...
Named Entity Recognition (NER) aims to extract and to classify rigid designators in text such as pro...
Linguistic annotation is time-consuming and expensive. One common annotation task is to mark entitie...
The development of Named Entity Recognition (NER) in recent years is partially attributed to the ava...
We have trained a named entity recognition (NER) model that screens Swedish job ads for different ki...
This poster proposes the use of Named Entity Recognition as a heuristic tool for improving manual do...
This paper describes a new method, COMBI-BOOTSTRAP, to exploit existing taggers and lexical resource...
This thesis describes the development and in-depth empirical investigation of a method, called BootM...
The preservation of the privacy of persons mentioned in text requires the ability to automatically ...
Scholars in inter-disciplinary fields like the Digital Humanities are increasingly interested in sem...
One of issues in the bootstrapping for named entity recognition is how to control annotation errors ...
Automatic pre-annotation is often used to improve human annotation speed and ac-curacy. We address h...
A novel bootstrapping approach to Named Entity (NE)tagging using concept-based seeds and successive ...
PosterInternational audienceToday, the named entity recognition task is considered as fundamental, b...
Named Entity Recognition (NER) is an essential step for many natural language processing tasks, incl...
Named Entity Recognition is a basic task in Information Extraction that aims at identifying entities...
Named Entity Recognition (NER) aims to extract and to classify rigid designators in text such as pro...
Linguistic annotation is time-consuming and expensive. One common annotation task is to mark entitie...
The development of Named Entity Recognition (NER) in recent years is partially attributed to the ava...
We have trained a named entity recognition (NER) model that screens Swedish job ads for different ki...
This poster proposes the use of Named Entity Recognition as a heuristic tool for improving manual do...
This paper describes a new method, COMBI-BOOTSTRAP, to exploit existing taggers and lexical resource...