International audienceWe present the named entity annotation task within the on-going project of the National Corpus of Polish. To the best of our knowledge, this is the first attempt at a large-scale corpus annotation of Polish named entities. We describe the scope and the TEI-inspired hierarchy of named entities admitted for this task, as well as the TEI-conformant multi-level stand-off annotation format. We also discuss some methodological strategies including the annotation of embedded, coordinated and discontinuous names. Our annotation platform consists of two main tools interconnected by converting facilities. A rule-based natural language processing platform SProUT is used for the automatic pre-annotation of named entities, due to t...
Within the framework of the construction of a fact database, we defined guidelines to extract named ...
Within the framework of the construction of a fact database, we defined guidelines to extract named ...
Czech Named Entity Corpus 2.0 is a corpus of 8993 Czech sentences with manually annotated 35220 Czec...
We present the named entity annotation task within the on-going project of the National Corpus of Po...
International audienceWe present initial results in the named entity annotation subtask of a project...
International audienceThe on-going project aiming at the creation of the National Corpus of Polish a...
Abstract-The on-going project aiming at the creation of the National Corpus of Polish assumes severa...
The article presents named entity recognition system, which participated in the second task of PolEv...
Abstract. In this paper, we present a rule-based named-entity recognition sys-tem for Polish built o...
This paper presents the procedure of the syntactic annotation of the National Corpus of Polish. Synt...
The aim of the paper is to present recent — as of March 2010 — developments in the construction of t...
The presented Czech Named Entity Corpus 1.0 is the first publicly available corpus providing a large...
The presented Czech Named Entity Corpus 1.0 is the first publicly available corpus providing a large...
This document presents the first edition of the Polish Sejm Corpus – a new specialized resource cont...
Towards an event annotated corpus of Polish The paper presents a typology of events built on the ba...
Within the framework of the construction of a fact database, we defined guidelines to extract named ...
Within the framework of the construction of a fact database, we defined guidelines to extract named ...
Czech Named Entity Corpus 2.0 is a corpus of 8993 Czech sentences with manually annotated 35220 Czec...
We present the named entity annotation task within the on-going project of the National Corpus of Po...
International audienceWe present initial results in the named entity annotation subtask of a project...
International audienceThe on-going project aiming at the creation of the National Corpus of Polish a...
Abstract-The on-going project aiming at the creation of the National Corpus of Polish assumes severa...
The article presents named entity recognition system, which participated in the second task of PolEv...
Abstract. In this paper, we present a rule-based named-entity recognition sys-tem for Polish built o...
This paper presents the procedure of the syntactic annotation of the National Corpus of Polish. Synt...
The aim of the paper is to present recent — as of March 2010 — developments in the construction of t...
The presented Czech Named Entity Corpus 1.0 is the first publicly available corpus providing a large...
The presented Czech Named Entity Corpus 1.0 is the first publicly available corpus providing a large...
This document presents the first edition of the Polish Sejm Corpus – a new specialized resource cont...
Towards an event annotated corpus of Polish The paper presents a typology of events built on the ba...
Within the framework of the construction of a fact database, we defined guidelines to extract named ...
Within the framework of the construction of a fact database, we defined guidelines to extract named ...
Czech Named Entity Corpus 2.0 is a corpus of 8993 Czech sentences with manually annotated 35220 Czec...