Abstract-The on-going project aiming at the creation of the National Corpus of Polish assumes several levels of linguistic annotation. We present the technical environment and methodological background developed for the three upper annotation levels: the level of syntactic words and groups, and the level of named entities. We show how knowledge-based platforms Spejd and Sprout are used for the automatic pre-annotation of the corpus, and we discuss some particular problems faced during the elaboration of the syntactic grammar, which contains over 800 rules and is one of the largest chunking grammars for Polish. We also show how the tree editor TrEd has been customized for manual post-editing of annotations, and for further revision of discre...
International audienceIn the paper we present a contribution to the SyntLex long-term-project aiming...
International audienceIn the paper we present a contribution to the SyntLex long-term-project aiming...
International audienceIn the paper we present a contribution to the SyntLex long-term-project aiming...
International audienceThe on-going project aiming at the creation of the National Corpus of Polish a...
We present the named entity annotation task within the on-going project of the National Corpus of Po...
International audienceWe present the named entity annotation task within the on-going project of the...
International audienceWe present initial results in the named entity annotation subtask of a project...
This paper presents the procedure of the syntactic annotation of the National Corpus of Polish. Synt...
The aim of the paper is to present recent — as of March 2010 — developments in the construction of t...
In this paper we present the principles of lexico-semantic annotation of Skład-nica Treebank using P...
The purpose of this paper is to describe recent developments in the morphological, syntactic, and se...
There is a need for a general framework for linguistic annotation that is flexible and extensible en...
The project presented here is a part of a long term research program aiming at a full lexicon gramma...
The paper is devoted to the issue of correction of the erroneous and ambiguous corpus of Frequency D...
There is a need for a general framework for linguistic annotation that is flexible and extensible en...
International audienceIn the paper we present a contribution to the SyntLex long-term-project aiming...
International audienceIn the paper we present a contribution to the SyntLex long-term-project aiming...
International audienceIn the paper we present a contribution to the SyntLex long-term-project aiming...
International audienceThe on-going project aiming at the creation of the National Corpus of Polish a...
We present the named entity annotation task within the on-going project of the National Corpus of Po...
International audienceWe present the named entity annotation task within the on-going project of the...
International audienceWe present initial results in the named entity annotation subtask of a project...
This paper presents the procedure of the syntactic annotation of the National Corpus of Polish. Synt...
The aim of the paper is to present recent — as of March 2010 — developments in the construction of t...
In this paper we present the principles of lexico-semantic annotation of Skład-nica Treebank using P...
The purpose of this paper is to describe recent developments in the morphological, syntactic, and se...
There is a need for a general framework for linguistic annotation that is flexible and extensible en...
The project presented here is a part of a long term research program aiming at a full lexicon gramma...
The paper is devoted to the issue of correction of the erroneous and ambiguous corpus of Frequency D...
There is a need for a general framework for linguistic annotation that is flexible and extensible en...
International audienceIn the paper we present a contribution to the SyntLex long-term-project aiming...
International audienceIn the paper we present a contribution to the SyntLex long-term-project aiming...
International audienceIn the paper we present a contribution to the SyntLex long-term-project aiming...