The paper presents experiments on part-of-speech and full morphological tagging of the Slavic minority language Rusyn. The proposed approach relies on transfer learning and uses only annotated resources from related Slavic languages, namely Russian, Ukrainian, Slovak, Polish, and Czech. It does not require any annotated Rusyn training data, nor parallel data or bilingual dictionaries involving Rusyn. Compared to earlier work, we improve tagging performance by using a neural network tagger and larger training data from the neighboring Slavic languages.We experiment with various data preprocessing and sampling strategies and evaluate the impact of multitask learning strategies and of pretrained word embeddings. Overall, while genre discrepanc...
We take a novel approach to rapid, low-cost development of morpho-syntactically annotated resources ...
The thesis explores the status quo of the Kazakh language in terms of corpus linguistics. The proj...
Abstract. Comparative studies in theoretical linguistics and the production of bi- and multilingual ...
The paper presents experiments on part-of-speech and full morphological tagging of the Slavic minori...
This paper deals with the development of morphosyntactic taggers for spoken varieties of the Slavic ...
This paper reports on challenges and results in developing NLP resources for spoken Rusyn. Being a S...
This paper presents a methodology for rapid development of Ukrainian morphological disambiguation re...
We explore how well a sequence labeling approach, namely, recurrent neural network, is suited for th...
This paper reports the principles behind designing a tagset to cover Russian morphosyntactic phenome...
In this paper, we describe a resource-light system for the automatic morphological analysis and tag-...
This paper presents winning solution to PolEval 2020 morphosyntactic tagging of Middle, New and Mode...
In this paper, we describe a resource-light system for the automatic morphological analysis and tagg...
The article presents a state-of-the-art complete part-of-speech tagger for Polish which uses recurre...
Neural networks represent a promising approach to problems, which exact algorithmic solution is unkn...
The paper evaluates tagging techniques on a corpus of Slovene, where we are faced with a large numbe...
We take a novel approach to rapid, low-cost development of morpho-syntactically annotated resources ...
The thesis explores the status quo of the Kazakh language in terms of corpus linguistics. The proj...
Abstract. Comparative studies in theoretical linguistics and the production of bi- and multilingual ...
The paper presents experiments on part-of-speech and full morphological tagging of the Slavic minori...
This paper deals with the development of morphosyntactic taggers for spoken varieties of the Slavic ...
This paper reports on challenges and results in developing NLP resources for spoken Rusyn. Being a S...
This paper presents a methodology for rapid development of Ukrainian morphological disambiguation re...
We explore how well a sequence labeling approach, namely, recurrent neural network, is suited for th...
This paper reports the principles behind designing a tagset to cover Russian morphosyntactic phenome...
In this paper, we describe a resource-light system for the automatic morphological analysis and tag-...
This paper presents winning solution to PolEval 2020 morphosyntactic tagging of Middle, New and Mode...
In this paper, we describe a resource-light system for the automatic morphological analysis and tagg...
The article presents a state-of-the-art complete part-of-speech tagger for Polish which uses recurre...
Neural networks represent a promising approach to problems, which exact algorithmic solution is unkn...
The paper evaluates tagging techniques on a corpus of Slovene, where we are faced with a large numbe...
We take a novel approach to rapid, low-cost development of morpho-syntactically annotated resources ...
The thesis explores the status quo of the Kazakh language in terms of corpus linguistics. The proj...
Abstract. Comparative studies in theoretical linguistics and the production of bi- and multilingual ...