The article proposes a solution to the problem of automatic recognition of Russian noun and adjective cases in the Google Books Ngram corpus. The recognition was performed by using information on word co-occurrence statistics extracted from the corpus. Explicit Word Vectors composed of frequencies of ordinary and syntactic bigrams that include a given word were fed to the input of the recognizer. Comparative testing of several types of vector representation and preliminary data normalization were carried out. The trained model was a multi-layer perceptron with a softmax output layer. To train and test the model, we selected 50000 adjectives and 50000 nouns that were most frequently used in the Google Books Ngram Russian subcorpus between 19...
Abstract Models of morphologically rich languages suffer from data sparsity when words are treated a...
The aim of the project is to analyse the inventory and functioning of scientific terms and special l...
An extraction of significant information from Internet sources is an important task of pharmacovigil...
© 2020, Springer Nature Switzerland AG. This paper describes how to build a recognizer to identify n...
© Springer Nature Switzerland AG 2020. This paper describes how to automatically recognize parts of ...
© Springer Nature Switzerland AG 2020. The article discusses representativeness of Google Books Ngra...
© Published under licence by IOP Publishing Ltd. Large dictionaries of abstract/concrete words were ...
The paper presents the full-size Russian corpus of Internet users’ reviews on medicines with complex...
This paper presents a method of automatic construction extraction from a large corpus of Russian. Th...
© 2016 FRUCT.Recent advances in deep leaming for natural language processing achieve and improve ove...
© 2018 Institute of Physics Publishing. All rights reserved. Creation of the Google Books Ngram corp...
The thesis presents named-entity recognition in Czech historical newspapers from Modern Access to Hi...
The article presents findings of distribution patterns of Russian grammatical categories computed wi...
Current research efforts in Named Entity Recognition deal mostly with the English language. Even tho...
International audienceNeoveille is a web platform that automatically detects new words and monitors ...
Abstract Models of morphologically rich languages suffer from data sparsity when words are treated a...
The aim of the project is to analyse the inventory and functioning of scientific terms and special l...
An extraction of significant information from Internet sources is an important task of pharmacovigil...
© 2020, Springer Nature Switzerland AG. This paper describes how to build a recognizer to identify n...
© Springer Nature Switzerland AG 2020. This paper describes how to automatically recognize parts of ...
© Springer Nature Switzerland AG 2020. The article discusses representativeness of Google Books Ngra...
© Published under licence by IOP Publishing Ltd. Large dictionaries of abstract/concrete words were ...
The paper presents the full-size Russian corpus of Internet users’ reviews on medicines with complex...
This paper presents a method of automatic construction extraction from a large corpus of Russian. Th...
© 2016 FRUCT.Recent advances in deep leaming for natural language processing achieve and improve ove...
© 2018 Institute of Physics Publishing. All rights reserved. Creation of the Google Books Ngram corp...
The thesis presents named-entity recognition in Czech historical newspapers from Modern Access to Hi...
The article presents findings of distribution patterns of Russian grammatical categories computed wi...
Current research efforts in Named Entity Recognition deal mostly with the English language. Even tho...
International audienceNeoveille is a web platform that automatically detects new words and monitors ...
Abstract Models of morphologically rich languages suffer from data sparsity when words are treated a...
The aim of the project is to analyse the inventory and functioning of scientific terms and special l...
An extraction of significant information from Internet sources is an important task of pharmacovigil...