In our submission to the SIGMORPHON 2022 Shared Task on Morpheme Segmentation, we study whether an unsupervised morphological segmentation method, Morfessor, can help in a supervised setting. Previous research has shown the effectiveness of the approach in semisupervised settings with small amounts of labeled data. The current tasks vary in data size: the amount of word-level annotated training data is much larger, but the amount of sentencelevel annotated training data remains small. Our approach is to pre-segment the input data for a neural sequence-to-sequence model with the unsupervised method. As the unsupervised method can be trained with raw text data, we use Wikipedia to increase the amount of training data. In addition, we train mu...
In natural language processing many practical tasks, such as speech recognition, information retriev...
We investigate the usage of semantic information for morphological segmentation since words that are...
Morfessor is a family of methods for learning morphological segmentations of words based on unannota...
In our submission to the SIGMORPHON 2022 Shared Task on Morpheme Segmentation, we study whether an u...
Morfessor is a family of probabilistic machine learning methods forfinding the morphological segment...
In order to develop computer applications that successfully process natural language data (text and ...
Morphological analysis provides a decomposition of words into smaller constituents. It is an importa...
Machine learning methods are increasingly applied to automated processing of natural language data. ...
This paper describes the submissions of the team of the Department of Computational Linguistics, Uni...
Data-driven segmentation of words into subword units has been used in various natural language proce...
| openaire: EC/H2020/780069/EU//MeMADData-driven segmentation of words into subword units has been u...
Morphological Segmentation involves decomposing words into morphemes, the smallest meaning-bearing u...
In this work, Morfessor, a morpheme segmentation model and algorithm developed by the organizers of ...
Morfette is a modular, data-driven, probabilistic system which learns to perform joint morphological...
We present an extension of the Morfessor Base line model of unsupervised morphological seg mentation...
In natural language processing many practical tasks, such as speech recognition, information retriev...
We investigate the usage of semantic information for morphological segmentation since words that are...
Morfessor is a family of methods for learning morphological segmentations of words based on unannota...
In our submission to the SIGMORPHON 2022 Shared Task on Morpheme Segmentation, we study whether an u...
Morfessor is a family of probabilistic machine learning methods forfinding the morphological segment...
In order to develop computer applications that successfully process natural language data (text and ...
Morphological analysis provides a decomposition of words into smaller constituents. It is an importa...
Machine learning methods are increasingly applied to automated processing of natural language data. ...
This paper describes the submissions of the team of the Department of Computational Linguistics, Uni...
Data-driven segmentation of words into subword units has been used in various natural language proce...
| openaire: EC/H2020/780069/EU//MeMADData-driven segmentation of words into subword units has been u...
Morphological Segmentation involves decomposing words into morphemes, the smallest meaning-bearing u...
In this work, Morfessor, a morpheme segmentation model and algorithm developed by the organizers of ...
Morfette is a modular, data-driven, probabilistic system which learns to perform joint morphological...
We present an extension of the Morfessor Base line model of unsupervised morphological seg mentation...
In natural language processing many practical tasks, such as speech recognition, information retriev...
We investigate the usage of semantic information for morphological segmentation since words that are...
Morfessor is a family of methods for learning morphological segmentations of words based on unannota...