Linguists have long been producing grammatical decriptions of yet undescribed languages. This is a time-consuming process, which has already adapted to improved technology for recording and storage. We present here a novel application of NLP techniques to bootstrap analysis of collected data and speed-up manual selection work. To be more precise, we argue that unsupervised induction of morphology and part-of-speech analysis from raw text data is mature enough to produce useful results. Experiments with Latent Semantic Analysis were less fruitful. We exemplify this on Mpiemo, a so-far essentially undescribed Bantu language of the Central African Republic, for which raw text data was available. 1
In this paper, we demonstrate the efficacy of a POS annotation method that employed the services of ...
Computational morphological analysis is an important first step in the automatic treatment of natura...
For more than 30 years, there have been renewed interests in computational morphology resulting in n...
Linguists have long been producing grammatical decriptions of yet undescribed languages. This is a t...
Linguists have long been producing grammatical decriptions of yet undescribed languages. This is a t...
This paper addresses the experimental bootstrapping of the development of broad-coverage finite-stat...
Abstract This paper describes an endeavour to build natural language processing (NLP)...
In this article it is shown how distributional corpus analysis may be used to start the description ...
In this article it is shown how distributional corpus analysis may be used to start the description ...
Abstract In this paper the development of computational morphological analysers for six South Africa...
The world-wide proliferation of digital communications has created the need for language and speech ...
The development of natural language processing (NLP) components is resource-intensive and therefore ...
In this article it is shown how distributional corpus analysis may be used to start the description ...
The trends emerging in the natural language processing (NLP) of African languages spoken in South Af...
The paper describes a collaboration approach in progress for morphological analysis of less-resource...
In this paper, we demonstrate the efficacy of a POS annotation method that employed the services of ...
Computational morphological analysis is an important first step in the automatic treatment of natura...
For more than 30 years, there have been renewed interests in computational morphology resulting in n...
Linguists have long been producing grammatical decriptions of yet undescribed languages. This is a t...
Linguists have long been producing grammatical decriptions of yet undescribed languages. This is a t...
This paper addresses the experimental bootstrapping of the development of broad-coverage finite-stat...
Abstract This paper describes an endeavour to build natural language processing (NLP)...
In this article it is shown how distributional corpus analysis may be used to start the description ...
In this article it is shown how distributional corpus analysis may be used to start the description ...
Abstract In this paper the development of computational morphological analysers for six South Africa...
The world-wide proliferation of digital communications has created the need for language and speech ...
The development of natural language processing (NLP) components is resource-intensive and therefore ...
In this article it is shown how distributional corpus analysis may be used to start the description ...
The trends emerging in the natural language processing (NLP) of African languages spoken in South Af...
The paper describes a collaboration approach in progress for morphological analysis of less-resource...
In this paper, we demonstrate the efficacy of a POS annotation method that employed the services of ...
Computational morphological analysis is an important first step in the automatic treatment of natura...
For more than 30 years, there have been renewed interests in computational morphology resulting in n...