Abstract: In this paper we introduce a method for part-of-speech disambiguation of Persian texts that uses word class probabilities estimated from a relatively small training corpus to automatically tag unrestricted Persian text. The experiment was carried out at two levels: unigram and bigram genotype disambiguation. Comparing the results from the two levels, we show that using the immediate right context of a given word can considerably increase the accuracy of the system.

Index Terms: genotype, machine translation, part-of-speech disambiguation, word class probabilities

In linguistics, the term ‘corpus’ refers to a relatively large number of raw or annotated words in a body of text. Computational lingu...
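The two disambiguation levels named in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes that a "genotype" is the set of tags a word is observed with in the training corpus, that the unigram level picks the most frequent tag for a genotype, and that the bigram level conditions that choice on the genotype of the word immediately to the right. The function names, the toy English corpus, and the tag labels are all hypothetical and chosen only to keep the example readable.

```python
from collections import Counter, defaultdict

END = ("</s>",)      # hypothetical marker for "no word to the right"
UNKNOWN = ("UNK",)   # hypothetical genotype for words not in the lexicon


def build_lexicon(tagged_sents):
    """Map each word to its genotype: the sorted tuple of tags seen for it."""
    tags = defaultdict(set)
    for sent in tagged_sents:
        for word, tag in sent:
            tags[word].add(tag)
    return {word: tuple(sorted(seen)) for word, seen in tags.items()}


def train(tagged_sents, lexicon):
    """Count tags per genotype (unigram) and per genotype pair (bigram)."""
    uni = defaultdict(Counter)  # genotype -> tag counts
    bi = defaultdict(Counter)   # (genotype, right genotype) -> tag counts
    for sent in tagged_sents:
        genos = [lexicon[word] for word, _ in sent]
        for i, (_, tag) in enumerate(sent):
            right = genos[i + 1] if i + 1 < len(genos) else END
            uni[genos[i]][tag] += 1
            bi[(genos[i], right)][tag] += 1
    return uni, bi


def tag_words(words, lexicon, uni, bi):
    """Tag with the bigram counts, backing off to unigram counts if unseen."""
    genos = [lexicon.get(word, UNKNOWN) for word in words]
    result = []
    for i, geno in enumerate(genos):
        right = genos[i + 1] if i + 1 < len(genos) else END
        counts = bi.get((geno, right)) or uni.get(geno)
        result.append(counts.most_common(1)[0][0] if counts else "UNK")
    return result


# Toy training data (hypothetical, English for readability).
corpus = [
    [("they", "PRO"), ("can", "MOD"), ("fish", "V")],
    [("the", "DET"), ("can", "N"), ("fell", "V")],
    [("we", "PRO"), ("fish", "V")],
    [("the", "DET"), ("fish", "N"), ("fell", "V")],
]
lexicon = build_lexicon(corpus)
uni, bi = train(corpus, lexicon)

print(tag_words(["the", "can", "fell"], lexicon, uni, bi))
print(tag_words(["they", "can", "fish"], lexicon, uni, bi))
```

In this toy corpus the genotype of "can" is seen once as a modal and once as a noun, so the unigram counts alone cannot separate the two readings; the genotype of the right neighbour does. That is the kind of gain from right context the abstract reports, although the real system's probabilities, tagset, and back-off strategy may differ.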
In this paper we describe a proof-of-concept for the bootstrapping of a Persian WordNet. This effort...
Part of Speech (POS) tagging is an essential part of text processing applications. A POS tagger assi...
This thesis presents open source resources in the form of annotated corpora and modules for automati...
Part-Of-Speech (POS) tagging is the process of marking-up the words in a text with their correspondi...
One of the fundamental tasks in natural language processing is part of speech (POS) tagging. A POS t...
Part-Of-Speech (POS) tagging is the process of marking-up the words in a text with their correspond...
Proceedings of the 18th Nordic Conference of Computational Linguistics NODALIDA 2011. Editors: Bol...
The present study introduces a machine-based approach for Word Sense Disambiguation (WSD). In Persian, a...
This paper presents a methodology for improving part-of-speech disambiguation using word classes. We...
Persian, with about 100,000,000 speakers worldwide, belongs to the group of languages with less...
In many applications of natural language processing (NLP) grammatically tagged corpora are needed. T...
Abstract. One of the important actions in the processing of languages is part-of-speech tagging. Again...
This paper describes a method based on morphological analysis of words for a Persian Part-Of-Speech ...
Collocation is an important level of lexis and its significance in the pedagogy of a language is an ...
Abstract. Increasing the domain of locality by using Tree Adjoining Grammars (TAG) caused some appli...