Urdu is a language of the Indo-Aryan family, widely spoken in India and Pakistan, and an important minority language in Europe, North America, and elsewhere. This thesis describes the development of a computer-based system for part-of-speech tagging of Urdu texts, consisting of a tagset, a set of tagging guidelines for manual tagging or post-editing, and the tagger itself. The tagset is defined in accordance with a set of design principles, derived from a survey of good practice in the field of tagset design, including compliance with the EAGLES guidelines on morphosyntactic annotation. These are shown to be extensible to languages, such as Urdu, that are closely related to those languages for which the guidelines were originally devised. T...
A variety of verb phrases exist in Urdu includ-ing simple verb phrases, conjunct verb phrases and co...
Corpus based morphology has emerged into a science across the languages and in various special...
Corpus based morphology has emerged into a science across the languages and in various special...
Urdu is a language of the Indo-Aryan family, widely spoken in India and Pakistan, and an important m...
While part-of-speech tagging is an established technology for Western European languages such as Eng...
In this paper, we describe a release of a sizeable monolingual Urdu corpus automatically tagged with...
In this paper, we describe a release of a sizeable monolingual Urdu corpus automatically tagged with...
In this paper, we focus on improving part-of-speech (POS) tagging for Urdu by using exist-ing tools ...
This work presents the linguistics-based grammar modeling of Urdu language under the framework of Le...
We address the problem of Part-of-Speech (POS) tagging of Urdu. POS tagging is the process of assign...
This work presents the development of the URDU.KON-TB treebank, its annotation evaluation & guidelin...
This work presents the development of the URDU.KON-TB treebank, its annotation evaluation & guidelin...
In this paper, we present our parsing efforts for Urdu, a South Asian language with rich morphology....
We release a sizeable monolingual Urdu corpus automatically tagged with part-of-speech tags. We exte...
The rise of social networking sites and blogs has simulated a bull market in personal opinion; consu...
A variety of verb phrases exist in Urdu includ-ing simple verb phrases, conjunct verb phrases and co...
Corpus based morphology has emerged into a science across the languages and in various special...
Corpus based morphology has emerged into a science across the languages and in various special...
Urdu is a language of the Indo-Aryan family, widely spoken in India and Pakistan, and an important m...
While part-of-speech tagging is an established technology for Western European languages such as Eng...
In this paper, we describe a release of a sizeable monolingual Urdu corpus automatically tagged with...
In this paper, we describe a release of a sizeable monolingual Urdu corpus automatically tagged with...
In this paper, we focus on improving part-of-speech (POS) tagging for Urdu by using exist-ing tools ...
This work presents the linguistics-based grammar modeling of Urdu language under the framework of Le...
We address the problem of Part-of-Speech (POS) tagging of Urdu. POS tagging is the process of assign...
This work presents the development of the URDU.KON-TB treebank, its annotation evaluation & guidelin...
This work presents the development of the URDU.KON-TB treebank, its annotation evaluation & guidelin...
In this paper, we present our parsing efforts for Urdu, a South Asian language with rich morphology....
We release a sizeable monolingual Urdu corpus automatically tagged with part-of-speech tags. We exte...
The rise of social networking sites and blogs has simulated a bull market in personal opinion; consu...
A variety of verb phrases exist in Urdu includ-ing simple verb phrases, conjunct verb phrases and co...
Corpus based morphology has emerged into a science across the languages and in various special...
Corpus based morphology has emerged into a science across the languages and in various special...