A variety of verb phrases exist in Urdu includ-ing simple verb phrases, conjunct verb phrases and compound verb phrases. This paper ex-plains the structure of Urdu verb phrases, and details a series of experiment to automatically tag them. Initially, a rule based model is de-veloped using 21 linguistic rules for automatic VP chunking. A 100,000 word Urdu corpus is manually tagged with VP chunk tags. The corpus is then used to develop a hybrid ap-proach using HMM based statistical chunking and correction rules. The technique is en-hanced by changing chunking direction and merging chunk and POS tags. The automati-cally chunked data is compared with manually tagged held-out data to identify and analyze the errors. Based on the analysis, correc...
The rise of social networking sites and blogs has simulated a bull market in personal opinion; consu...
We present an approach for online handling of Out-of-Vocabulary (OOV) terms in Urdu-English MT. Sinc...
This work presents the development of the URDU.KON-TB treebank, its annotation evaluation & guidelin...
A variety of verb phrases exist in Urdu includ-ing simple verb phrases, conjunct verb phrases and co...
Urdu is a language of the Indo-Aryan family, widely spoken in India and Pakistan, and an important m...
Urdu is a language of the Indo-Aryan family, widely spoken in India and Pakistan, and an important m...
In this paper, we describe a release of a sizeable monolingual Urdu corpus automatically tagged with...
In this paper, we describe a release of a sizeable monolingual Urdu corpus automatically tagged with...
We address the problem of Part-of-Speech (POS) tagging of Urdu. POS tagging is the process of assign...
While part-of-speech tagging is an established technology for Western European languages such as Eng...
We release a sizeable monolingual Urdu corpus automatically tagged with part-of-speech tags. We exte...
In this paper, we focus on improving part-of-speech (POS) tagging for Urdu by using exist-ing tools ...
This work presents the linguistics-based grammar modeling of Urdu language under the framework of Le...
Urdu is the national language of Pakistan, also the most widely spoken and understandable language o...
Part-of-Speech (POS) tagging can be described as a task of doing automatic annotation of syntactic c...
The rise of social networking sites and blogs has simulated a bull market in personal opinion; consu...
We present an approach for online handling of Out-of-Vocabulary (OOV) terms in Urdu-English MT. Sinc...
This work presents the development of the URDU.KON-TB treebank, its annotation evaluation & guidelin...
A variety of verb phrases exist in Urdu includ-ing simple verb phrases, conjunct verb phrases and co...
Urdu is a language of the Indo-Aryan family, widely spoken in India and Pakistan, and an important m...
Urdu is a language of the Indo-Aryan family, widely spoken in India and Pakistan, and an important m...
In this paper, we describe a release of a sizeable monolingual Urdu corpus automatically tagged with...
In this paper, we describe a release of a sizeable monolingual Urdu corpus automatically tagged with...
We address the problem of Part-of-Speech (POS) tagging of Urdu. POS tagging is the process of assign...
While part-of-speech tagging is an established technology for Western European languages such as Eng...
We release a sizeable monolingual Urdu corpus automatically tagged with part-of-speech tags. We exte...
In this paper, we focus on improving part-of-speech (POS) tagging for Urdu by using exist-ing tools ...
This work presents the linguistics-based grammar modeling of Urdu language under the framework of Le...
Urdu is the national language of Pakistan, also the most widely spoken and understandable language o...
Part-of-Speech (POS) tagging can be described as a task of doing automatic annotation of syntactic c...
The rise of social networking sites and blogs has simulated a bull market in personal opinion; consu...
We present an approach for online handling of Out-of-Vocabulary (OOV) terms in Urdu-English MT. Sinc...
This work presents the development of the URDU.KON-TB treebank, its annotation evaluation & guidelin...