Annotated corpora such as treebanks are important for developing parsers and language applications, as well as for understanding the language itself. Very few languages possess these scarce resources. In this paper, we describe our effort to syntactically annotate a small corpus (600 sentences) of the Tamil language. Our annotation is similar to that of the Prague Dependency Treebank (PDT 2.0) and consists of two layers: (i) the morphological layer (m-layer) and (ii) the analytical layer (a-layer). For both layers, we introduce annotation schemes, i.e. positional tagging for the m-layer and dependency relations (and how dependency structures should be drawn) for the a-layer. Finally, we evaluate our corpus in the tagging and parsing tasks using well...
TamilTB 1.0 improves over the very first release of the Tamil dependency treebank (TamilTB v0.1). Th...
Proceedings of the Ninth International Workshop on Treebanks and Linguistic Theories. Editors: Mar...
The paper describes an approach to expedite the process of manual annotation of a Hindi dependency t...
This talk is aimed at presenting our ongoing effort to build a PDT style dependency treebank for Tam...
Tamil Dependency Treebank version 0.1 (TamilTB.v0.1) is an attempt to develop a syntactically annota...
TamilTB is the first published syntactically annotated corpus of Tamil. TamilTB will allow a more rapi...
In this paper, we describe the annotation and development of Telugu treebank following the Universal...
Corpora are fundamental tools for Natural Language Processing. Part of Speech tagging provides morem...
The paper introduces a dependency annotation effort which aims to fully annotate a million word Hind...
Key to fast adaptation of language technologies for any language hinges on the availability of funda...
Abstract: Language syntax and semantics can be recorded in various forms like grammar rules, diction...
Kashmiri is a resource-poor language with very few computational and language resources available f...
This paper presents an open source and extendable Morphological Analyser cum Generator (MAG) for Tam...
The Paninian Grammar framework, given by Panini for his analysis of Sanskrit Language, is finding it...
This thesis presents a syntactic description of spoken Tamil, based on the author's own speech. The ...