This work was accepted at the Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, co-located with ACL 2017. The last decade saw a surge in digitisation efforts for ancient manuscripts in Sanskrit. Owing to various linguistic peculiarities inherent to the language, even preliminary tasks such as word segmentation are non-trivial in Sanskrit. Elegant models for word segmentation in Sanskrit are indispensable for further syntactic and semantic processing of the manuscripts. Current work on word segmentation for Sanskrit, though commendable in its novelty, often varies in its objectives and evaluation criteria. In this work, we set the record straight. We formally define...
This work was accepted at TextGraphs - 17, co-located with ACL 2017 (http://acl2017.org/). Derivational...
Pali Sandhi is a phonetic transformation from two words into a new word. The phonemes of the neighbo...
One of the important features of the Sanskrit language is its long tradition of lexicons. The early sour...
This is a proof-of-concept Sanskrit corpus developed for the study of Buddhist Sanskrit lexicology. ...
Sanskrit is one of the most ancient attested Indo-European languages, and it has one of the oldest l...
We describe an innovative computer interface designed to assist annotators in the efficient selectio...
Lexical datasets containing annotated concordances of words pertaining to the conceptual domains of ...
This is a Sanskrit corpus developed at the Mangalam Research Center (Berkeley, California) for the s...
This paper focuses on the classifications of words which were elaborated in th...
This is the repository for word segmentation in Sanskrit using energy-based models. # Word Segme...
Sanskrit has a rich source of lexical resources in the form of various kinds of dictionaries, and a ...
Sanskrit (संस्कृत), sometimes referred to as the mother of all Indian languages, has a significant ...