<p>The work was accepted in Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, colocated with ACL 2017</p> <p>The last decade saw a surge in digitisation efforts for ancient manuscripts in Sanskrit. Due to various linguistic peculiarities inherent to the language, even the preliminary tasks such as word segmentation are non-trivial in Sanskrit. Elegant models for Word Segmentation in Sanskrit are indispensable for further syntactic and semantic processing of the manuscripts. Current works in word segmentation for Sanskrit, though commendable in their novelty, often have variations in their objective and evaluation criteria. In this work, we set the record straight. We forma...
Pali Sandhi is a phonetic transformation from two words into a new word. The phonemes of the neighbo...
Sanskrit has a rich source of lexical resources in the form of various kinds of dictionaries, and a ...
Dravidian languages, such as Kannada and Tamil, are notoriously difficult to translate by state-of-t...
The work was accepted in Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, S...
This is a proof-of-concept Sanskrit corpus developed for the study of Buddhist Sanskrit lexicology. ...
Sanskrit is one of the most ancient attested Indo-European languages, and it has one of the oldest l...
This is a proof-of-concept Sanskrit corpus developed for the study of Buddhist Sanskrit lexicology. ...
We describe an innovative computer interface designed to assist annotators in the efficient selectio...
Lexical datasets containing annotated concordances of words pertaining to the conceptual domains of ...
Sanskrit (संस्कृत), sometimes referred to as the mother of all Indian languages, has a significant ...
This is a Sanskrit corpus developed at the Mangalam Research Center (Berkeley, California) for the s...
This is the repository for word segmentation in sanskrit using energy based models. # Word Segme...
International audienceThis paper focuses on the classifications of words which were elaborated in th...
The work is accepted at TextGraphs - 17 colocated with ACL 2017 (http://acl2017.org/) Derivational...
Pali Sandhi is a phonetic transformation from two words into a new word. The phonemes of the neighbo...
Sanskrit has a rich source of lexical resources in the form of various kinds of dictionaries, and a ...
Dravidian languages, such as Kannada and Tamil, are notoriously difficult to translate by state-of-t...
The work was accepted in Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, S...
This is a proof-of-concept Sanskrit corpus developed for the study of Buddhist Sanskrit lexicology. ...
Sanskrit is one of the most ancient attested Indo-European languages, and it has one of the oldest l...
This is a proof-of-concept Sanskrit corpus developed for the study of Buddhist Sanskrit lexicology. ...
We describe an innovative computer interface designed to assist annotators in the efficient selectio...
Lexical datasets containing annotated concordances of words pertaining to the conceptual domains of ...
Sanskrit (संस्कृत), sometimes referred to as the mother of all Indian languages, has a significant ...
This is a Sanskrit corpus developed at the Mangalam Research Center (Berkeley, California) for the s...
This is the repository for word segmentation in sanskrit using energy based models. # Word Segme...
International audienceThis paper focuses on the classifications of words which were elaborated in th...
The work is accepted at TextGraphs - 17 colocated with ACL 2017 (http://acl2017.org/) Derivational...
Pali Sandhi is a phonetic transformation from two words into a new word. The phonemes of the neighbo...
Sanskrit has a rich source of lexical resources in the form of various kinds of dictionaries, and a ...
Dravidian languages, such as Kannada and Tamil, are notoriously difficult to translate by state-of-t...