Pali Sandhi is a phonetic transformation that merges two words into a new word: the phonemes of the neighbouring words are changed and merged. Pali Sandhi word segmentation is more challenging than Thai word segmentation because Pali is a highly inflected language. This study proposes a novel approach that predicts splitting locations by classifying sample Sandhi words into five classes with a bidirectional long short-term memory model. We then applied the classified rules to rectify the words at the splitting locations. We identified 6,345 Pali Sandhi words from the Dhammapada Atthakatha. We evaluated the performance of the proposed model on the basis of the accuracy of the splitting locations and compared the results with the dataset. Results showe...
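The split-then-rectify step described in the abstract can be illustrated with a minimal sketch. It assumes a classifier has already produced a splitting location and a rule label for a Sandhi word. The function name `rectify_split` and the rule labels `none`, `elided_a`, and `lengthened_a` are hypothetical and chosen for illustration; they are not the five classes used in the study, and no model is trained here.

```python
import unicodedata

LONG_A = "\u0101"  # the long vowel "a with macron"


def rectify_split(word, location, rule):
    """Split `word` at `location` and undo the sandhi change named by `rule`.

    `location` and `rule` stand in for a classifier's prediction.
    The rule labels are hypothetical, not the study's class inventory.
    """
    # Normalise so that each accented letter is a single code point.
    word = unicodedata.normalize("NFC", word)
    left, right = word[:location], word[location:]

    if rule == "none":
        # Plain concatenation: the split alone recovers both words.
        return left, right
    if rule == "elided_a":
        # The final short "a" of the first word was dropped: restore it.
        return left + "a", right
    if rule == "lengthened_a":
        # a + a merged into one long vowel at `location`:
        # give a short "a" back to each side.
        if not right.startswith(LONG_A):
            raise ValueError("expected a long vowel at the splitting location")
        return left + "a", "a" + right[1:]
    raise ValueError(f"unknown rule: {rule}")


# natthi = na + atthi (the final vowel of "na" is elided)
print(rectify_split("natthi", 1, "elided_a"))  # ('na', 'atthi')
```

Keeping the location and the rule as separate inputs mirrors the two stages named in the abstract: the model predicts where to split, and a rule then restores the original word forms.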
Myanmar script has no fixed delimiters between words or syllables. Therefore, to achieve meaningful ...
For languages without word boundary delimiters, dictionaries are needed for segmenting running texts...
Written Thai is one of the languages that do not have word boundaries. In order to di...
The work was accepted at the Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage...
Word segmentation is a basic task and an important problem in natural language processing. In Myanmar ...
The aim of this paper is to assist Buddhists and the new Buddhist students who are unfamiliar with som...
Myanmar language has long been influenced by the usage of Pali words since ancient times. This study p...
This study reports the development of a Myanmar word segmentation method using Unicode standard enco...
Word segmentation is a problem in several Asian languages that have no explicit word boundary delimi...
For any Indian language, the accuracy of the morphological analyser depends on the pre-edition of t...
Word segmentation is a basic task and an important problem in natural language processing. In Myanmar...
A dictionary-based automatic syllabification tool has been developed for speech synthesis in Sinhala lan...
In the Thai language, word boundaries are not explicitly marked; therefore, word segmentation is needed ...
Myanmar sentences are written as contiguous sequences of syllables with no characters delimiting the w...
This study develops a word segmentation algorithm and solution for the Myanmar language. This is a ...